The hidden potential in diffusion models’ scaling space | by mike

Meet FreSca: a generalizable plug-and-play enhancement for diffusion fashions

Diffusion fashions have emerged as highly effective instruments for picture era and enhancing, however their full potential stays untapped, particularly in what researchers name the “scaling area.” This largely unexplored space — the place noise predictions are adjusted via scaling components — holds important promise for enhancing each picture enhancing and understanding duties.

FreSca, launched on this analysis, examines how the distinction between conditional and unconditional noise predictions (Δϵ) encodes task-specific data in diffusion fashions. Via Fourier analysis, the researchers uncovered that low-frequency and high-frequency parts evolve otherwise all through the diffusion course of. Low-frequency parts govern structural layouts whereas high-frequency parts encode fine-grained textures.

The important thing innovation of FreSca lies in its means to use steering scaling independently to completely different frequency bands within the Fourier area. This method enhances present picture enhancing strategies with out requiring retraining and extends successfully to picture understanding duties like depth estimation.

FreSca: A Generalizable Plug-and-Play Enhancement for Diffusion Fashions displaying each depth estimation enhancements (prime) and picture enhancing enhancements (backside).

Diffusion fashions have revolutionized content material era by progressively denoising random noise into coherent knowledge samples. Their versatility spans from picture synthesis to video manufacturing, with two major software domains examined on this analysis.

Approaches to picture enhancing utilizing diffusion fashions may be broadly categorized into two varieties: strategies that fine-tune or management diffusion fashions for particular enhancing duties (like DreamBooth, Null-text Inversion, and InstructPix2Pix), and…

Source link

Reinforcement Learning in the Age of Modern AI | by @pramodchandrayan | Jul, 2025

Finding the right tool for the job: Visual Search for 1 Million+ Products | by Elliot Ford | Kingfisher-Technology | Jul, 2025

Meanwhile in Europe: How We Learned to Stop Worrying and Love the AI Angst | by Andreas Maier | Jul, 2025

People are using AI to ‘sit’ with them while they trip on psychedelics

I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

Amazon and eBay to pay ‘fair share’ for e-waste recycling

Artificial Intelligence Concerns & Predictions For 2025

Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

Most Popular

Probability and Bayes’ Rule in Robot Localization | by Sophie Zhao | Jun, 2025

SwitchBot K20+ Pro Modular Home Robot at CES

TikTok Starts Working Again After Trump Says He Will Stall a Ban

Our Picks

People are using AI to ‘sit’ with them while they trip on psychedelics

Reinforcement Learning in the Age of Modern AI | by @pramodchandrayan | Jul, 2025

How This Man Grew His Beverage Side Hustle From $1k a Month to 7 Figures

The hidden potential in diffusion models’ scaling space | by mike | Apr, 2025

Meet FreSca: a generalizable plug-and-play enhancement for diffusion fashions

Related Posts