تخطي إلى المحتوى الرئيسي

قاموس

ما هو Diffusion Model؟

The AI architecture behind most image generators (Stable Diffusion, DALL-E, Midjourney) — generates images by progressively denoising random noise.

Diffusion models flipped image generation on its head in 2021-2022. The core idea: train a model to remove noise from an image; at generation time, start with pure random noise and run the denoiser many times (typically 20-50 steps), conditioning on a text prompt at each step. The model gradually 'sees' the prompted image emerging from the noise. Stable Diffusion, DALL-E 2/3, Midjourney, Adobe Firefly, and Imagen are all diffusion models. The architecture's strengths: photorealistic output, strong style control, manageable compute. Variants: latent diffusion (operates in compressed image space — much faster), conditional diffusion (controls like ControlNet), and video diffusion (Sora, Veo).

مصطلحات ذات صلة

العودة إلى قاموس AI