A comprehensive, curated list of State-of-the-Art (SOTA) Image Generation models, Generative AI research, and text-to-image tools. Stay updated with the latest advancements in Stable Diffusion, FLUX.1, Midjourney, and DALL-E.
- ๐ฅ FLUX.1 by Black Forest Labs (Aug 2024): The new benchmark for open-weights models. Features a 12B parameter Rectified Flow Transformer for elite prompt adherence.
- ๐จ Midjourney V7 (April 2025): Revolutionizing consistency with Omni Reference and blazing speed with Draft Mode.
- โก Stable Diffusion 3.5 (Oct 2024): Stability AI's flagship release featuring MMDiT architecture, perfect for local fine-tuning and LoRA development.
- ๐ค Google Imagen 4 (2025): Professional-grade photorealism and advanced typography, now native in the Gemini ecosystem.
- ๐ฌ OpenAI GPT-4o Multimodal (2024): Native, conversational image generation and iterative editing within ChatGPT.
- ๐ ๏ธ r/StableDiffusion: The heart of local AI generation and ComfyUI workflows.
- ๐ฌ r/MachineLearning: Deep dives into Rectified Flow and Diffusion Transformer (DiT) papers.
- ๐ The rise of ComfyUI: Why node-based pipelines are the future of professional AI art.
| Model Name | Year | Pretrained Weights | Codebase | Research Paper | Quality | License |
|---|---|---|---|---|---|---|
| Flux.1 [schnell] ๐ | 2024 | ๐ค HF Hub | ๐ป GitHub | ๐ Report | SOTA | Apache 2.0 |
| Stable Diffusion 3.5 โก | 2024 | ๐ค HF Hub | ๐ป GitHub | ๐ 2403.03206 | SOTA | Community |
| DALL-E 3 ๐ง | 2023 | API Only | Proprietary | ๐ System Card | SOTA | Proprietary |
| Midjourney V7 ๐๏ธ | 2025 | Web/Discord | Proprietary | -- | SOTA | Proprietary |
| Janus-Pro (7B) ๐ | 2025 | ๐ค HF Hub | ๐ป GitHub | ๐ 2501.14691 | A+ | MIT |
| Recraft V3 ๐ | 2024 | API/Web | Proprietary | -- | A+ | Commercial |
| UNIT (Legacy) ๐๏ธ | 2017 | ๐ป Model | ๐ป Code | ๐ 1703.00848 | B | CC Non-Comm |
- โจ Black Forest Labs: Experience the power of Flux.
- ๐ผ๏ธ Civitai: The ultimate repository for Stable Diffusion checkpoints, LoRAs, and embeddings.
- ๐ฎ Midjourney Explore: Interactive gallery and web generation.
- ๐งช Stability AI Platform: API access for SD3 and SDXL.
- ๐ Hugging Face Daily Papers: The pulse of the AI research community.
- ๐ ArXiv: Computer Vision: Latest pre-prints in image synthesis.
- ๐ง Arxiv-sanity-lite: A better way to browse and search AI papers.
If this consolidation helps your research or creative work, please consider supporting the maintenance of this landscape:
- โ Support via PayPal
- ๐ช Bitcoin:
3LZazKXG18Hxa3LLNAeKYZNtLzCxpv1LyD
Maintained by ishandutta2007. PRs are always welcome!