May 25, 2025
Alchemist: Turning Public Text-to-Image Data into Generative Gold
Pre-training equips text-to-image (T2I) models with broad world knowledge, but this alone is often…
May 24, 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
GPT-4o-level stylization consistency using only 2,600 pairs + 500 GPU hours!
May 6, 2025
New Tools for Brand Style Consistency and Control
Built to help creative teams explore, define, and apply a distinct brand style across every…
April 25, 2025
Step1X-Edit: A Practical Framework for General Image Editing
We release a state-of-the-art image editing model, Step1X-Edit, which can provide comparable…
April 24, 2025
DreamO: A Unified Framework for Image Customization
We propose DreamO, a unified image customization framework, which covers ID, IP, try-on, and style…
April 24, 2025
JSON Style Templates for ChatGPT
Launching: JSON Visuals for ChatGPT. 50+ unique aesthetic codes, with attribute randomiser to…
April 21, 2025
DreamID achieves high-fidelity face swapping
DreamID: A Fast and High-Fidelity diffusion-based Face Swapping via Triplet ID Group Learning
April 19, 2025
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Instead of relying on artist names or style descriptions, SliderSpace can automatically map out how…