Home > Features
uni-1 Features & Capabilities
Everything you can do with uni-1's unified AI — from reasoning-driven generation to conversational editing and 76+ art styles.
Unified Architecture
Traditional image pipelines chain separate models — a language model parses the prompt, a diffusion model generates the image, and post-processors refine it. uni-1 collapses all of this into one unified transformer that thinks and creates simultaneously.
Traditional Pipeline
Information lost as each model interprets the previous one's output.
uni-1 Unified Model
uni-1
Unified Transformer
Single model. Zero information loss. Reasoning and creation are inseparable.
Visual Reasoning: Thinking While Drawing
uni-1 does not just interpret your prompt — it reasons about it. Before generating a single pixel, it decomposes complex instructions into sub-tasks, evaluates spatial and logical constraints, and plans the composition.
- Breaks down complex multi-element prompts into actionable sub-goals
- Handles contradictory or ambiguous instructions gracefully
- Logical reasoning score 2.1× higher than GPT-4o (0.52 vs 0.25)
- Understands spatial relationships: behind, above, partially obscured by

High-Quality Image Generation
uni-1 produces images at up to 2K resolution with precise prompt adherence and exceptional detail fidelity. Complex scenes with multiple interacting subjects, accurate perspective, and coherent lighting are handled reliably.
- 2K (2048×2048) maximum resolution output
- Accurate multi-subject scene composition
- Precise lighting simulation: natural studio atmospheres
- Minimal artifacts at high detail density

Multi-turn Conversational Image Editing
Refine your image through natural conversation. uni-1 maintains full context across the entire editing session — each follow-up message builds on the previous state without resetting.
- Full context retention across the entire conversation
- Alter specific elements without affecting the rest
- Progressive refinement: make minor adjustments across many turns
- Supports style, content, lighting, and composition edits simultaneously

Perfect Text in Generated Images
Text rendering has historically been the weakness of AI image generators. uni-1 eliminates this problem entirely with precise typographic control and multilingual support.
- Zero spelling errors in rendered text
- Supports Latin, Chinese, Japanese, Arabic, and more
- Complex typographic layouts with correct kerning
- Natural integration of text into scenes and compositions

76+ Artistic Styles
uni-1 ships with an extensive built-in style vocabulary covering fine art movements, photography genres, illustration techniques, and digital art aesthetics.
- 76+ style presets from photorealism to pixel art
- Fine art movements: Impressionism, Cubism, Surrealism, Expressionism
- Photography styles: film noir, high fashion, documentary, macro
- Mix styles with weighted modifiers: "70% watercolor, 30% digital concept art"

Multi-image Composition
Upload up to four reference images to guide uni-1's output. The model synthesizes style, character design, environmental elements, and color palettes from your references into a coherent new image.
- Accepts up to 4 reference images simultaneously
- Merges character designs from multiple sources while maintaining coherence
- Style transfer from references: generate in the style of fine art or concept work
- Environment combination: blend landscapes, interiors, and backgrounds from references

Start Creating with Uni-1
Free to try — no account needed. See what reasoning-first AI can do for your next project.