Home > Features

uni-1 Features & Capabilities

Everything you can do with uni-1's unified AI — from reasoning-driven generation to conversational editing and 76+ art styles.

Unified Architecture

Traditional image pipelines chain separate models — a language model parses the prompt, a diffusion model generates the image, and post-processors refine it. uni-1 collapses all of this into one unified transformer that thinks and creates simultaneously.

Traditional Pipeline

1Language Model (parse)→

2CLIP Encoder (embed)→

3Diffusion Model (generate)→

4Post-Processor (upscale)

Information lost as each model interprets the previous one's output.

uni-1 Unified Model

uni-1

Unified Transformer

InterpretationReasoningCreationRefinement

Single model. Zero information loss. Reasoning and creation are inseparable.

Visual Reasoning

Visual Reasoning: Thinking While Drawing

uni-1 does not just interpret your prompt — it reasons about it. Before generating a single pixel, it decomposes complex instructions into sub-tasks, evaluates spatial and logical constraints, and plans the composition.

Breaks down complex multi-element prompts into actionable sub-goals
Handles contradictory or ambiguous instructions gracefully
Logical reasoning score 2.1× higher than GPT-4o (0.52 vs 0.25)
Understands spatial relationships: behind, above, partially obscured by

Visual Reasoning: Thinking While Drawing

High-Quality Generation

High-Quality Image Generation

uni-1 produces images at up to 2K resolution with precise prompt adherence and exceptional detail fidelity. Complex scenes with multiple interacting subjects, accurate perspective, and coherent lighting are handled reliably.

2K (2048×2048) maximum resolution output
Accurate multi-subject scene composition
Precise lighting simulation: natural studio atmospheres
Minimal artifacts at high detail density

Conversational Editing

Multi-turn Conversational Image Editing

Refine your image through natural conversation. uni-1 maintains full context across the entire editing session — each follow-up message builds on the previous state without resetting.

Full context retention across the entire conversation
Alter specific elements without affecting the rest
Progressive refinement: make minor adjustments across many turns
Supports style, content, lighting, and composition edits simultaneously

Text Rendering

Perfect Text in Generated Images

Text rendering has historically been the weakness of AI image generators. uni-1 eliminates this problem entirely with precise typographic control and multilingual support.

Zero spelling errors in rendered text
Supports Latin, Chinese, Japanese, Arabic, and more
Complex typographic layouts with correct kerning
Natural integration of text into scenes and compositions

Art Styles

76+ Artistic Styles

uni-1 ships with an extensive built-in style vocabulary covering fine art movements, photography genres, illustration techniques, and digital art aesthetics.

76+ style presets from photorealism to pixel art
Fine art movements: Impressionism, Cubism, Surrealism, Expressionism
Photography styles: film noir, high fashion, documentary, macro
Mix styles with weighted modifiers: "70% watercolor, 30% digital concept art"

Multi-image Composition

Upload up to four reference images to guide uni-1's output. The model synthesizes style, character design, environmental elements, and color palettes from your references into a coherent new image.

Accepts up to 4 reference images simultaneously
Merges character designs from multiple sources while maintaining coherence
Style transfer from references: generate in the style of fine art or concept work
Environment combination: blend landscapes, interiors, and backgrounds from references

Start Creating with Uni-1

Free to try — no account needed. See what reasoning-first AI can do for your next project.

Start Generating Free