
Qwen Image 2.0 Pro

Highest-fidelity image generation for final production assets

About model

Qwen Image 2.0 Pro is the highest-fidelity variant of Alibaba's unified image generation and editing model, delivering stronger detail, composition, and text rendering than the standard tier. Built for final production assets where visual quality takes priority over generation speed, it produces native 2K output with professional-grade typography in English and Chinese across infographics, posters, and marketing materials.

  • Fidelity Tier: Pro. Stronger detail and composition for final production assets
  • Native Resolution: 2K. Publication-ready output at 2048x2048
  • Asset Quality: Production. Optimized for final deliverables over rapid iteration

Model key capabilities
  • Highest Fidelity: Stronger detail, composition, and text rendering than the standard tier for production-grade output
  • Production Asset Quality: Optimized for final deliverables, print-ready materials, and client-facing visuals where accuracy takes priority
  • Unified Generation & Editing: Same architecture as Qwen Image 2.0 with enhanced fidelity for text-to-image and reference-based editing workflows
  • Professional Typography: Text rendering in English and Chinese optimized for publication-ready infographics, posters, and marketing materials
  • API usage

    • cURL
    • Python
    • TypeScript

    Endpoint:

    Qwen/Qwen-Image-2.0-Pro

    cURL:

    curl -X POST "https://api.together.xyz/v1/images/generations" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "Qwen/Qwen-Image-2.0-Pro",
        "prompt": "Draw an anime style version of this image.",
        "width": 1024,
        "height": 768,
        "steps": 28,
        "n": 1,
        "response_format": "url",
        "image_url": "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png"
      }'
    
    Python:

    from together import Together

    # The client reads TOGETHER_API_KEY from the environment.
    client = Together()

    response = client.images.generate(
        model="Qwen/Qwen-Image-2.0-Pro",
        prompt="Draw an anime style version of this image.",
        width=1024,
        height=768,
        steps=28,
        image_url="https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png",
    )

    # Print the URL of the first generated image.
    print(response.data[0].url)

    TypeScript:

    import Together from "together-ai";

    // The client reads TOGETHER_API_KEY from the environment.
    const together = new Together();

    async function main() {
      const response = await together.images.create({
        model: "Qwen/Qwen-Image-2.0-Pro",
        prompt: "Draw an anime style version of this image.",
        width: 1024,
        height: 768,
        steps: 28,
        image_url: "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png",
      });

      // Print the URL of the first generated image.
      console.log(response.data[0].url);
    }

    main();
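
The Python and TypeScript samples above print only `data[0].url`. When a request asks for more than one image, every entry in `data` may be of interest. The following is a minimal sketch, assuming the decoded JSON body has the shape `{"data": [{"url": ...}, ...]}` implied by those samples. The helper name `extract_image_urls` and the example URLs are hypothetical and not part of the Together SDK.

```python
from typing import Any


def extract_image_urls(response: dict[str, Any]) -> list[str]:
    """Return every image URL found in a decoded JSON response body.

    Assumes the shape {"data": [{"url": "..."}, ...]}, which mirrors the
    `data[0].url` access in the samples on this page.
    """
    urls = []
    for item in response.get("data", []):
        url = item.get("url")
        if url:  # skip entries that carry no URL
            urls.append(url)
    return urls


# Hypothetical response body for a request that asked for two images.
example = {
    "data": [
        {"url": "https://example.com/image-0.png"},
        {"url": "https://example.com/image-1.png"},
    ]
}
print(extract_image_urls(example))
```

Returning a plain list keeps the helper independent of any HTTP client, so it works the same on the output of `curl` piped through a JSON parser or on a response decoded in application code.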
    
    
  • Model card

    Architecture Overview:
    • Highest-fidelity variant of the Qwen Image 2.0 family, optimized for production-quality output
    • Stronger detail, composition, and text rendering compared to the standard tier
    • Unified architecture for both text-to-image generation and instruction-based image editing
    • Native 2K resolution output (2048x2048) across flexible aspect ratios
    • Prompts up to 1,000 tokens for detailed scene and typography instructions
    • Reference image input for editing workflows
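
To illustrate what "2K across flexible aspect ratios" can mean in practice, the sketch below picks a width and height for a given aspect ratio without exceeding the pixel count of a 2048x2048 image. Treating that pixel count as a budget, rounding each side down to a multiple of 16, and the helper name `dims_for_aspect` are all assumptions made for illustration. The page does not state which individual side lengths the API accepts.

```python
import math

# Pixel count of the native 2048x2048 output, used here as an assumed budget.
MAX_PIXELS = 2048 * 2048


def dims_for_aspect(ratio_w: int, ratio_h: int, multiple: int = 16) -> tuple[int, int]:
    """Return the largest (width, height) at ratio_w:ratio_h within MAX_PIXELS.

    Each side is rounded down to a multiple of `multiple` (an assumption),
    so the result never exceeds the budget.
    """
    # Real-valued size that uses exactly MAX_PIXELS at this ratio.
    height = math.sqrt(MAX_PIXELS * ratio_h / ratio_w)
    width = height * ratio_w / ratio_h
    # Round both sides down so the product stays within the budget.
    w = int(width) // multiple * multiple
    h = int(height) // multiple * multiple
    return w, h


print(dims_for_aspect(1, 1))
print(dims_for_aspect(16, 9))
```

Rounding down rather than to the nearest multiple trades a few unused pixels for the guarantee that the request never goes over the budget.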

    Training Methodology:
    • Built on the same 8B Qwen3-VL encoder and 7B diffusion decoder architecture as Qwen Image 2.0
    • Optimized inference pipeline delivering higher fidelity textures, lighting, and compositional accuracy
    • Enhanced typography handling for complex multilingual text in professional design contexts

    Performance Characteristics:
    • Higher visual fidelity than standard Qwen Image 2.0 with richer textures, sharper detail, and improved composition
    • Stronger text rendering for production-grade infographics, posters, and marketing materials
    • Designed for final asset delivery where detail and accuracy take priority over generation speed
    • Multiple output generation with seed-based reproducibility

  • Prompting

    Together AI API Access:
    • Access Qwen Image 2.0 Pro via Together AI APIs using the endpoint Qwen/Qwen-Image-2.0-Pro
    • Authenticate using your Together AI API key in request headers
    • Control output dimensions with the width and height parameters; total pixels (width times height) must fall between 262,144 (512x512) and 4,194,304 (2048x2048)
    • Use the reference_images array for image editing workflows
    • Supports seed for reproducibility, negative prompts, and multiple outputs per request
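
The options listed above can be combined into one request body. The following is a minimal sketch, not the official client: `build_payload` is a hypothetical helper, and the field names `seed` and `negative_prompt` are assumptions based on the features named in the list. `n` appears in the cURL sample and `reference_images` is the field named above. The API usage samples on this page pass a single `image_url` instead, so check the API reference for the field this model expects.

```python
from typing import Optional, Sequence

MIN_PIXELS = 262_144    # 512 * 512, lower bound stated on this page
MAX_PIXELS = 4_194_304  # 2048 * 2048, upper bound stated on this page


def build_payload(
    prompt: str,
    width: int,
    height: int,
    *,
    n: int = 1,
    seed: Optional[int] = None,
    negative_prompt: Optional[str] = None,
    reference_images: Optional[Sequence[str]] = None,
) -> dict:
    """Build a JSON-serializable request body, rejecting out-of-range sizes."""
    pixels = width * height
    if not MIN_PIXELS <= pixels <= MAX_PIXELS:
        raise ValueError(
            f"{width}x{height} = {pixels} pixels; "
            f"expected {MIN_PIXELS} to {MAX_PIXELS}"
        )
    payload = {
        "model": "Qwen/Qwen-Image-2.0-Pro",
        "prompt": prompt,
        "width": width,
        "height": height,
        "n": n,
    }
    # Optional fields are included only when set (names are assumptions).
    if seed is not None:
        payload["seed"] = seed
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    if reference_images:
        payload["reference_images"] = list(reference_images)
    return payload


print(build_payload("A bilingual event poster", 1024, 768, seed=42, n=2))
```

Validating the pixel count before sending the request surfaces a size error locally instead of as a failed API call.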

  • Applications & use cases

    Final Production Assets:
    • Publication-ready marketing collateral with maximum detail and text accuracy
    • Print-resolution posters, banners, and editorial visuals requiring professional-grade fidelity
    • Client-facing presentations and infographics where quality cannot be compromised

    Professional Design:
    • Product photography and ecommerce hero images with accurate labels and packaging text
    • Brand assets with precise multilingual typography and compositional accuracy
    • Storyboarding and concept art with production-level detail

    Image Editing:
    • Reference-based editing with improved fidelity preservation for production workflows
    • Style transfer, object manipulation, and text editing within images
    • Visual refinement and detail enhancement for final deliverables

Model details
  • Model provider
    Qwen
  • Type
    Image
  • Main use cases
    Image Generation
  • Resolution/Duration
    512x512 to 2048x2048
  • Deployment
    Serverless
  • Parameters
    7B
  • Price
    $0.08 / image
  • Input modalities
    Text
    Image
  • Output modalities
    Image
  • Released
    February 9, 2026
  • Category
    Image
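
For budgeting, the listed price of $0.08 per image can be turned into a quick estimate. The sketch below assumes every generated image is billed at that flat rate, including each output of a multi-image request, which this page does not state explicitly. `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

# Listed price in USD per generated image.
PRICE_PER_IMAGE = Decimal("0.08")


def estimate_cost(requests: int, images_per_request: int = 1) -> Decimal:
    """Estimate total cost in USD, assuming a flat per-image rate."""
    return PRICE_PER_IMAGE * requests * images_per_request


# 250 requests with 4 images each.
print(estimate_cost(250, 4))
```

`Decimal` is used instead of `float` so that sums of cent amounts stay exact.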