Out

Chat

disable

Wan 2.5 Preview

Its flexible dimension support and high-quality output make it ideal for use in creative apps, marketing tools, content management systems, and design software.

Free $1 Tokens for New Members

Text to Speech

Javascript

Python

                                        const main = async () => {
  const response = await fetch('https://api.ai.cc/v1/images/generations', {
    method: 'POST',
    headers: {
      Authorization: 'Bearer ',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'alibaba/wan2.5-t2i-preview',
      prompt: 'A jellyfish in the ocean',
    }),
  }).then((res) => res.json());

  console.log('Generation:', response);
};

main();

                                        import requests


def main():
    response = requests.post(
        "https://api.ai.cc/v1/images/generations",
        headers={
            "Authorization": "Bearer ",
            "Content-Type": "application/json",
        },
        json={
            "model": "alibaba/wan2.5-t2i-preview",
            "prompt": "A jellyfish in the ocean",
        },
    )

    response.raise_for_status()
    data = response.json()

    print("Generation:", data)


if __name__ == "__main__":
    main()

Docs

300+ AI Models for OpenClaw & AI Agents

Save 20% on Costs & $1 Free Tokens

Get API Key Explore Models

Wan 2.5 Preview

Product Detail

✨ Wan 2.5 Preview represents the cutting-edge in text-to-image generation, building upon the successful Wan series. This iteration introduces significant enhancements, primarily the removal of previous restrictions on image side length, granting users unparalleled flexibility in pixel dimension choices within a defined pixel area. It masterfully blends advanced AI architecture with meticulous pixel-level control to generate diverse, highly detailed, and high-fidelity visuals from simple textual prompts.

🔧 Technical Specifications

Model Type: Text-to-Image generative model
Architecture: Advanced diffusion-based generative network
Input: Text prompts in natural language
Output: Variable resolution images, any dimension within supported pixel range
Training Data: Diverse multimodal dataset, including art, photos, and digital illustrations
Languages Supported: Primarily English, adaptable to other languages with tokenization

📈 Performance Benchmarks

FID Score (Fréchet Inception Distance): 13.5 on standard image generation benchmarks, indicating high realism and quality.
Inference Speed: Average generation time of 4 seconds per 512x512 image on modern GPUs.
Memory Usage: Optimized to run on 12GB and above GPU VRAM configurations.
Resolution Support: Successfully generates images up to 4K and beyond without quality degradation.
Diversity: Generates a wide range of unique images for the same prompt, supporting creative exploration.

💲 API Pricing

Only $0.0315 per image

🔑 Key Features

High-Quality Detail: Produces crisp and intricate image features across various styles and subject matters.
Flexible Style Adaptation: Capable of generating artistic, realistic, or stylized images based on prompt context.
Fast Inference: Efficient model design enables quicker image generation compared to previous versions.
Scalable Resolution: Suitable for small digital thumbnails up to large-scale prints and presentations.

🚀 Use Cases

Digital Art Creation: Perfect for artists seeking custom artwork in any size and style.
Marketing & Advertising: Quickly produce high-quality visuals tailored to campaign needs.
Content Generation: Enhance blogs, social media, and websites with unique images.
Prototyping & Design: Generate concept art and product visuals during early development stages.
Educational Materials: Create engaging illustrations or infographics for teaching resources.
Entertainment & Media: Use for storyboarding, character concepting, and visual effects assets.

💻 Code Sample

<snippet data-name="image.flux" data-model="alibaba/wan2.5-t2i-preview"></snippet>

🔄 Comparison with Other Models

vs Stable Diffusion: Wan 2.5 is optimized for high-resolution images with fast inference and consistent quality at large sizes, while Stable Diffusion sometimes experiences quality degradation when scaling up.

vs DALL·E 3: Wan 2.5 Preview provides flexible dimension control enabling users to adapt output sizes freely, making it particularly advantageous for specialized design and print applications.

vs Midjourney: Wan 2.5 Preview is more versatile in dimension customization and supports both stylized and photorealistic outputs with rapid generation, appealing to users needing size flexibility without sacrificing detail.

vs Imagen: Wan 2.5 Preview surpasses Imagen by allowing free selection of image dimensions within pixel area limits, providing more adaptability for diverse use cases and print-ready results.

💭 Frequently Asked Questions (FAQ)

What is Wan 2.5 Preview?

Wan 2.5 Preview is the latest iteration of the Wan series text-to-image models, renowned for high-fidelity image generation from text prompts. Its key innovation is the removal of previous restrictions on image side length, offering flexible and unrestricted pixel dimension choices within a defined pixel area.

How does Wan 2.5 Preview compare to other leading models?

Wan 2.5 Preview stands out with its optimization for high-resolution images, fast inference, and consistent quality at large sizes, addressing quality degradation sometimes seen in Stable Diffusion when scaling. Compared to DALL·E 3, Midjourney, and Imagen, Wan 2.5 offers superior flexible dimension control, making it highly advantageous for specialized design, print applications, and versatile output customization without sacrificing detail.

What are the primary use cases for Wan 2.5 Preview?

It's ideal for a wide range of applications including digital art creation, marketing and advertising visuals, general content generation for blogs and social media, prototyping and design, educational materials, and entertainment and media production like storyboarding and visual effects.

What is the API pricing for Wan 2.5 Preview?

The API for Wan 2.5 Preview is priced at an accessible $0.0315 per image generated.

What are the key performance metrics of Wan 2.5 Preview?

It boasts an FID Score of 13.5 (high quality), an average inference speed of 4 seconds per 512x512 image, optimized memory usage for 12GB+ GPU VRAM, and supports resolutions up to 4K and beyond without quality degradation. It also excels in generating diverse images for the same prompt.

AI Playground

Test all API models in the sandbox environment before you integrate. We provide more than 300 models to integrate into your app.

Try For Free

300+ AI Models for
OpenClaw & AI Agents

Save 20% on Costs

Free $1 Tokens for New Members