



// JavaScript (Node.js 18+ or a browser, where fetch is built in)
const main = async () => {
  const response = await fetch('https://api.ai.cc/v2/generate/video/alibaba/generation', {
    method: 'POST',
    headers: {
      // Append your API key after "Bearer ".
      Authorization: 'Bearer ',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'alibaba/wan-25-preview/text-to-video',
      prompt: 'A DJ on the stand is playing, around a World War II battlefield, lots of explosions, thousands of dancing soldiers, between tanks shooting, barbed wire fences, lots of smoke and fire, black and white old video: hyper realistic, photorealistic, photography, super detailed, very sharp, on a very white background',
      aspect_ratio: '16:9',
    }),
  }).then((res) => res.json());

  console.log('Generation:', response);
};

main();

# Python (requires the requests package)
import requests


def main():
    url = "https://api.ai.cc/v2/generate/video/alibaba/generation"
    payload = {
        "model": "alibaba/wan-25-preview/text-to-video",
        "prompt": "A DJ on the stand is playing, around a World War II battlefield, lots of explosions, thousands of dancing soldiers, between tanks shooting, barbed wire fences, lots of smoke and fire, black and white old video: hyper realistic, photorealistic, photography, super detailed, very sharp, on a very white background",
        "aspect_ratio": "16:9",
    }
    # Append your API key after "Bearer ".
    headers = {"Authorization": "Bearer ", "Content-Type": "application/json"}
    response = requests.post(url, json=payload, headers=headers)
    print("Generation:", response.json())


if __name__ == "__main__":
    main()


Product Detail
Wan 2.5 is an AI video generation model that produces high-quality, photorealistic videos directly from text prompts, complete with synchronized audio. It offers native 4K support, cinematic camera controls, and natural motion synthesis.
Designed for creators who need professional-grade storytelling and emotional fidelity, Wan 2.5 generates multi-minute video clips with fluid motion and precise audio-visual synchronization.
Technical Specifications
- ✅ Frame Rate: Typically 24 fps, the cinematic standard.
- ✅ Video Length: Generates videos up to several minutes long for continuous storytelling.
- ✅ Audio Support: Full audio integration allowing original sound input with precise lip-sync.
- ✅ Camera Controls: Pan, tilt, zoom, dolly, and rack focus for dynamic scene composition.
- ✅ Physics Engine: Advanced simulation for realistic motion and interaction effects.
Performance Benchmarks
- 🌟 Video Quality: Produces ultra-detailed, photorealistic videos with rich environmental and facial details.
- 🌟 Motion Smoothness: Superior motion stability with smooth transitions across both large and subtle movements.
- 🌟 Audio-Visual Sync: Robust one-pass synchronization of video with uploaded voice or sound effects, surpassing competitors like Google Veo 3.
- 🌟 Multilingual Performance: High accuracy lip-sync and voice matching across languages and accented speech.
- 🌟 Cost Efficiency: Lower computational cost than comparable high-end models on the market.
API Pricing
- 480p: $0.0525 / sec
- 720p: $0.105 / sec
- 1080p: $0.1575 / sec
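The per-second rates above translate into a per-clip cost with simple arithmetic. The following is a minimal sketch, assuming billing is strictly linear in clip duration with no rounding, minimum charge, or additional fees (the page does not say). The function name `estimate_cost` is hypothetical and not part of the API.

```python
from decimal import Decimal

# Per-second rates in USD, copied from the pricing list above.
RATES_USD_PER_SEC = {
    "480p": Decimal("0.0525"),
    "720p": Decimal("0.105"),
    "1080p": Decimal("0.1575"),
}


def estimate_cost(resolution: str, seconds: float) -> Decimal:
    """Estimate the cost of one clip in USD.

    Assumption: cost = rate * duration, with no rounding or minimum charge.
    """
    if resolution not in RATES_USD_PER_SEC:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Decimal avoids binary floating-point error in the currency arithmetic.
    return RATES_USD_PER_SEC[resolution] * Decimal(str(seconds))


if __name__ == "__main__":
    for resolution in RATES_USD_PER_SEC:
        print(resolution, "10 s:", estimate_cost(resolution, 10), "USD")
```

Under this assumption, a 10-second clip at 720p comes to 10 × $0.105 = $1.05.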
Key Features
- 💡 Text-to-Video Generation: Create videos from detailed text descriptions.
- 💡 Native 4K Resolution Support: Produces ultra-high-definition video up to 4K quality.
- 💡 One-Pass Audio and Video Synchronization: Integrates voice, sound effects, and background music naturally aligned with visuals.
- 💡 Multilingual and Accent-Friendly: Supports multiple languages including Chinese and various accents with reliable lip-sync.
- 💡 Advanced Cinematic Controls: Fine control over camera movements (pan, tilt, zoom, dolly, rack focus) and lighting setups.
- 💡 Realistic Character & Motion Modeling: Near-photorealistic faces, nuanced expressions, natural body language, and interactions.
- 💡 Enhanced Physics Simulation: Realistic environmental interactions and smooth motion dynamics.
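The page does not say how the camera controls listed above are exposed through the API. The sketch below assumes they are requested in the prompt text itself, since the request example shows only `model`, `prompt`, and `aspect_ratio`. The helper `build_prompt` and the "Camera: ..." phrasing are hypothetical illustrations, not a documented syntax.

```python
# Camera moves named in the feature list above.
CAMERA_MOVES = ("pan", "tilt", "zoom", "dolly", "rack focus")


def build_prompt(scene: str, camera_moves: list[str]) -> str:
    """Append camera directions to a scene description.

    Hypothetical helper: assumes camera control is expressed in the prompt.
    """
    unknown = [move for move in camera_moves if move not in CAMERA_MOVES]
    if unknown:
        raise ValueError(f"unsupported camera moves: {unknown}")
    if not camera_moves:
        return scene.strip()
    return f"{scene.strip()} Camera: {', '.join(camera_moves)}."


if __name__ == "__main__":
    print(build_prompt("A lighthouse at dusk.", ["pan", "zoom"]))
```

Validating against a fixed list catches typos before a paid generation request is sent.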
Use Cases
- 🎬 Filmmaking and cinematic production with AI
- 🎬 Advertising and marketing video generation
- 🎬 Storyboarding and pre-visualization
- 🎬 Social media content creation with audio-visual synchronization
- 🎬 Multilingual video content for global audiences
- 🎬 Character-driven narrative video with expressive emotions
Comparison with Other Models
Vs. Google Veo 3: Wan 2.5 offers native 4K video, longer clips, and stronger multilingual audio-visual synchronization, including Chinese, along with dynamic cinematic camera controls. Veo 3, by comparison, is limited to 1080p, shorter clips, English-centric audio sync, and basic fixed shots. Wan 2.5 is also more cost-efficient and accepts full audio input, whereas Veo 3 supports system-generated sound only.
Vs. Runway Gen-4: Wan 2.5 excels in efficient real-time audio-video synchronization and native 4K output. It delivers enhanced motion fidelity and flexible camera workflows, whereas Runway Gen-4 primarily focuses on post-production effects and in-browser editing features, with less emphasis on deep audio integration.
Vs. Pika Labs: Wan 2.5 generates longer, continuous narrative videos with finely tuned cinematic controls and comprehensive multilingual voice syncing. Pika Labs, conversely, specializes in faster short clip generation, mainly for social media formats, and lacks advanced camera or audio synchronization features.
Vs. Kling 2.5 Turbo: Wan 2.5 offers superior photorealistic character rendering and precise lip-sync across various languages, alongside multiple video size outputs. Kling 2.5 Turbo is optimized for high-speed generation and stylized animation effects but provides less robust audio-visual integration.
API Integration
Wan 2.5 is accessible via the AI/ML API. Comprehensive documentation is available for developers and integrators.
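For integration, a slightly more defensive request wrapper may be useful. This is a sketch only: the endpoint URL, model name, and request fields come from the example at the top of this page, while the environment variable name `AICC_API_KEY`, the function names, and the timeout value are assumptions. The response schema is not described here, so the JSON is returned as-is.

```python
import os

API_URL = "https://api.ai.cc/v2/generate/video/alibaba/generation"
MODEL = "alibaba/wan-25-preview/text-to-video"


def build_request(prompt, aspect_ratio="16:9", api_key=None):
    """Return (headers, payload) for a text-to-video generation request."""
    # AICC_API_KEY is a hypothetical variable name chosen for this sketch.
    key = api_key or os.environ.get("AICC_API_KEY")
    if not key:
        raise RuntimeError("no API key: pass api_key or set AICC_API_KEY")
    headers = {
        "Authorization": f"Bearer {key}",
        "Content-Type": "application/json",
    }
    payload = {"model": MODEL, "prompt": prompt, "aspect_ratio": aspect_ratio}
    return headers, payload


def generate(prompt, aspect_ratio="16:9", api_key=None, timeout=30):
    """Send the request and return the decoded JSON response unchanged."""
    # Imported here so build_request works without the requests package.
    import requests

    headers, payload = build_request(prompt, aspect_ratio, api_key)
    response = requests.post(API_URL, json=payload, headers=headers, timeout=timeout)
    response.raise_for_status()  # raise on HTTP 4xx/5xx instead of parsing an error body
    return response.json()
```

Reading the key from the environment keeps it out of source control, and `raise_for_status()` turns an HTTP error into an exception rather than a confusing JSON parse of the error body.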
Frequently Asked Questions (FAQ)
A: Wan 2.5 leverages advanced AI models for generating ultra-detailed environmental and facial features, combined with a sophisticated physics engine for realistic motion and interaction effects, achieving near-photorealistic output.
A: It features robust one-pass audio and video synchronization, ensuring precise, high-accuracy lip-sync and voice matching across multiple languages, including Chinese, and across accented speech.
A: Wan 2.5 provides advanced cinematic controls such as pan, tilt, zoom, dolly, and rack focus, allowing creators fine-grained control over camera movements and lighting setups for dynamic scene composition.
A: Absolutely. With native 4K support, multi-minute video generation, realistic character modeling, and advanced cinematic controls, Wan 2.5 is ideal for professional filmmaking, advertising, and high-quality marketing video generation.
A: Wan 2.5 is positioned as a more budget-friendly option in terms of computational cost compared to many similar high-end video generation models currently available in the market, making advanced video creation more accessible.