



const { OpenAI } = require('openai');
const api = new OpenAI({
baseURL: 'https://api.ai.cc/v1',
apiKey: '',
});
const main = async () => {
const result = await api.chat.completions.create({
model: 'Qwen/Qwen2.5-72B-Instruct-Turbo',
messages: [
{
role: 'system',
content: 'You are an AI assistant who knows everything.',
},
{
role: 'user',
content: 'Tell me, why is the sky blue?'
}
],
});
const message = result.choices[0].message.content;
console.log(`Assistant: ${message}`);
};
main();
import os
from openai import OpenAI
client = OpenAI(
base_url="https://api.ai.cc/v1",
api_key="",
)
response = client.chat.completions.create(
model="Qwen/Qwen2.5-72B-Instruct-Turbo",
messages=[
{
"role": "system",
"content": "You are an AI assistant who knows everything.",
},
{
"role": "user",
"content": "Tell me, why is the sky blue?"
},
],
)
message = response.choices[0].message.content
print(f"Assistant: {message}")
-
AI Playground

Test all API models in the sandbox environment before you integrate.
We provide more than 300 models to integrate into your app.


Product Detail
🚀 Introducing Qwen 2.5 72B Instruct Turbo: A Powerful LLM for Advanced Tasks
Qwen 2.5 72B Instruct Turbo stands as a cutting-edge large language model engineered by Alibaba, designed to excel in a diverse array of natural language processing tasks. This model is particularly optimized for instruction following, sophisticated coding assistance, and complex mathematical problem-solving, making it an indispensable tool for developers and researchers alike.
✨ Key Information at a Glance
- Model Name: Qwen 2.5 72B Instruct Turbo
- Developer/Creator: Alibaba
- Release Date: September 19, 2024
- Version: 2.5
- Model Type: Text Generation
🌐 Overview & Capabilities
This state-of-the-art model is built to push the boundaries of AI-driven text generation and comprehension. Its robust design caters to demanding applications requiring high-quality, reliable outputs.
✅ Core Features of Qwen 2.5 72B Instruct Turbo:
- Context Window: Supports up to 128K tokens for comprehensive understanding.
- Instruction Following: Significantly enhanced capabilities for precise task execution.
- Coding & Math Prowess: Superior performance in complex coding and mathematical challenges.
- Open-Source: Licensed under Apache License 2.0 for flexible commercial and non-commercial use.
- Quality Index: Achieves a high-quality output with a robust Quality Index of 75.
🎯 Intended Use Cases
Qwen 2.5 72B Instruct Turbo is primarily aimed at software developers seeking advanced support for coding, natural language understanding, and structured output generation (e.g., JSON). It is exceptionally suited for scenarios demanding long-form content generation and intricate problem-solving.
🗣️ Comprehensive Language Support
While primarily supporting English, the model demonstrates impressive multilingual capabilities, understanding and generating text in numerous languages including Chinese, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
🔧 Technical Deep Dive
🏗️ Architecture
Qwen 2.5 employs a sophisticated transformer architecture, an optimal choice for handling sequential data and managing extensive contexts over long inputs efficiently.
📚 Training Data & Knowledge Cutoff
The model underwent training on a diverse dataset encompassing various domains such as programming languages, mathematics, and general knowledge. This broad training base ensures robust understanding and responsiveness across a multitude of topics.
The training involved hundreds of gigabytes of text data sourced from open-source repositories, academic papers, and web content, guaranteeing a comprehensive representation of knowledge.
Knowledge Cutoff: The model's knowledge base is current as of September 2024.
🌍 Diversity and Bias Considerations
Qwen 2.5 was developed using a diverse dataset with the explicit goal of minimizing bias in its outputs. Continuous evaluations are an integral part of its operational protocol to identify and address any emerging biases.
📊 Performance Metrics
Qwen 2.5 72B Instruct demonstrates exceptional performance in logical reasoning and math tasks, scoring 95.8 on GSM8K and 83.1 on MATH. It also excels in human evaluation and programming benchmarks, achieving 86.6 on HumanEval and 88.2 on MBPP. While strong, it exhibits relatively lower performance on specific tests like GPQA (49.0) and LiveBench 0831 (52.3).

💡 Qwen 2.5 72B Instruct Turbo vs. Standard Version
The "Turbo" variant of Qwen 2.5 72B Instruct offers enhanced speed due to a reduced maximum token limit of 32K, compared to the standard model's 128K context window. This trade-off requires fewer computational resources for input processing, resulting in faster performance. The Turbo version is particularly efficient for tasks that do not necessitate the full 128K token context, while still delivering strong performance across most use cases.
📈 Comparison to Other Leading AI Models
Below, we examine the Quality and Speed performance of Qwen 2.5 72B Instruct against other prominent AI models in the industry.
Quality Benchmarks:
In terms of quality, Qwen 2.5 72B Instruct achieves a competitive score of 75, positioning it alongside top-tier models such as Gemini 1.5 Pro and Claude 3.5 Sonnet. It notably surpasses models like Llama 3.1 (405B) and GPT-4o in this metric.
Speed Performance:
Measuring output tokens per second, Qwen 2.5 72B Instruct performs at 35 tokens per second. While slightly behind Gemini 1.5 Flash and GPT-4o mini, it holds a strong lead over other well-known models like o1-preview and Llama 3.1. This demonstrates Qwen 2.5 72B Instruct as a well-balanced model, delivering a robust combination of quality and speed for demanding AI tasks.

🛠️ Usage & Integration
💻 Code Samples
The Qwen 2.5 72B Instruct Turbo model is readily available on the AI/ML API platform under the identifier "Qwen/Qwen2.5-72B-Instruct-Turbo".
📄 API Documentation
For comprehensive details on integrating and utilizing the model, access the detailed API Documentation here.
⚖️ Ethical Guidelines & Licensing
🤝 Ethical AI Development
The development of Qwen models strictly adheres to ethical standards, with a strong commitment to minimizing harm and promoting fairness in AI applications. Continuous monitoring for biases and the generation of inappropriate content is a core part of its operational protocol.
📜 Licensing Information
Qwen 2.5 72B Instruct Turbo is open-source under the Apache License 2.0, granting extensive commercial and non-commercial usage rights.
❓ Frequently Asked Questions (FAQ)
Q1: What are the primary applications of Qwen 2.5 72B Instruct Turbo?
A1: It is designed for advanced natural language processing tasks, including instruction following, complex coding assistance, and mathematical problem-solving, making it ideal for software developers and AI research.
Q2: What is the main difference between Qwen 2.5 72B Instruct Turbo and the standard Qwen 2.5 72B Instruct model?
A2: The Turbo version has a reduced maximum token limit of 32K (compared to 128K for the standard model). This reduction enhances its speed and efficiency for tasks that don't require an extensive context window, by consuming fewer computational resources.
Q3: What languages does Qwen 2.5 72B Instruct Turbo support?
A3: While its primary language is English, it boasts robust multilingual support, including Chinese, French, Spanish, German, Japanese, Korean, Arabic, and many others.
Q4: Is Qwen 2.5 72B Instruct Turbo open-source, and what are its licensing terms?
A4: Yes, it is open-source under the Apache License 2.0, which permits both commercial and non-commercial usage.
Q5: How does Qwen 2.5 72B Instruct Turbo compare in performance to other leading AI models?
A5: It ranks competitively in quality with a score of 75, comparable to Gemini 1.5 Pro and Claude 3.5 Sonnet, and performs at 35 tokens per second in speed, placing it ahead of many well-known models like Llama 3.1.
Learn how you can transformyour company with AICC APIs



Log in