



const axios = require('axios').default;
const api = axios.create({
baseURL: 'https://api.ai.cc/v1',
headers: { Authorization: 'Bearer ' },
});
const main = async () => {
const response = await api.post('/tts', {
model: 'openai/tts-1',
text: 'OpenAI TTS are fast and powerful language models. Use it to convert text to natural sounding spoken text.',
voice: 'coral',
});
console.log('Audio URL:', response.data.audio.url);
console.log('Characters:', response.data.usage.characters);
};
main();
import requests
def main():
url = "https://api.ai.cc/v1/tts"
headers = {
"Authorization": "Bearer ",
}
payload = {
"model": "openai/tts-1",
"text": "OpenAI TTS are fast and powerful language models. Use it to convert text to natural sounding spoken text.",
"voice": "coral"
}
response = requests.post(url, headers=headers, json=payload)
data = response.json()
print("Audio URL:", data["audio"]["url"])
print("Characters:", data["usage"]["characters"])
main()
-
AI Playground

Test all API models in the sandbox environment before you integrate.
We provide more than 300 models to integrate into your app.


Product Detail
💭 TTS-1 (Text-To-Speech) is an advanced neural network model developed by OpenAI, engineered to transform written text into natural and engaging speech. Leveraging state-of-the-art deep learning techniques in Natural Language Processing (NLP), TTS-1 synthesizes voice output that remarkably mimics human speech patterns, intonation, and rhythm, making digital interactions feel more intuitive and lifelike.
💻Technical Specifications
- ✔ Model Type: Deep learning-based TTS neural network
- ✔ Input: Text prompt including punctuation, supporting complex structures
- ✔ Output: High-fidelity audio waveform, various formats available
- ✔ Core Technology: NLP-driven acoustic feature prediction combined with neural vocoders for superior voice quality
- ✔ Deployment: Cloud or edge deployment compatible, offering flexibility
🏆Performance Benchmarks
- Exceptional Mean Opinion Score (MOS): Consistently achieves high scores in subjective listening tests, demonstrating significant user preference over traditional TTS systems due to its naturalness.
- Ultra-Low Latency: Significantly reduced latency compared to earlier TTS architectures, enabling near real-time speech synthesis for interactive applications.
- Competitive Word Error Rates (WER): When synthesized speech is utilized with speech recognition systems, TTS-1 maintains highly competitive word error rates, ensuring clarity and accuracy.
⭐Key Features of TTS-1 API
- Natural-sounding Speech: Delivers speech with human-like intonation, rhythm, and clear pronunciation.
- Context-Aware Synthesis: Intelligently captures and applies appropriate emotional tones based on the textual context.
- End-to-End Pipeline: A streamlined process from text analysis to high-quality audio output, ensuring consistency and efficiency.
- Robust Text Handling: Expertly manages varying sentence structures, complex punctuation, numbers, and abbreviations.
- Scalable Voice and Style Options: Adaptable for a diverse range of voice types and speaking styles, providing extensive customization.
💰Pricing & API Integration
💸TTS-1 API Pricing
Experience top-tier text-to-speech capabilities at competitive rates:
$0.01575 per 1,000 characters
This cost-effective structure makes TTS-1 an excellent choice for both small-scale projects and high-volume enterprise applications.
🔗Seamless API Integration
The TTS-1 model is readily accessible via a robust AI/ML API, designed for straightforward and efficient integration into your existing systems and new projects.
Comprehensive Documentation:
For detailed API references, parameter guides, and best practices, consult the official documentation:
Referenced from OpenAI TTS-1 API Documentation.
💻Code Sample for TTS-1
<snippet data-name="voice.tts-openai" data-model="openai/tts-1"></snippet>
❓Frequently Asked Questions (FAQ)
➡What is TTS-1 API?
TTS-1 is OpenAI's advanced Text-to-Speech synthesis model, designed to convert written text into natural-sounding, high-quality spoken audio with multiple voice options and extensive language support.
➡What are the main applications for TTS-1?
TTS-1 is ideal for a broad range of applications including voice assistants, audiobook generation, podcast creation, e-learning content, accessibility tools, IVR systems, and video narration.
➡How much does TTS-1 cost?
The TTS-1 API is priced at $0.01575 per 1,000 characters, offering a cost-effective solution for various scales of text-to-speech requirements.
➡What audio formats does TTS-1 support?
The model supports high-quality audio output in popular formats such as MP3, WAV, AAC, and OGG, with various bitrate options from 24kbps to 320kbps.
➡Does TTS-1 support SSML (Speech Synthesis Markup Language)?
Yes, TTS-1 fully supports SSML for advanced speech control, enabling precise management of pronunciation, pauses, emphasis, and phonetics for professional-grade results.
Learn how you can transformyour company with AICC APIs



Log in