



const { OpenAI } = require('openai');
const api = new OpenAI({
baseURL: 'https://api.ai.cc/v1',
apiKey: '',
});
const main = async () => {
const result = await api.chat.completions.create({
model: 'nousresearch/hermes-4-405b',
messages: [
{
role: 'system',
content: 'You are an AI assistant who knows everything.',
},
{
role: 'user',
content: 'Tell me, why is the sky blue?'
}
],
});
const message = result.choices[0].message.content;
console.log(`Assistant: ${message}`);
};
main();
import os
from openai import OpenAI
client = OpenAI(
base_url="https://api.ai.cc/v1",
api_key="",
)
response = client.chat.completions.create(
model="nousresearch/hermes-4-405b",
messages=[
{
"role": "system",
"content": "You are an AI assistant who knows everything.",
},
{
"role": "user",
"content": "Tell me, why is the sky blue?"
},
],
)
message = response.choices[0].message.content
print(f"Assistant: {message}")
-
AI Playground

Test all API models in the sandbox environment before you integrate.
We provide more than 300 models to integrate into your app.


Product Detail
✨ Unveiling Hermes 4 405B API: Next-Gen Hybrid Reasoning
Hermes 4 405B stands as a cutting-edge, hybrid reasoning language model, meticulously developed by Nous Research. Built on the robust foundation of Meta’s Llama-3.1-405B, this model is specifically engineered for advanced reasoning capabilities, producing structured outputs, and offering flexible user control. It emerges as a top-tier choice for highly demanding AI applications across critical domains such as math, code, STEM, and intricate logical reasoning tasks.
🧠 Technical Specifications
- ✔️ Base Architecture: Built on Llama-3.1-405B, recognized as one of the largest open-weight transformer models available.
- ✔️ Parameter Count: Features an impressive 405 billion parameters, enabling sophisticated computations.
- ✔️ Training Data: Instruction-tuned using approximately 60 billion tokens of high-quality post-training data, with a significant emphasis on developing robust reasoning traces.
🚀 Performance Benchmarks
- 💡 Math & Logic: Demonstrates superior performance, outperforming previous Hermes models and competing directly with leading closed-source models in complex math, code, and logical reasoning tasks.
- 💡 STEM & Creativity: Excels across scientific, technical, engineering, and mathematical domains, alongside showcasing strong capabilities in creative writing with improved format-faithful outputs.
- 💡 General Assistant: Maintains broad utility for a wide array of general-purpose tasks, exhibiting high coherence and contextual understanding in multi-turn conversations.
- 💡 Speed vs. Depth: Its innovative Hybrid Reasoning Mode empowers users to dynamically choose between rapid, direct responses and more profound, deliberative processing complete with explicit reasoning traces.
✨ Key Features
-
⚙️ Hybrid Reasoning Mode: Users can effortlessly toggle between swift, direct responses and detailed, trace-based reasoning processes using the designated
reasoningboolean flag. - 🎯 Enhanced Steerability: Offers unparalleled customizability with significantly improved alignment and reduced refusal rates for user-directed tasks, ensuring more precise and controlled outputs.
- 📜 Large Context Window: Engineered to proficiently handle extensive documents and navigate complex multi-step tasks, maintaining exceptional coherence and accuracy throughout.
💲 Hermes 4 405B API Pricing
- ➡️ Input: $1.05 per 1M tokens
- ⬅️ Output: $3.15 per 1M tokens
🎯 Versatile Use Cases
- 🔢 Advanced Reasoning: Ideal for tackling complex challenges in math, logic, and STEM problem-solving.
- 💻 Code Generation: Provides reliable code synthesis, efficient debugging assistance, and comprehensive technical documentation capabilities.
- ✍️ Creative Writing: Empowers engaging storytelling, dynamic roleplaying scenarios, and the generation of nuanced, subjective responses.
- 🏢 Enterprise Integration: Perfectly suited for developing sophisticated enterprise assistants, intelligent chatbots, and automating complex workflows within organizational structures.
</> Code Sample
<snippet data-name="open-ai.chat-completion" data-model="nousresearch/hermes-4-405b"></snippet>
⚖️ Comparison with Other Leading Models
- ➡️ vs Llama-3.1 Instruct: Hermes 4 405B offers superior reasoning, structured outputs, and enhanced steerability, further bolstered by a larger post-training corpus and its innovative hybrid reasoning mode.
- ➡️ vs GPT-4.1 nano: Hermes 4 405B consistently matches or exceeds GPT-4.1 nano in intelligence and reasoning benchmarks, while also featuring a much larger context window and significantly lower refusal rates.
- ➡️ vs Hermes 3: Hermes 4 represents a substantial advancement, featuring a 50x larger training dataset, alongside profoundly improved reasoning traces and enhanced schema adherence and function calling capabilities.
- ➡️ vs Claude 3: Hermes 4 405B particularly excels in math, code, and structured outputs, maintaining a strong emphasis on user control and neutrality in its responses.
❓ Frequently Asked Questions (FAQ)
Q1: What is Hermes 4 405B?
Hermes 4 405B is a state-of-the-art, hybrid reasoning language model developed by Nous Research. It's built on Meta's Llama-3.1-405B foundation and designed for advanced reasoning, structured outputs, and flexible user control in demanding AI applications across math, code, STEM, and logical reasoning.
Q2: What are the key features of Hermes 4 405B?
Key features include a Hybrid Reasoning Mode for toggling between fast and detailed responses, enhanced Steerability for customizable outputs, and a Large Context window capable of handling extensive documents and complex multi-step tasks.
Q3: How does Hermes 4 405B compare to other models?
Hermes 4 405B offers superior reasoning and steerability compared to Llama-3.1 Instruct, matches or exceeds GPT-4.1 nano in intelligence with a larger context window, and significantly improves upon Hermes 3 with a 50x larger training dataset. It also excels over Claude 3 in math, code, and structured outputs with a focus on user control.
Q4: What are the primary use cases for Hermes 4 405B?
Its main use cases span advanced reasoning in math, logic, and STEM; reliable code generation and debugging; creative writing for storytelling and roleplaying; and robust enterprise integration for assistants and workflow automation.
Q5: What is the API pricing for Hermes 4 405B?
The API is priced at $1.05 per 1M tokens for input and $3.15 per 1M tokens for output.
Learn how you can transformyour company with AICC APIs



Log in