Context window: 131K tokens

Hermes 4 405B

Its hybrid reasoning mode allows users to switch between fast, direct responses and deep, step-by-step analysis, making it highly adaptable for diverse use cases.

Free $1 Tokens for New Members

JavaScript

const { OpenAI } = require('openai');

// OpenAI-compatible client pointed at the ai.cc endpoint
const api = new OpenAI({
  baseURL: 'https://api.ai.cc/v1',
  apiKey: '', // insert your API key here
});

const main = async () => {
  const result = await api.chat.completions.create({
    model: 'nousresearch/hermes-4-405b',
    messages: [
      {
        role: 'system',
        content: 'You are an AI assistant who knows everything.',
      },
      {
        role: 'user',
        content: 'Tell me, why is the sky blue?',
      },
    ],
  });

  // The reply text is in the first choice
  const message = result.choices[0].message.content;
  console.log(`Assistant: ${message}`);
};

main();

Python

from openai import OpenAI

# OpenAI-compatible client pointed at the ai.cc endpoint
client = OpenAI(
    base_url="https://api.ai.cc/v1",
    api_key="",  # insert your API key here
)

response = client.chat.completions.create(
    model="nousresearch/hermes-4-405b",
    messages=[
        {
            "role": "system",
            "content": "You are an AI assistant who knows everything.",
        },
        {
            "role": "user",
            "content": "Tell me, why is the sky blue?",
        },
    ],
)

# The reply text is in the first choice
message = response.choices[0].message.content

print(f"Assistant: {message}")

One API 300+ AI Models

Save 20% on Costs & $1 Free Tokens

Product Detail

✨ Unveiling Hermes 4 405B API: Next-Gen Hybrid Reasoning

Hermes 4 405B is a hybrid reasoning language model developed by Nous Research. Built on Meta’s Llama-3.1-405B, it is engineered for advanced reasoning, structured outputs, and flexible user control, and it targets demanding applications in math, code, STEM, and logical reasoning.

🧠 Technical Specifications

✔️ Base Architecture: Llama-3.1-405B, one of the largest open-weight transformer models available.
✔️ Parameter Count: 405 billion parameters.
✔️ Training Data: Instruction-tuned on approximately 60 billion tokens of post-training data, with an emphasis on reasoning traces.

🚀 Performance Benchmarks

💡 Math & Logic: Outperforms previous Hermes models and competes with leading closed-source models on complex math, code, and logical reasoning tasks.
💡 STEM & Creativity: Performs strongly across science, technology, engineering, and mathematics, and shows strong creative writing with improved format-faithful outputs.
💡 General Assistant: Remains broadly useful for general-purpose tasks, with high coherence and contextual understanding in multi-turn conversations.
💡 Speed vs. Depth: Hybrid Reasoning Mode lets users choose between rapid, direct responses and slower, deliberative processing with explicit reasoning traces.
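
For multi-turn conversations, the full message history is sent with every request, using the same role/content format as the code samples on this page. The sketch below shows one way to keep that history. The `Conversation` class is a hypothetical helper, not part of any SDK, and the assistant reply is a placeholder string, not real model output.

```python
class Conversation:
    """Keeps the running message list for a multi-turn chat (hypothetical helper)."""

    def __init__(self, system_prompt: str):
        # The system message always stays first in the history.
        self.messages = [{"role": "system", "content": system_prompt}]

    def add_user(self, content: str) -> None:
        self.messages.append({"role": "user", "content": content})

    def add_assistant(self, content: str) -> None:
        # Store the model's reply so the next request includes it as context.
        self.messages.append({"role": "assistant", "content": content})


chat = Conversation("You are an AI assistant who knows everything.")
chat.add_user("Tell me, why is the sky blue?")
chat.add_assistant("(placeholder reply)")  # in practice: the text returned by the API
chat.add_user("And why are sunsets red?")

# chat.messages is what would be passed as `messages` in the next request.
print(len(chat.messages))
```

Keeping the history in one list means each follow-up question is answered with the earlier turns in view, at the cost of more input tokens per request.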

✨ Key Features

⚙️ Hybrid Reasoning Mode: Users can toggle between direct responses and detailed, trace-based reasoning using the reasoning boolean flag.
🎯 Enhanced Steerability: Improved alignment and reduced refusal rates on user-directed tasks give more precise, controllable outputs.
📜 Large Context Window: Handles long documents and complex multi-step tasks while maintaining coherence.
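
The Hybrid Reasoning Mode toggle could be wired into a request roughly as follows. This is a minimal sketch under stated assumptions: the page names a reasoning boolean flag but does not show where it goes, so the `reasoning` field name and its placement in `extra_body` are assumptions, and `build_request` is a hypothetical helper. Check the API documentation for the actual parameter.

```python
def build_request(prompt: str, reasoning: bool) -> dict:
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": "nousresearch/hermes-4-405b",
        "messages": [{"role": "user", "content": prompt}],
        # ASSUMPTION: the flag is sent as a top-level JSON field named
        # "reasoning". extra_body is how the OpenAI Python SDK forwards
        # request fields that it does not model itself.
        "extra_body": {"reasoning": reasoning},
    }


# Fast, direct answer for a simple question.
fast = build_request("What is 17 * 23?", reasoning=False)

# Deliberative answer with reasoning traces for a harder one.
deep = build_request("Prove that the square root of 2 is irrational.", reasoning=True)

# With a configured client: response = client.chat.completions.create(**deep)
```

Building the arguments separately from the call keeps the toggle in one place, so switching modes per request does not require touching the client setup.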

💲 Hermes 4 405B API Pricing

➡️ Input: $1.05 per 1M tokens
⬅️ Output: $3.15 per 1M tokens
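
As a worked example of the rates above, the sketch below estimates the cost of a request from its token counts. `estimate_cost` is a hypothetical helper and the token counts are made-up illustration values; actual billing may round differently.

```python
INPUT_USD_PER_MILLION = 1.05   # input rate listed on this page
OUTPUT_USD_PER_MILLION = 3.15  # output rate listed on this page


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Illustration: 200,000 input tokens and 50,000 output tokens.
# 200,000 * 1.05 / 1M = $0.21, plus 50,000 * 3.15 / 1M = $0.1575.
print(f"${estimate_cost(200_000, 50_000):.4f}")  # prints $0.3675
```

Because output tokens cost three times as much as input tokens, long reasoning traces affect the bill more than long prompts of the same length.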

🎯 Versatile Use Cases

🔢 Advanced Reasoning: Complex problem-solving in math, logic, and STEM.
💻 Code Generation: Code synthesis, debugging assistance, and technical documentation.
✍️ Creative Writing: Storytelling, roleplaying scenarios, and nuanced, subjective responses.
🏢 Enterprise Integration: Enterprise assistants, chatbots, and automation of complex workflows.

⚖️ Comparison with Other Leading Models

➡️ vs Llama-3.1 Instruct: Hermes 4 405B offers stronger reasoning, structured outputs, and steerability, supported by a larger post-training corpus and its hybrid reasoning mode.
➡️ vs GPT-4.1 nano: Hermes 4 405B matches or exceeds GPT-4.1 nano on intelligence and reasoning benchmarks, with significantly lower refusal rates.
➡️ vs Hermes 3: Hermes 4 uses a 50x larger training dataset, with improved reasoning traces, schema adherence, and function calling.
➡️ vs Claude 3: Hermes 4 405B particularly excels in math, code, and structured outputs, with an emphasis on user control and neutrality in its responses.
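
When relying on the structured outputs and schema adherence mentioned above, it is still worth validating each reply before using it. The sketch below parses a reply as JSON and checks for required keys. The schema (`answer`, `confidence`) and the `parse_structured_reply` helper are hypothetical, chosen only for illustration, and the sample string stands in for a model reply.

```python
import json

# Hypothetical schema: the keys the application expects in every reply.
REQUIRED_KEYS = {"answer", "confidence"}


def parse_structured_reply(text: str) -> dict:
    """Parse a model reply as a JSON object and check the required keys."""
    data = json.loads(text)  # raises json.JSONDecodeError on invalid JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    return data


# Stand-in for response.choices[0].message.content
sample_reply = '{"answer": "Rayleigh scattering", "confidence": 0.9}'
parsed = parse_structured_reply(sample_reply)
print(parsed["answer"])
```

A failed check can be handled by retrying the request or falling back to a plain-text path, depending on the application.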

❓ Frequently Asked Questions (FAQ)

Q1: What is Hermes 4 405B?

Hermes 4 405B is a hybrid reasoning language model developed by Nous Research. It is built on Meta's Llama-3.1-405B and designed for advanced reasoning, structured outputs, and flexible user control in demanding applications across math, code, STEM, and logical reasoning.

Q2: What are the key features of Hermes 4 405B?

Key features include a Hybrid Reasoning Mode for toggling between fast and detailed responses, Enhanced Steerability for customizable outputs, and a Large Context Window for extensive documents and complex multi-step tasks.

Q3: How does Hermes 4 405B compare to other models?

Hermes 4 405B offers stronger reasoning and steerability than Llama-3.1 Instruct, matches or exceeds GPT-4.1 nano on intelligence benchmarks, and improves significantly on Hermes 3 with a 50x larger training dataset. It also excels relative to Claude 3 in math, code, and structured outputs, with a focus on user control.

Q4: What are the primary use cases for Hermes 4 405B?

Its main use cases span advanced reasoning in math, logic, and STEM; reliable code generation and debugging; creative writing for storytelling and roleplaying; and robust enterprise integration for assistants and workflow automation.

Q5: What is the API pricing for Hermes 4 405B?

The API is priced at $1.05 per 1M tokens for input and $3.15 per 1M tokens for output.

AI Playground

Test all API models in the sandbox environment before you integrate. We provide more than 300 models to integrate into your app.
