🚀 DeepSeek V4 Now Available — 1M Context, 384K Output

Access DeepSeek V4 Models at Unbeatable Prices

OpenAI-compatible API gateway to DeepSeek V4-Pro and V4-Flash. Drop-in replacement. No code rewrite needed.

JavaScript
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: 'https://hnqyai.com/v1' // Just change this line!
});

// Everything else stays the same
const response = await client.chat.completions.create({
  model: 'deepseek-v4-flash',
  messages: [{ role: 'user', content: 'Hello!' }]
});

// Print the assistant's reply
console.log(response.choices[0].message.content);

Supported Models

Direct access to DeepSeek V4 models through a single, unified API

DeepSeek-V4-Pro
DeepSeek
Flagship
1.6T params · 49B active · 1M context · 384K output
DeepSeek's most powerful model. Excels at complex reasoning, math, and code generation. Supports thinking mode.
$2.80/M input
DeepSeek-V4-Flash
DeepSeek
Best Value
284B params · 13B active · 1M context · 384K output
Extreme cost efficiency for everyday tasks. Supports thinking mode. 91% cheaper than GPT-4o on input.
$0.22/M input

Why QYAI?

Simple, powerful, and cost-effective AI infrastructure

💰

Up to 91% Cheaper

V4-Flash input costs $0.22/M tokens vs. GPT-4o's $2.50/M (91% less). V4-Pro output costs $5.60/M vs. GPT-4o's $10.00/M (44% less). Same quality, fraction of the cost.

🔌

Drop-in OpenAI Compatible

Change one line: the base URL (base_url in the Python SDK, baseURL in the JavaScript SDK). Works with all OpenAI SDKs, LangChain, LlamaIndex, and any OpenAI-compatible tool. Also accepts gpt-4o and gpt-4o-mini as model aliases.
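
For tools that don't use an SDK, "OpenAI-compatible" in practice means a JSON POST to the chat completions path under the base URL. Here is a minimal sketch using only the Python standard library. It assumes the gateway serves the standard /chat/completions route with Bearer-token auth, as the SDK examples on this page imply. QYAI_API_KEY is a hypothetical environment variable name chosen for illustration.

```python
import json
import os
import urllib.request
from typing import Optional

BASE_URL = "https://hnqyai.com/v1"  # base URL shown on this page


def build_chat_request(prompt: str,
                       model: str = "deepseek-v4-flash",
                       api_key: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    # QYAI_API_KEY is a hypothetical variable name used only in this sketch.
    key = api_key or os.environ.get("QYAI_API_KEY", "your-api-key")
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",  # assumed standard OpenAI route
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(request) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Calling send(build_chat_request("Hello!")) should return the same reply text as the SDK examples, provided the assumptions above hold.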

🧠

Thinking Mode Support

Both V4-Pro and V4-Flash support DeepSeek's thinking mode for complex reasoning. Get deeper analysis when you need it, skip it when you don't.
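
This page does not document how thinking mode is switched on or how the reasoning text is returned, so treat the following as a sketch under assumptions. It assumes the gateway exposes reasoning in a reasoning_content field on the message, as DeepSeek's own API does for its reasoning models, and falls back cleanly when the field is absent. The helper name split_reasoning is hypothetical.

```python
from types import SimpleNamespace
from typing import Optional, Tuple


def split_reasoning(message) -> Tuple[Optional[str], str]:
    """Return (reasoning, answer) from a chat completion message.

    ASSUMPTION: reasoning text, when present, is exposed as
    `reasoning_content`. This page does not confirm the field name.
    """
    reasoning = getattr(message, "reasoning_content", None)
    answer = message.content or ""
    return reasoning, answer


# Stand-in objects shaped like response.choices[0].message
with_thinking = SimpleNamespace(content="4", reasoning_content="2 + 2 = 4")
without_thinking = SimpleNamespace(content="4")

print(split_reasoning(with_thinking))     # ('2 + 2 = 4', '4')
print(split_reasoning(without_thinking))  # (None, '4')
```

With a real response, pass response.choices[0].message to the helper. Code written this way keeps working whether or not the model returned any reasoning.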

Get Started in Seconds

Just change the base URL. Everything else stays the same.

Python
from openai import OpenAI

client = OpenAI(
    api_key="your-api-key",
    base_url="https://hnqyai.com/v1"  # Just change this line!
)

response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no minimums.

Model              Input    Output    Context  Savings vs. GPT-4o
GPT-4o             $2.50/M  $10.00/M  128K     -
DeepSeek-V4-Pro    $2.80/M  $5.60/M   1M       44% off output
DeepSeek-V4-Flash  $0.22/M  $0.44/M   1M       91% off input
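
To see what these rates mean for your own workload, here is a small estimator sketch. The prices are copied from the table above. The dictionary keys are labels for this example only (deepseek-v4-pro is assumed by analogy with the deepseek-v4-flash model ID used elsewhere on this page), and the 10M-input / 2M-output workload is a made-up illustration.

```python
# Prices in USD per million tokens, copied from the pricing table.
PRICES = {
    "gpt-4o": {"input": 2.50, "output": 10.00},
    "deepseek-v4-pro": {"input": 2.80, "output": 5.60},    # key name assumed
    "deepseek-v4-flash": {"input": 0.22, "output": 0.44},
}


def cost_usd(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD for a given token volume."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000


def savings_pct(model: str, kind: str, baseline: str = "gpt-4o") -> int:
    """Percent saved vs. the baseline for 'input' or 'output' tokens.

    A negative value means the model costs more than the baseline.
    """
    return round(100 * (1 - PRICES[model][kind] / PRICES[baseline][kind]))


# Hypothetical monthly workload: 10M input tokens, 2M output tokens.
for name in PRICES:
    print(f"{name}: ${cost_usd(name, 10_000_000, 2_000_000):.2f}")
# gpt-4o: $45.00
# deepseek-v4-pro: $39.20
# deepseek-v4-flash: $3.08

print(savings_pct("deepseek-v4-flash", "input"))  # 91
print(savings_pct("deepseek-v4-pro", "output"))   # 44
print(savings_pct("deepseek-v4-pro", "input"))    # -12
```

The last line shows that V4-Pro's input rate is above GPT-4o's, so the savings on V4-Pro come from output tokens. Output-heavy workloads benefit most.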