🚀 DeepSeek V4 Now Available — 1M Context, 384K Output

Access DeepSeek V4 Models
at Unbeatable Prices

OpenAI-compatible API gateway to DeepSeek V4-Pro and V4-Flash. Drop-in replacement. No code rewrite needed.

JavaScript

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: 'https://hnqyai.com/v1' // Just change this line!
});

// Everything else stays the same
const response = await client.chat.completions.create({
  model: 'deepseek-v4-flash',
  messages: [{ role: 'user', content: 'Hello!' }]
});

Supported Models

Direct access to DeepSeek V4 models through a single, unified API

DeepSeek-V4-Pro

DeepSeek

Flagship

1.6T params · 49B active · 1M context · 384K output

DeepSeek's most powerful model. Excels at complex reasoning, math, and code generation. Supports thinking mode.

$2.80 /M input

DeepSeek-V4-Flash

DeepSeek

Best Value

284B params · 13B active · 1M context · 384K output

Extreme cost efficiency for everyday tasks. Supports thinking mode. 91% cheaper than GPT-4o on input.

$0.22 /M input

Why QYAI?

Simple, powerful, and cost-effective AI infrastructure

💰

Up to 91% Cheaper

V4-Flash input at $0.22/M tokens vs GPT-4o's $2.50/M. V4-Pro output at $5.60/M vs GPT-4o's $10.00/M. Same quality, fraction of the cost.

🔌

Drop-in OpenAI Compatible

Change one line: base_url. Works with all OpenAI SDKs, LangChain, LlamaIndex, and any OpenAI-compatible tool. Also supports gpt-4o and gpt-4o-mini aliases.

🧠

Thinking Mode Support

Both V4-Pro and V4-Flash support DeepSeek's thinking mode for complex reasoning. Get deeper analysis when you need it, skip it when you don't.

Get Started in Seconds

Just change the base URL. Everything else stays the same.

Python

from openai import OpenAI

client = OpenAI(
    api_key="your-api-key",
    base_url="https://hnqyai.com/v1"  # Just change this line!
)

response = client.chat.completions.create(
    model="deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no minimums.

Model	Input	Output	Context
GPT-4o	$2.50/M	$10.00/M	128K
DeepSeek-V4-Pro 44% OFF output	$2.80/M	$5.60/M	1M
DeepSeek-V4-Flash 91% OFF input	$0.22/M	$0.44/M	1M

Access DeepSeek V4 Modelsat Unbeatable Prices