🚀 Now Available · Apache 2.0 License

Alibaba's Breakthrough 80B-Parameter Open-Source AI

Qwen3-Next has 80B total parameters but activates only about 3B of them per token. It offers an ultra-long 262K-token context window, support for 119 languages, and enterprise-ready deployment.

80B Parameters
3B Active per Inference
262K Context Window
119 Languages

qwen3-next-demo.py

# Install the SDK first: pip install dashscope
from dashscope import Generation

# 80B total parameters, about 3B active per token ⚡
response = Generation.call(
    model='qwen3-next-80b-a3b-instruct',
    prompt='Explain quantum computing'
)

Unparalleled Efficiency and Scale

A sparse mixture-of-experts (MoE) architecture delivers cutting-edge performance while cutting compute costs by up to 90%.

Extreme Efficiency

Only 3B of the 80B parameters are activated for each predicted token. The sparse MoE design turns this into cost savings of up to 90%.

3.7% Active Parameters · 90% Cost Reduction
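
To make the "only a few parameters per token" idea concrete, here is a minimal sketch of top-k expert routing, the general mechanism behind sparse MoE layers. The expert count, the value of k, and the random router scores are hypothetical values chosen for illustration; they are not Qwen3-Next's actual configuration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(router_scores)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    weight_sum = sum(probs[i] for i in chosen)
    return {i: probs[i] / weight_sum for i in chosen}


# Hypothetical sizes, for illustration only.
NUM_EXPERTS = 64
TOP_K = 4

random.seed(0)
scores = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
routing = route_top_k(scores, TOP_K)

# Only the chosen experts run for this token; the rest stay idle.
print(f"experts used: {sorted(routing)} out of {NUM_EXPERTS}")
```

Because only the selected experts run, per-token compute scales with k rather than with the total number of experts.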

Ultra-Long Context

Process 262K tokens in a single pass with hybrid DeltaNet and gated attention. Perfect for analyzing entire books or contracts.

262K Token Window · Linear Time Complexity
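
The linear-time claim comes from replacing a growing attention cache with a fixed-size state. The sketch below shows the generic delta-rule recurrence usually associated with DeltaNet-style layers, written in plain Python. It is an illustrative assumption, not Qwen3-Next's implementation: gating, normalization, multiple heads, and the interleaved attention layers are all omitted, and the function name and `beta` parameter are hypothetical.

```python
def delta_rule_scan(queries, keys, values, beta=1.0):
    """Process a sequence with a fixed-size state matrix (d_v x d_k).

    Each step nudges the state so that it maps the current key closer to
    the current value, then reads the state out with the current query.
    Work per token is constant, so total cost grows linearly with length.
    """
    d_k = len(keys[0])
    d_v = len(values[0])
    state = [[0.0] * d_k for _ in range(d_v)]
    outputs = []
    for q, k, v in zip(queries, keys, values):
        # What the current state would return for this key.
        predicted = [sum(state[i][j] * k[j] for j in range(d_k)) for i in range(d_v)]
        # Move the state toward the target value along the key direction.
        for i in range(d_v):
            error = v[i] - predicted[i]
            for j in range(d_k):
                state[i][j] += beta * error * k[j]
        # Read out with the query.
        outputs.append([sum(state[i][j] * q[j] for j in range(d_k)) for i in range(d_v)])
    return outputs


# Toy sequence: two orthogonal unit keys, queried with the same vectors.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[2.0, 3.0], [5.0, 7.0]]
print(delta_rule_scan(keys, keys, values))
```

With unit-length keys and `beta=1.0`, querying with a key returns the value stored for it, which is the associative-memory behavior the update rule is designed to give.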

Multi-Token Prediction

Generate multiple tokens per step, reducing latency and enhancing user experiences for real-time applications.

Lower Latency · Faster Generation
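
One common way to use multi-token prediction is draft-and-verify decoding: extra prediction heads propose several tokens, and the base model keeps the longest prefix it agrees with. The sketch below is a toy version of that acceptance step. The `next_token` callable and the counting "model" are hypothetical stand-ins, and a real system would verify all drafted tokens in one batched forward pass rather than in a Python loop.

```python
def accept_draft(context, draft_tokens, next_token):
    """Keep the longest prefix of draft_tokens that the base model agrees with.

    next_token(sequence) is a hypothetical callable returning the base
    model's greedy next token. On the first disagreement, the base model's
    token is used instead and the remaining drafts are discarded.
    """
    accepted = []
    for proposed in draft_tokens:
        expected = next_token(context + accepted)
        if proposed != expected:
            accepted.append(expected)
            break
        accepted.append(proposed)
    return accepted


def toy_next_token(sequence):
    """Toy base model: always predicts the last token plus one."""
    return sequence[-1] + 1


# Two drafted tokens match, the third does not and is corrected.
print(accept_draft([1, 2, 3], [4, 5, 9, 10], toy_next_token))
```

Every accepted draft token is one fewer sequential decoding step, which is where the latency gain comes from.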

Enterprise Ready

Apache 2.0 licensed with on-premises, VPC, and private cloud deployment options for maximum security and compliance.

Apache 2.0 License · Private Deployment

Multilingual Excellence

Native support for 119 languages with seamless cross-lingual capabilities for global applications.

119 Languages · Cross-lingual Tasks

Agentic Tool Use

Integrate with APIs, databases, and external tools for complex multi-step workflows and autonomous reasoning.

API Integration · Multi-step Workflows
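
On the application side, tool use typically means parsing a structured call emitted by the model and dispatching it to local code. The sketch below shows that dispatch step only. The JSON shape, the tool names, and the `run_tool_call` helper are assumptions made for illustration, and the model output is simulated with a hardcoded string rather than a real API response.

```python
import json


def add(a, b):
    """Example tool: add two numbers."""
    return a + b


def word_count(text):
    """Example tool: count whitespace-separated words."""
    return len(text.split())


# Registry mapping tool names to local Python functions (hypothetical tools).
TOOLS = {"add": add, "word_count": word_count}


def run_tool_call(raw_call):
    """Parse a JSON tool call and execute the matching local function."""
    call = json.loads(raw_call)
    name = call["name"]
    if name not in TOOLS:
        return {"tool": name, "error": "unknown tool"}
    result = TOOLS[name](**call.get("arguments", {}))
    return {"tool": name, "result": result}


# Simulated model output; a real agent would read this from the model response.
simulated_call = '{"name": "add", "arguments": {"a": 2, "b": 40}}'
print(run_tool_call(simulated_call))
```

In a multi-step workflow, the returned dictionary would be appended to the conversation so the model can decide on its next call.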

State-of-the-Art Performance

Qwen3-Next scores higher than both Qwen3-32B and Llama 3.1-70B on all four benchmarks below while activating only a fraction of its parameters.

Benchmark    Qwen3-Next-80B    Qwen3-32B    Llama 3.1-70B
MMLU         78.5%             76.2%        77.8%
HumanEval    82.1%             79.5%        81.2%
GSM8K        91.2%             88.7%        90.1%
MT-Bench     84.3%             81.9%        83.5%

+6.7% better than Qwen3-32B on average
90% lower training costs vs dense models
3.7% parameter activation rate
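
For readers who want to check gaps against the table themselves, the sketch below recomputes the mean difference between Qwen3-Next-80B and each baseline from the four rows listed. It covers only those rows, so it should not be expected to reproduce summary figures that may be based on a wider benchmark set. The `mean_gap` helper is a hypothetical name.

```python
# Scores copied from the table: (Qwen3-Next-80B, Qwen3-32B, Llama 3.1-70B).
SCORES = {
    "MMLU": (78.5, 76.2, 77.8),
    "HumanEval": (82.1, 79.5, 81.2),
    "GSM8K": (91.2, 88.7, 90.1),
    "MT-Bench": (84.3, 81.9, 83.5),
}


def mean_gap(scores, baseline_index):
    """Return (mean point gap, mean relative gap) of column 0 over a baseline."""
    point_gaps = []
    relative_gaps = []
    for row in scores.values():
        gap = row[0] - row[baseline_index]
        point_gaps.append(gap)
        relative_gaps.append(gap / row[baseline_index])
    count = len(scores)
    return sum(point_gaps) / count, sum(relative_gaps) / count


for label, index in (("Qwen3-32B", 1), ("Llama 3.1-70B", 2)):
    points, relative = mean_gap(SCORES, index)
    print(f"vs {label}: {points:.2f} points, {relative:.1%} relative")
```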

Get Started in Minutes

Integrate Qwen3-Next into your applications with simple API calls

Python

import os

import dashscope
from dashscope import Generation

# Read the API key from the environment instead of hardcoding it in source
dashscope.api_key = os.getenv('DASHSCOPE_API_KEY')

# Call Qwen3-Next (80B total parameters, about 3B active per token)
response = Generation.call(
    model='qwen3-next-80b-a3b-instruct',
    prompt='Explain the benefits of sparse MoE architectures.',
    max_tokens=200,
    temperature=0.7
)

# A 200 status means the request succeeded; otherwise report the error message
if response.status_code == 200:
    print(response.output['text'])
else:
    print(f"Error: {response.message}")

cURL

curl -X POST 'https://dashscope.aliyuncs.com/api/v1/services/aigc/text-generation/generation' \
  -H 'Authorization: Bearer YOUR_API_KEY' \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "qwen3-next-80b-a3b-instruct",
    "input": {
      "prompt": "Explain the benefits of sparse MoE architectures."
    },
    "parameters": {
      "max_tokens": 200,
      "temperature": 0.7
    }
  }'
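
The same REST request can be assembled without any SDK. The sketch below builds it with Python's standard library, mirroring the endpoint, headers, and JSON body of the curl call above. The `build_request` helper is a hypothetical name, and the request is only constructed here, not sent.

```python
import json
import os
import urllib.request

ENDPOINT = (
    "https://dashscope.aliyuncs.com/api/v1/services/aigc/"
    "text-generation/generation"
)


def build_request(prompt, api_key, max_tokens=200, temperature=0.7):
    """Build a POST request with the same body and headers as the curl example."""
    body = {
        "model": "qwen3-next-80b-a3b-instruct",
        "input": {"prompt": prompt},
        "parameters": {"max_tokens": max_tokens, "temperature": temperature},
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Falls back to a placeholder so the sketch runs without credentials.
api_key = os.environ.get("DASHSCOPE_API_KEY", "YOUR_API_KEY")
request = build_request("Explain the benefits of sparse MoE architectures.", api_key)
print(request.get_method(), request.full_url)

# To actually send it (requires a valid key and network access):
# with urllib.request.urlopen(request, timeout=30) as reply:
#     print(json.load(reply))
```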

JavaScript

import { Generation } from '@alicloud/dashscope';

const client = new Generation({
  apiKey: process.env.DASHSCOPE_API_KEY
});

async function callQwen3Next() {
  try {
    const response = await client.call({
      model: 'qwen3-next-80b-a3b-instruct',
      prompt: 'Explain the benefits of sparse MoE architectures.',
      max_tokens: 200,
      temperature: 0.7
    });
    
    console.log(response.output.text);
  } catch (error) {
    console.error('Error:', error.message);
  }
}

callQwen3Next();

1. Get API Key

Sign up for Alibaba Cloud and obtain your DashScope API key.

2. Install SDK

Install the SDK with pip or npm, or call the REST API directly with curl.

3. Start Building

Make API calls and start building on the 80B-parameter sparse model.

Build Smarter Applications

From long-document analysis to agentic workflows, Qwen3-Next powers next-generation AI applications

📚 Long-Document Understanding

Process entire books, legal contracts, or research papers in a single 262K context window. Eliminate chunking limitations and maintain global coherence.

Example: Analyze 300-page contracts for compliance issues and generate executive summaries.
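
Whether a given document fits in one pass can be estimated before sending it. The sketch below does a rough budget check for the 300-page contract example. The exact window size of 262,144 tokens, the words-per-page figure, the tokens-per-word ratio, and the output reserve are all assumptions; real counts depend on the tokenizer and the document.

```python
# Assumed exact size behind the "262K" figure.
CONTEXT_WINDOW = 262_144


def estimate_tokens(pages, words_per_page=500, tokens_per_word=1.3):
    """Rough token estimate; both ratios are assumptions, not measured values."""
    return round(pages * words_per_page * tokens_per_word)


def fits_in_context(pages, reserve_for_output=4_000):
    """True if the estimated prompt plus an output reserve fits in the window."""
    return estimate_tokens(pages) + reserve_for_output <= CONTEXT_WINDOW


for pages in (300, 500):
    tokens = estimate_tokens(pages)
    verdict = "fits" if fits_in_context(pages) else "needs splitting"
    print(f"{pages} pages: about {tokens:,} tokens, {verdict}")
```

For a production check, counting tokens with the model's own tokenizer is more reliable than a words-per-page estimate.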

🤖 Agentic Tool Use

Build autonomous agents that integrate with APIs, databases, and external tools for complex multi-step workflows and reasoning tasks.

Example: Automated research assistant that retrieves → analyzes → computes → writes → reviews → reports.

🏢 Enterprise Compliance

Deploy securely in your infrastructure to meet governance, audit, and data privacy requirements with full control over your AI workloads.

Example: Private cloud deployment for sensitive financial document processing.

Ready to Experience the Future of AI?

Join thousands of developers building with Qwen3-Next. Start with an 80B-parameter model that activates only 3B parameters per token.

✨ Free tier includes 10,000 API calls per month. No credit card required.