Question 1

Which models are available in Tongyi Qianwen?

Accepted Answer

Official versions include Max, Flash, Omni, Omni-Realtime, and QVQ, optimized for deep reasoning, speed, multimodal, or visual inference respectively.

Question 2

How do I call the API?

Accepted Answer

Activate Alibaba Cloud Bailian service to get an HTTPS endpoint; Python, Go, and Java SDKs are provided for integration within ten minutes.

Question 3

How is pricing calculated?

Accepted Answer

Pay-as-you-go by input + output tokens; e.g., Flash costs 0.00015 yuan/1k input tokens and 0.0015 yuan/1k output tokens up to 128k context, with 1 million free tokens for new users.

Question 4

Does it support private deployment?

Accepted Answer

Yes, dedicated cloud and on-premise appliance deployments keep data local, meeting strict compliance requirements for finance and government sectors.

Question 5

What is the maximum context length?

Accepted Answer

Flash supports 1 million tokens and Max supports 250k tokens; enterprises can choose according to business needs.

Question 6

How is content safety ensured?

Accepted Answer

Built-in Alibaba Cloud Green Network moderation, TLS 1.3 encrypted transmission, and support for Class-III cybersecurity protection and national cryptography standards.

Question 7

Can I fine-tune the model?

Accepted Answer

The console allows lightweight fine-tuning with private corpora and integration with vector databases for RAG to improve domain-specific accuracy.

Tongyi Qianwen

Features of Tongyi Qianwen

Use Cases of Tongyi Qianwen

FAQ about Tongyi Qianwen

QWhich models are available in Tongyi Qianwen?

QHow do I call the API?

QHow is pricing calculated?

QDoes it support private deployment?

QWhat is the maximum context length?

QHow is content safety ensured?

QCan I fine-tune the model?