Qwen2-VL 72B Instruct

32K Context
0.4/M Input Tokens
0.4/M Output Tokens
0.578/K Image Tokens

Qwen
Text image 2 text
02 Dec, 2024

Chat with Model

Qwen2 VL 72B is a multimodal LLM from the Qwen Team with the following key enhancements:

SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc.
Understanding videos of 20min+: Qwen2-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc.
Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions.
Multilingual Support: to serve global users, besides English and Chinese, Qwen2-VL now supports the understanding of texts in different languages inside images, including most European languages, Japanese, Korean, Arabic, Vietnamese, etc.

For more details, see this blog post and GitHub repo.

Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

Qwen 2 7B Instruct

Text 2 text

Qwen2 7B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and gro ...

Qwen 32K context $0.054/M input tokens $0.054/M output tokens

FREE

Qwen 2 7B Instruct (free)

Text 2 text

# Free

Qwen 32K context $0 input tokens $0 output tokens

Qwen2-VL 7B Instruct

Text image 2 text

Qwen2 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements:SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance o...

Qwen 32K context $0.1/M input tokens $0.1/M output tokens $0.144/K image tokens

Qwen2.5 72B Instruct

Text 2 text

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2:Significantly more knowledge and has greatly improved capabilities in coding a...

Qwen 128K context $0.35/M input tokens $0.4/M output tokens

Qwen2.5 7B Instruct

Text 2 text

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2:Significantly more knowledge and has greatly improved capabilities in coding an...

Qwen 128K context $0.27/M input tokens $0.27/M output tokens

FREE

Qwen: Qwen VL Plus (free)

Text image 2 text

# Free

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pix ...

Qwen 7.32K context $0 input tokens $0 output tokens

Qwen2-VL 72B Instruct

Tags :

Share :

Related Posts

Qwen 2 7B Instruct

Qwen 2 7B Instruct (free)

Qwen2-VL 7B Instruct

Qwen2.5 72B Instruct

Qwen2.5 7B Instruct

Qwen: Qwen VL Plus (free)