What are the differences?
Between the Qwen1.5 72B Chat and Llama 3 8B Instruct LLM models, which follows best instructions?
Compare

to

Qwen1.5 72B Chat
Alibaba Cloud
Llama 3 8B Instruct
Meta
Overview
![]() Qwen1.5 72B Chat | Llama 3 8B Instruct | |
|---|---|---|
Provider Organization responsible for this model. | ![]() Alibaba Cloud | Meta |
Input Context Window The total number of tokens that the input context window can accommodate. | 33K | 8K |
Maximum Output Tokens The maximum number of tokens this model can produce in one operation. | Not specified. | 2K |
Release Date The initial release date of the model. | February 5, 2024 21 months ago | April 18, 2024 19 months ago |
Knowledge Cutoff The latest date for which the information provided is considered reliable and current. | 2024/2 |
Pricing
![]() Qwen1.5 72B Chat | Llama 3 8B Instruct | |
|---|---|---|
Input Costs associated with the data input to the model. | Not specified. | Not specified. |
Output Costs associated with the tokens produced by the model. | Not specified. | Not specified. |
Benchmark
![]() Qwen1.5 72B Chat | Llama 3 8B Instruct | |
|---|---|---|
MMLU Assesses LLMs' ability to acquire knowledge in zero-shot and few-shot scenarios. | 77.44 | 68.4 |
MMMU Comprehensive benchmark covering multiple disciplines and modalities. | ||
HellaSwag A demanding benchmark for sentence completion tasks. | 86.42 | |
Arena Elo Ranking metric for LMSYS Chatbot Arena. | 1147 | 1146 |
5000+ teams use Lunary to build reliable AI applications
Compare more models
Building an AI chatbot?
Open-source GenAI monitoring, prompt management, and magic.
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking



