What are the differences?
Between the Qwen1.5 14B Chat and Starling LM 7B Alpha LLM models, which follows best instructions?
Compare

to


Qwen1.5 14B Chat
Alibaba Cloud

Starling LM 7B Alpha
Berkeley Nest
Overview
![]() Qwen1.5 14B Chat | ![]() Starling LM 7B Alpha | |
|---|---|---|
Provider Organization responsible for this model. | ![]() Alibaba Cloud | ![]() Berkeley Nest |
Input Context Window The total number of tokens that the input context window can accommodate. | 33K | 3.1K |
Maximum Output Tokens The maximum number of tokens this model can produce in one operation. | Not specified. | 4.1K |
Release Date The initial release date of the model. | February 5, 2024 21 months ago | November 15, 2023 24 months ago |
Knowledge Cutoff The latest date for which the information provided is considered reliable and current. | 2024/2 |
Pricing
![]() Qwen1.5 14B Chat | ![]() Starling LM 7B Alpha | |
|---|---|---|
Input Costs associated with the data input to the model. | Not specified. | Not specified. |
Output Costs associated with the tokens produced by the model. | Not specified. | Not specified. |
Benchmark
![]() Qwen1.5 14B Chat | ![]() Starling LM 7B Alpha | |
|---|---|---|
MMLU Assesses LLMs' ability to acquire knowledge in zero-shot and few-shot scenarios. | 63.9 | |
MMMU Comprehensive benchmark covering multiple disciplines and modalities. | ||
HellaSwag A demanding benchmark for sentence completion tasks. | ||
Arena Elo Ranking metric for LMSYS Chatbot Arena. | 1108 | 1088 |
5000+ teams use Lunary to build reliable AI applications
Compare more models
Building an AI chatbot?
Open-source GenAI monitoring, prompt management, and magic.
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking


