What are the differences?
Between the Starling LM 7B Beta and Vicuna 33B LLM models, which follows best instructions?
Compare

to


Starling LM 7B Beta
Berkeley Nest

Vicuna 33B
LMSYS
Overview
![]() Starling LM 7B Beta | ![]() Vicuna 33B | |
|---|---|---|
Provider Organization responsible for this model. | ![]() Berkeley Nest | ![]() LMSYS |
Input Context Window The total number of tokens that the input context window can accommodate. | 3.1K | 2K |
Maximum Output Tokens The maximum number of tokens this model can produce in one operation. | 4.1K | Not specified. |
Release Date The initial release date of the model. | November 15, 2023 24 months ago | June 22, 2023 29 months ago |
Knowledge Cutoff The latest date for which the information provided is considered reliable and current. | 2024/3 |
Pricing
![]() Starling LM 7B Beta | ![]() Vicuna 33B | |
|---|---|---|
Input Costs associated with the data input to the model. | Not specified. | Not specified. |
Output Costs associated with the tokens produced by the model. | Not specified. | Not specified. |
Benchmark
![]() Starling LM 7B Beta | ![]() Vicuna 33B | |
|---|---|---|
MMLU Assesses LLMs' ability to acquire knowledge in zero-shot and few-shot scenarios. | 63.9 | 59.2 |
MMMU Comprehensive benchmark covering multiple disciplines and modalities. | ||
HellaSwag A demanding benchmark for sentence completion tasks. | ||
Arena Elo Ranking metric for LMSYS Chatbot Arena. | 1119 | 1090 |
5000+ teams use Lunary to build reliable AI applications
Building an AI chatbot?
Open-source GenAI monitoring, prompt management, and magic.
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking
Open Source
Self Hostable
1-line Integration
Prompt Templates
Chat Replays
Analytics
Topic Classification
Agent Tracing
Custom Dashboards
Score LLM responses
PII Masking
Feedback Tracking




