What are the differences?

Between GPT-4 0613 and Claude 3 Opus, which model follows instructions best?

Overview

	GPT-4 0613	Claude 3 Opus
Provider (organization responsible for the model)	OpenAI	Anthropic
Input Context Window (total number of tokens the input context can hold)	8.2K	200K
Maximum Output Tokens (maximum number of tokens the model can produce in one response)	8.2K	4.1K
Release Date (initial release date of the model)	June 13, 2023	March 4, 2024
Knowledge Cutoff (latest date for which the model's information is considered reliable)	September 2021	August 2023

Pricing

	GPT-4 0613	Claude 3 Opus
Input (cost of tokens sent to the model)	$0.03 per 1K tokens ($30.00 per 1M)	$15.00 per 1M tokens
Output (cost of tokens produced by the model)	$0.06 per 1K tokens ($60.00 per 1M)	$75.00 per 1M tokens
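
The two price columns are easier to compare on a common unit. The sketch below assumes the GPT-4 0613 figures are quoted per 1K tokens and the Claude 3 Opus figures per 1M tokens, which matches how the two providers published prices but is not stated on this page. The `Pricing` class and the example token counts are hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Token prices in USD per one million tokens."""
    input_per_million: float
    output_per_million: float

    def request_cost(self, input_tokens: int, output_tokens: int) -> float:
        """Return the USD cost of one request with the given token counts."""
        total = (input_tokens * self.input_per_million
                 + output_tokens * self.output_per_million)
        return total / TOKENS_PER_MILLION


# Assumption: table values $0.03 / $0.06 are per 1K tokens, i.e. $30 / $60 per 1M.
GPT4_0613 = Pricing(input_per_million=30.00, output_per_million=60.00)
# Assumption: table values $15.00 / $75.00 are already per 1M tokens.
CLAUDE_3_OPUS = Pricing(input_per_million=15.00, output_per_million=75.00)

if __name__ == "__main__":
    # Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
    for name, pricing in (("GPT-4 0613", GPT4_0613), ("Claude 3 Opus", CLAUDE_3_OPUS)):
        print(f"{name}: ${pricing.request_cost(2_000, 500):.4f}")
```

Under these assumptions, Claude 3 Opus is cheaper per input token and more expensive per output token, so which model costs less depends on the ratio of prompt length to response length.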

Benchmark

	GPT-4 0613	Claude 3 Opus
MMLU (multitask knowledge and problem solving, evaluated in zero-shot and few-shot settings)	not listed	88.2
MMMU (multi-discipline, multimodal understanding and reasoning)	not listed	59.4
HellaSwag (commonsense sentence completion)	not listed	95.4
Arena Elo (rating from the LMSYS Chatbot Arena)	1161	1251
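
An Elo gap can be read as an expected head-to-head preference rate. The sketch below applies the standard Elo expected-score formula, assuming the Arena ratings use the conventional base-10 logistic with a 400-point scale. The function name `elo_expected_score` is hypothetical, and the result is a rough interpretation of the two ratings in the table, not a figure reported by the leaderboard.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo model.

    Assumes the conventional base-10 logistic curve with a 400-point scale.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


if __name__ == "__main__":
    claude_3_opus = 1251  # Arena Elo from the table
    gpt4_0613 = 1161      # Arena Elo from the table
    p = elo_expected_score(claude_3_opus, gpt4_0613)
    print(f"Expected preference rate for Claude 3 Opus: {p:.3f}")
```

With a 90-point gap, the formula gives Claude 3 Opus an expected preference rate of about 0.63 in a direct comparison with GPT-4 0613.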
