Articles
Showing 1-12 of 66 articles

Claude Sonnet 5: strong agentic performance at a higher cost per task
June 30, 2026

Measuring time per task in AA-Briefcase
June 24, 2026

Announcing the Artificial Analysis Speech to Speech Index
June 23, 2026

Announcing AA-Briefcase: a frontier knowledge work evaluation
June 18, 2026

GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index
June 16, 2026

Artificial Analysis Intelligence Index v4.1: a shift toward agentic workloads
June 15, 2026

First results from AA-AgentPerf: the hardware benchmark for the agent era
June 12, 2026

Benchmarking guardrail models for safety, refusal, and latency
June 11, 2026

Claude Fable 5 Launches at #1 on the Artificial Analysis Intelligence Index
June 9, 2026

Claude Fable 5: the first public Mythos-class model
June 9, 2026

North Mini Code: Cohere's small coding-focused MoE model
June 9, 2026

MiniMax-M3: Leading open weights model, once the weights are released
June 8, 2026