LLM benchmarks, explained

An open resource for understanding and tracking LLM performance across different tasks and domains.