Pinned Loading
Repositories
Showing 10 of 238 repositories
- evals Public
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
openai/evals’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…