LightEval Interoperability Test Fixtures
Status: Active (2026-03-13)
This document describes the deterministic fixture set used to exercise the contract between
LightEvalConfigService, the LiteLLM factory, and the lighteval endpoint litellm CLI/Python
entrypoints.
Objectives
- validate generated
config.yamlfiles for both CLI and Python LightEval paths - cover success, retry, and failure semantics
- avoid external API calls in CI and local smoke runs
- keep contract fixtures reproducible
Fixture Layout
The provider fixture points at a local OpenAI-compatible test gateway. Contract tests may rewrite the base URL dynamically when they start an ephemeral local instance.
Test Matrix
Test Gateway Behavior
The local test gateway is OpenAI-compatible and keyed by model name:
- normal model names succeed
- names containing
fail-oncefail on the first call, then succeed - names containing
always-failfail every time
This keeps retry coverage deterministic without introducing alternate runtime execution branches in production code.
Fixture Regeneration
Refresh the fixture set with:
The builder updates:
- provider JSON
- dataset package
- golden LightEval config
- golden CLI stdout
CI Usage
pytest -m lighteval --no-covruns in CI and nightly flows- Docker integration remains independent but uses the same local-gateway approach
- failures should upload logs and invocation metadata for debugging
