⇩ Markdown

evals evaluate system behavior that is closely tied to AI functionality