Skip to content

Deterministic LLM-judge metrics

Latest
Compare
Choose a tag to compare
@penguine-ip penguine-ip released this 31 Jan 10:44
· 9 commits to main since this release

Here are the new features we're bringing to you in the latest release:
💥 Releasing beta for Deep, Acyclic, Graph. A new deterministic way in deepeval to build decision trees for deterministic outputs for LLM evaluation: https://docs.confident-ai.com/docs/metrics-dag
⚙️ Open-sourcing all LLM red teaming vulnerabilities: https://docs.confident-ai.com/docs/red-teaming-introduction
🪄 Fixes to synthetic dataset generation pipeline