llm-eval
5 AI tools
Advertisement
AI tools5
DeepEval
Open-source pytest-style evaluation framework for LLMs and agents
AI Directory
→
LangWatch
Open-source LLMOps platform for tracing, evaluation and optimization
AI Directory
→
Maxim AI
End-to-end evaluation, simulation and observability platform for AI agents
AI Directory
→
Opik
Open-source platform to debug, evaluate and monitor LLM and agent apps
AI Directory
→
Promptfoo
Open-source evaluation and red-teaming for LLM apps, agents and RAG.
AI Directory
→
Advertisement