Unleash the
Power of AI Oversight
With a few quick steps, send your MLflow experiment traces from Databricks to Patronus AI’s OTel Collector for effective, detailed insights into your ML workflows.


Patronus AI
We are a leading AI evaluation and optimization company. Our research-backed product offerings enable AI engineers to optimize their AI products, access SOTA evaluation models, and automatically detect LLM performance issues across 50+ modes.
Our platform consists of a suite of features providing end-to-end solution coverage to confidently deploy LLM applications at scale.
Percival,
by Patronus AI
Percival is a SOTA agent evaluator and is capable of detecting 20+ failure modes in agentic traces and suggesting optimizations for agentic systems, spanning reasoning, planning & coordination, and system execution errors.

Key Benefits
we leverage the power of Percival to provide:
1. Real-time Monitoring
See current, in-progress performance of your MLflow experiments
2. Evaluation Metrics
Automatic extraction of performance insights from your traces
3. Alerting
Instant and proactive anomaly detection to inform you of any degradation in your workflow
4. No Extra Setup
Integrate with provided environment variables with no additional code required
Why Us
We take a research-first approach
The team at Patronus has been testing LLMs since before the GenAI boom
Our approach is state-of-the-art → +18% better at detecting hallucinations than other OpenAI LLM-based evaluators*
We offer production-ready LLM evaluators for general, custom, and RAG-enabled use cases
Our off-the-shelf evaluators cover your bases (e.g. toxicity, PII leakage) while our custom evaluators cover the rest (e.g., brand alignment)
We support real-time evaluation with fast API response times (as low as 100ms)
You can start using the Patronus API with a single line of code
We offer flexible hosting options with enterprise-grade security
No need to worry about managing servers with our Cloud Hosted solution
Our On-Premise offering is also available for customers with the strictest data privacy needs
You can rest assured that your proprietary data will never be shared outside our organization
We get vetted by third-party security companies yearly
We are trusted by a strong array of customers and partners
Patronus is the only company to provide an SLA guarantee of 90% alignment between our evaluators and human evaluators
Our customers include OpenAI, HP, and Pearson
Our partners include AWS, Databricks, and MongoDB

Get Started Today
Simple integration documentation to help you get the most out of trace evaluation with Databricks and Patronus AI