Browse All AI Tools | Find AI List

All ToolsEvaluation

7 results

📂

Evaluation

Browse AI tools related to Evaluation.

Core Capabilities

Patronus AI

AI Infrastructure

Simulating the World's Intelligence to accelerate progress toward human-aligned AGI

Updated 13d ago

Has API

PricingFree

Free

LLM Testing

Hallucination Detection

AI Evaluation

Compare

Stanford HELM

AI Evaluation & Benchmarking

The industry-standard framework for holistic, multi-metric evaluation of large language models.

Updated 13d ago

Has API

PricingFree

Free

Automated Model Benchmarking

Bias and Toxicity Detection

Robustness Testing

Compare

Verity AI

Development & IT

Enterprise Hallucination Detection and Factual Verification Platform

Updated 13d ago

Has API

PricingEnterprise

Free to $850/yr

Verify factual accuracy

Monitor API outputs

Detect AI hallucinations

Compare

Braintrust (bt)

Development

The enterprise-grade stack for evaluating, logging, and refining AI applications with 10x developer velocity.

Updated 13d ago

Has API

PricingFreemium

Free to $100/yr

Automated AI Evaluation

Production LLM Logging

Dataset Management

Compare

Argilla

Development

The open-source data curation platform for LLMs and Generative AI alignment.

Updated 13d ago

Has API

PricingOpen Source

Free to $30/yr

RLHF Data Collection

Model Evaluation

DPO Preference Ranking

Compare

Inspect

AI Safety & Evaluation

The open-source framework for rigorous large language model evaluation and safety testing.

Updated 13d ago

Has API

PricingFree

Free

LLM Benchmarking

Safety Red Teaming

Agentic Workflow Testing

Compare

Tonic Validate

AI Evaluation

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

Updated 13d ago

Has API

PricingFree

Free

RAG evaluation

Performance monitoring

Experiment tracking

Compare