OpenSourceProjects logo
agenta logo

agentaThe open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

4,082 stars
516 forks
TypeScript
NOASSERTION
agents
evaluation
llm-as-a-judge
llm-evaluation
llm-framework
llm-monitoring
agenta screenshot

agenta

Agenta is an open-source LLMOps platform that streamlines the entire lifecycle of building production-grade LLM applications. It combines prompt management, evaluation, and observability into a single integrated platform, enabling engineering and product teams to collaborate efficiently and deploy reliable AI applications faster.

Key Features

  • Prompt Management & Playground : Collaborate on prompt engineering with interactive side-by-side comparisons, support for 50+ LLM models, version control with branching, and complex configuration schemas for subject matter experts
  • LLM Evaluation : Systematically evaluate applications with flexible testsets, pre-built and custom evaluators, LLM-as-judge capabilities, and human feedback integration accessible via UI and API
  • LLM Observability : Monitor production applications with cost and performance tracking, detailed LLM tracing for debugging, OpenTelemetry-native standards, and pre-built integrations for popular models and frameworks

Use Cases

  • Prompt Optimization : Test and iterate on prompts with teams before deploying to production
  • Quality Assurance : Evaluate LLM outputs systematically using both automated and human-driven feedback mechanisms
  • Production Monitoring : Track performance, latency, costs, and behavior of LLM applications in real-world deployments

Who Is It For

Agenta is built for engineering teams and product managers who need to experiment with LLMs safely, ensure quality through rigorous evaluation, and maintain visibility into production LLM applications. It's particularly valuable for organizations prioritizing collaboration between technical teams and subject matter experts.

Trending Open Source Projects