langwatch

langwatch

The platform for LLM evaluations and AI agent testing

Stars: 2824

Visit
 screenshot

LangWatch is a monitoring and analytics platform designed to track, visualize, and analyze interactions with Large Language Models (LLMs). It offers real-time telemetry to optimize LLM cost and latency, a user-friendly interface for deep insights into LLM behavior, user analytics for engagement metrics, detailed debugging capabilities, and guardrails to monitor LLM outputs for issues like PII leaks and toxic language. The platform supports OpenAI and LangChain integrations, simplifying the process of tracing LLM calls and generating API keys for usage. LangWatch also provides documentation for easy integration and self-hosting options for interested users.

README:

012d1688-24ae-4759-ae70-5f8f81a13c0e

chat on Discord langwatch Python package on PyPi langwatch npm package follow on X

Why LangWatch?

The platform for LLM evaluations and AI agent testing. We help teams test, simulate, evaluate, and monitor LLM-powered agents end-to-end — before release and in production. Built for teams that need regression testing, simulations, and production observability without building custom tooling.

LangWatch gives you full visibility into agent behavior and the tools to systematically improve reliability, performance, and cost, while keeping you in control of your AI system

Getting Started

Cloud ☁️

The easiest way to get started with LangWatch.

Create a free account → create a project → get started/ copy your API key.

Local setup 💻

Get up and running on your own machine using docker compose:

git clone https://github.com/langwatch/langwatch.git
cd langwatch
cp langwatch/.env.example langwatch/.env
docker compose up -d --wait --build

Once running, LangWatch will be available at http://localhost:5560, where you can create your first project and API key.

Deployment options ⚓️

Run LangWatch on your own infrastructure:

  • Docker Compose - Run LangWatch on your own machine.
  • Kubernetes (Helm) - Run LangWatch on a Kubernetes cluster using Helm.
  • OnPrem - Cloud-specific setups for AWS, Google Cloud, and Azure.
Hybrid (OnPrem data) 🔀

For companies that have strict data residency and control requirements, without needing to go fully on-prem.

Read more about it on our docs.

Local Development 👩‍💻

You can also run LangWatch locally without docker to develop and help contribute to the project.

Start just the databases using docker and leave it running:

docker compose up redis postgres opensearch

Then, on another terminal, install the dependencies and start LangWatch:

make install
make start

🚀 Quick Start

Ship safer agents in minutes. Create a free account, then dive into these guides:

🗺️ Integrations

LangWatch builds and maintains several integrations listed below. Our tracing platform is built on top of OpenTelemetry, so we support any OpenTelemetry-compatible library out of the box.

Frameworks:
LangChain · LangGraph · Vercel AI SDK · Mastra · CrewAI · Google ADK

Model Providers:
OpenAI · Anthropic · Azure · Google Cloud · AWS · Groq · Ollama

Platforms

LangFlow · Flowise · n8n

and many more…

Are you using a platform that could benefit from a direct LangWatch integration? We'd love to hear from you, please fill out this very quick form.

💬 Support

Have questions or need help? We're here to support you in multiple ways:

  • Documentation: Our comprehensive documentation covers everything from getting started to advanced features.
  • Discord Community: Join our Discord server for real-time help from our team and community.
  • X (Twitter): Follow us on X for updates and announcements.
  • GitHub Issues: Report bugs or request features through our GitHub repository.
  • Enterprise Support: Enterprise customers receive priority support with dedicated response times. Our pricing page contains more information.

🤝 Collaborating

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Please read our Contribution Guidelines for details on our code of conduct, and the process for submitting pull requests.

✍️ License

Please read our LICENSE.md file.

👮‍♀️ Security + Compliance

As a platform that has access to data that is highly likely to be sensitive, we take security incredibly seriously and treat it as a core part of our culture.

Legal Framework Current Status
GDPR Compliant. DPA available upon request.
ISO 27001 Certified. Certification report available upon request on our Enterprise plan.

Please refer to our Security page for more information. Contact us at [email protected] if you have any further questions.

Vulnerability Disclosure

If you need to do a responsible disclosure of a security vulnerability, you may do so by email to [email protected], or if you prefer you can reach out to one of our team privately on Discord.

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for langwatch

Similar Open Source Tools

For similar tasks

For similar jobs