helicone

helicone

๐ŸงŠ Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 ๐Ÿ“

Stars: 2988

Visit
 screenshot

Helicone is an open-source observability platform designed for Language Learning Models (LLMs). It logs requests to OpenAI in a user-friendly UI, offers caching, rate limits, and retries, tracks costs and latencies, provides a playground for iterating on prompts and chat conversations, supports collaboration, and will soon have APIs for feedback and evaluation. The platform is deployed on Cloudflare and consists of services like Web (NextJs), Worker (Cloudflare Workers), Jawn (Express), Supabase, and ClickHouse. Users can interact with Helicone locally by setting up the required services and environment variables. The platform encourages contributions and provides resources for learning, documentation, and integrations.

README:

helicone logo

๐Ÿ” Observability ๐Ÿ•ธ๏ธ Agent Tracing ๐Ÿ’ฌ Prompt Management
๐Ÿ“Š Evaluations ๐Ÿ“š Datasets ๐ŸŽ›๏ธ Fine-tuning

Open Source

Docs โ€ข Discord โ€ข Roadmap โ€ข Changelog โ€ข Bug reports

See Helicone in Action! (Free)

Contributors GitHub stars GitHub commit activity GitHub closed issues Y Combinator

Helicone is the all-in-one, open-source LLM developer platform

  • ๐Ÿ”Œ Integrate: One-line of code to log all your requests to OpenAI, Anthropic, LangChain, Gemini, TogetherAI, LlamaIndex, LiteLLM, OpenRouter, and more
  • ๐Ÿ“Š Observe: Inspect and debug traces & sessions for agents, chatbots, document processing pipelines, and more
  • ๐Ÿ“ˆ Analyze: Track metrics like cost, latency, quality, and more. Export to PostHog in one-line for custom dashboards
  • ๐ŸŽฎ Playground: Rapidly test and iterate on prompts, sessions and traces in our UI
  • ๐Ÿง  Prompt Management: Version and experiment with prompts using production data. Your prompts remain under your control, always accessible.
  • ๐Ÿ” Evaluate: Automatically run evals on traces or sessions using the latest platforms: LastMile or Ragas (more coming soon)
  • ๐ŸŽ›๏ธ Fine-tune: Fine-tune with one of our fine-tuning partners: OpenPipe or Autonomi (more coming soon)
  • ๐Ÿ›œ Gateway: Caching, custom rate limits, LLM security, and more with our gateway
  • ๐Ÿ›ก๏ธ Enterprise Ready: SOC 2 and GDPR compliant

๐ŸŽ Generous monthly free tier (100k requests/month) - No credit card required!

Quick Start โšก๏ธ One line of code

  1. Get your write-only API key by signing up here.

  2. Update only the baseURL in your code:

    import OpenAI from "openai";
    
    const openai = new OpenAI({
      apiKey: process.env.OPENAI_API_KEY,
      baseURL: `https://oai.helicone.ai/v1/${process.env.HELICONE_API_KEY}`,
    });

or - use headers for more secure environments

import OpenAI from "openai";

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: `https://oai.helicone.ai/v1`,
  defaultHeaders: {
   "Helicone-Auth": `Bearer ${process.env.HELICONE_API_KEY}`,
  },
});
  1. ๐ŸŽ‰ You're all set! View your logs at Helicone.

This quick start uses Helicone Cloud with OpenAI. For other providers or self-hosted options, see below.

Get Started For Free

Helicone Cloud (Recommended)

The fastest and most reliable way to get started with Helicone. Get started for free at Helicone US or Helicone EU. Your first 100k requests are free every month, after which you'll pay based on usage. Try our demo to see Helicone in action!

Integrations: View our supported integrations.

Latency Concerns: Helicone's Cloud offering is deployed on Cloudflare workers and ensures the lowest latency (~10ms) add-on to your API requests. View our latency benchmarks.

Self-Hosting Open Source LLM Observability with Helicone

Docker

Helicone is simple to self-host and update. To get started locally, just use our docker-compose file.

Pre-Request:

  • Copy the shared directory to the valhalla directory
  • Create a valhalla folder in the valhalla directory and put /valhalla/jawn in it
# Clone the repository
git clone https://github.com/Helicone/helicone.git
cd docker
cp .env.example .env

# Start the services
docker compose up

Helm

For Enterprise workloads, we also have a production-ready Helm chart available. To access, contact us at [email protected].

Manual (Not Recommended)

Manual deployment is not recommended. Please use Docker or Helm. If you must, follow the instructions here.

Architecture

Helicone is comprised of five services:

  • Web: Frontend Platform (NextJS)
  • Worker: Proxy Logging (Cloudflare Workers)
  • Jawn: Dedicated Server for serving collecting logs (Express + Tsoa)
  • Supabase: Application Database and Auth
  • ClickHouse: Analytics Database
  • Minio: Object Storage for logs.

LLM Observability Integrations

Main Integrations

Integration Supports Description
Generic Gateway Python, Node.js, Python w/package, LangChain JS, LangChain, cURL Flexible integration method for various LLM providers
Async Logging (OpenLLMetry) JS/TS, Python Asynchronous logging for multiple LLM platforms
OpenAI JS/TS, Python -
Azure OpenAI JS/TS, Python -
Anthropic JS/TS, Python -
Ollama JS/TS Run and use large language models locally
AWS Bedrock JS/TS -
Gemini API JS/TS -
Gemini Vertex AI JS/TS Gemini models on Google Cloud's Vertex AI
Vercel AI JS/TS AI SDK for building AI-powered applications
Anyscale JS/TS, Python -
TogetherAI JS/TS, Python -
Hyperbolic JS/TS, Python High-performance AI inference platform
Groq JS/TS, Python High-performance models
DeepInfra JS/TS, Python Serverless AI inference for various models
OpenRouter JS/TS, Python Unified API for multiple AI models
LiteLLM JS/TS, Python Proxy server supporting multiple LLM providers
Fireworks AI JS/TS, Python Fast inference API for open-source LLMs

Supported Frameworks

Framework Supports Description
LangChain JS/TS, Python -
LlamaIndex Python Framework for building LLM-powered data applications
CrewAI - Framework for orchestrating role-playing AI agents
Big-AGI JS/TS Generative AI suite
ModelFusion JS/TS Abstraction layer for integrating AI models into JavaScript and TypeScript applications

Other Integrations

Integration Description
PostHog Product analytics platform. Build custom dashboards.
RAGAS Evaluation framework for retrieval-augmented generation
Open WebUI Web interface for interacting with local LLMs
MetaGPT Multi-agent framework
Open Devin AI software engineer
Mem0 EmbedChain Framework for building RAG applications
Dify LLMOps platform for AI-native application development

This list may be out of date. Don't see your provider or framework? Check out the latest integrations in our docs. If not found there, request a new integration by contacting [email protected].

Community ๐ŸŒ

Learn this repo with Greptile

learnthisrepo.com/helicone |

Contributing

We โค๏ธ our contributors! We warmly welcome contributions for documentation, integrations, costs, and feature requests.

License

Helicone is licensed under the Apache v2.0 License.

Additional Resources

For more information, visit our documentation.

Contributors

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for helicone

Similar Open Source Tools

For similar tasks

For similar jobs