Langfuse Roadmap

Langfuse is open source and we want to be fully transparent what we’re working on and what’s next. This roadmap is a living document and we’ll update it as we make progress.

Your feedback is highly appreciated. Feel like something is missing? Add new ideas on GitHub or vote on existing ones. Both are a great way to contribute to Langfuse and help us understand what is important to you.

🚀 Released

10 most recent changelog items:

PostHog integration is now GA(Dec 21, 2024)
Improved cost tracking(Dec 20, 2024)
Langfuse v3 stable release(Dec 9, 2024)
Extensive example notebook for JS/TS SDK(Dec 4, 2024)
SSO with GitHub Enterprise and Keycloak(Dec 3, 2024)
New documentation for Google Vertex AI and Gemini tracing(Dec 2, 2024)
Google Vertex AI support for LLM Playground and Evaluations incl. Gemini models(Nov 28, 2024)
Prompt Experiments on Datasets with LLM-as-a-Judge Evaluations(Nov 22, 2024)
All new Datasets, Experimentation and Evaluation documentation(Nov 21, 2024)
Full multi-modal support, including audio, images, and attachments(Nov 20, 2024)

Subscribe to our mailing list to get occasional email updates about new features.

🚧 In progress

Various improvements, including:
- Use “log levels” to simplify large agent traces, filter by them in UI
- Typesafe sdk methods for all API routes
- Commit messages on prompts
- Duplicate prompts to edit name
- User screen that’s more helpful to summarize metrics
- Evaluation runs on historical data
- Eval execution states across Langfuse Tracing and Datasets
- Use JSON path to extract structured data from traces for llm-as-judge evals
- Save table filters as views
- “Peek view” for tables, preview trace without switching the page
- New observation table
- …

🔮 Planned

Agent Observability: Improve native support for agentic applications
- Visualizations of agent executions, e.g. Graph (#2509)
- Filtering for tool calls used within an execution
- Opinionated agent evaluations
Tracing Enhancements
- Environments: Native support for separating different environments within a project (#5005)
- Realtime LLM: Bespoke traces for datastreams of realtime APIs, especially multi-modal (#4757)
- Open Telemetry: Add support for OpenTelemetry-based instrumentations by adding an OpenTelemetry exporter/collector (#2509)
Prompt Management
- Diff view of prompt changes (#1105)
- Custom RBAC for prompt change management (#4480)
- LLM-assisted prompt engineering
- Prompt composability to increase reuse of common prompts (#1660)
- Folders (#4874)
- Native support for tool calls (#2624)
- Placeholders for chat messages (#2210)
Evaluation
- Non-LLM evaluators: classifiers, custom code, regex, etc.
- Evals on repeated spans/observations within a trace
- Comparisons of different evaluation metrics
- Session-level scores (#2728)
Query 100s of millions of traces:
- Full-text search: Search across the inputs/outputs of traces, sessions, and datasets (#939)
- Self-serve dashboards: Create and share custom dashboards based on metrics extracted from traces (#1011)
Data Platform
- Webhooks to subscribe to changes within your Langfuse project (#1033)
- Batched exports to own storage buckets for downstream use and archival
- Rule-based routing between Langfuse product features
Other
- UI/UX: improvements of all core product features
- Onboarding: simplify getting started with Langfuse when not reading the docs
- Data retention: Project-level data retention policies (#2268)

🙏 Feature requests and bug reports

The best way to support Langfuse is to share your feedback, report bugs, and upvote on ideas suggested by others.

Langfuse Roadmap

🚀 Released

🚧 In progress

🔮 Planned

🙏 Feature requests and bug reports

Feature requests

Bug reports

Was this page useful?

Questions? We're here to help

Subscribe to updates