Langfuse Roadmap
Langfuse is open source, and we want to be fully transparent about what we’re working on and what’s next. This roadmap is a living document, and we’ll update it as we make progress.
Your feedback is highly appreciated. Feel like something is missing? Add new ideas on GitHub or vote on existing ones. Both are a great way to contribute to Langfuse and help us understand what is important to you.
🚀 Released
10 most recent changelog items:
- PostHog integration is now GA
- Improved cost tracking
- Langfuse v3 stable release
- Extensive example notebook for JS/TS SDK
- SSO with GitHub Enterprise and Keycloak
- New documentation for Google Vertex AI and Gemini tracing
- Google Vertex AI support for LLM Playground and Evaluations incl. Gemini models
- Prompt Experiments on Datasets with LLM-as-a-Judge Evaluations
- All new Datasets, Experimentation and Evaluation documentation
- Full multi-modal support, including audio, images, and attachments
Subscribe to our mailing list to get occasional email updates about new features.
🚧 In progress
- Various improvements, including:
  - Use “log levels” to simplify large agent traces and filter by them in the UI
  - Type-safe SDK methods for all API routes
  - Commit messages on prompts
  - Duplicate prompts to edit name
  - A more helpful user screen that summarizes metrics
  - Evaluation runs on historical data
  - Eval execution states across Langfuse Tracing and Datasets
  - Use JSON path to extract structured data from traces for LLM-as-a-judge evals
  - Save table filters as views
  - “Peek view” for tables: preview a trace without switching pages
  - New observation table
  - …
🔮 Planned
- Agent Observability: Improve native support for agentic applications
  - Visualizations of agent executions, e.g. Graph (#2509)
  - Filtering for tool calls used within an execution
  - Opinionated agent evaluations
- Tracing Enhancements
  - Environments: Native support for separating different environments within a project (#5005)
  - Realtime LLM: Bespoke traces for data streams of realtime APIs, especially multi-modal (#4757)
  - OpenTelemetry: Add support for OpenTelemetry-based instrumentations by adding an OpenTelemetry exporter/collector (#2509)
- Prompt Management
- Evaluation
  - Non-LLM evaluators: classifiers, custom code, regex, etc.
  - Evals on repeated spans/observations within a trace
  - Comparisons of different evaluation metrics
  - Session-level scores (#2728)
- Query 100s of millions of traces:
- Data Platform
  - Webhooks to subscribe to changes within your Langfuse project (#1033)
  - Batched exports to your own storage buckets for downstream use and archival
  - Rule-based routing between Langfuse product features
- Other
  - UI/UX: improvements to all core product features
  - Onboarding: simplify getting started with Langfuse without reading the docs
  - Data retention: Project-level data retention policies (#2268)
🙏 Feature requests and bug reports
The best way to support Langfuse is to share your feedback, report bugs, and upvote ideas suggested by others.