
Export Prometheus Metrics to expose resource usage to users #3552

Closed
@HyeockJinKim

Description


Motivation  

  • Currently, users cannot monitor their GPU/NPU utilization in the BAI console. Exporting resource usage as Prometheus metrics would let external tools such as Grafana display utilization data, giving users the visibility into their own usage that they currently lack.

Required Features

  • Export Prometheus Metrics:
    • Enable resource usage metrics (e.g., GPU/NPU utilization) to be exported via Prometheus for external monitoring.
      • GPU/NPU real-time usage
      • GPU/NPU cumulative usage

Impact  

  • Prometheus Integration
    • Metrics export functionality needs to be implemented to expose GPU/NPU utilization data.
  • External Monitoring Tools
    • Enables tools like Grafana to visualize and monitor the metrics.

Testing Scenarios  

  • Integration with Grafana:
    • Test that the exported Prometheus metrics can be visualized in Grafana.
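
Before wiring up a Grafana dashboard, it can help to confirm that the metrics endpoint actually serves the expected series. The sketch below parses Prometheus text exposition output and reports which required metric names are missing. It is a simplified check, assuming sample lines carry no trailing timestamp; the function names are hypothetical.

```python
from __future__ import annotations


def parse_samples(text: str) -> dict[str, float]:
    """Map each series ('name' or 'name{labels}') to its value.

    Comment lines (# HELP, # TYPE) and blank lines are skipped.
    Assumes no trailing timestamp on sample lines.
    """
    samples: dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # The value is the last space-separated field on the line.
        series, _, value = line.rpartition(" ")
        samples[series] = float(value)
    return samples


def missing_metrics(text: str, required: list[str]) -> list[str]:
    """Return the required metric names that do not appear in the text."""
    present = {series.split("{", 1)[0] for series in parse_samples(text)}
    return sorted(set(required) - present)
```

In an integration test, the text passed to `missing_metrics` would be the body fetched from the metrics endpoint; an empty result means every required metric is exposed and can be queried from Grafana through a Prometheus data source.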
