GPU Sentinel Pro

"Information should not be displayed all at once; let people gradually become familiar with it." - Edward Tufte

Transform GPU monitoring from complex metrics into intuitive visual patterns. Enterprise-grade NVIDIA GPU monitoring with real-time analytics, intelligent alerts, and historical analysis.


Figure: Dark Mode Dashboard. Real-time GPU metrics visualized for instant comprehension.

Project Overview

Why GPU Sentinel Pro?

Do you find yourself:

  • Parsing dense terminal output when you should be focusing on your work?
  • Missing critical temperature spikes or memory leaks?
  • Lacking historical data for performance analysis?
  • Needing better alerting for GPU health?
  • Managing multiple GPUs across different workloads?

Features

🎯 Intuitive Monitoring

  • Real-time visual dashboard
  • Color-coded temperature ranges
  • Resource utilization patterns
  • Multi-GPU support
  • Dark/light mode

🔔 Intelligent Alerts

  • Configurable thresholds
  • Temperature spike detection
  • Resource bottleneck warnings
  • Email/webhook notifications
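As a rough illustration of what temperature spike detection could look like, here is a minimal Python sketch. It is not taken from the project's code: the `SpikeDetector` class, the `window` size, and the `max_rise_c` threshold are all hypothetical names and values chosen for this example. The idea is to keep a short sliding window of readings and flag a spike when the newest reading has risen by at least a configurable amount above the coolest reading in the window.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class SpikeDetector:
    """Flags a rapid temperature rise within a sliding window (illustrative only)."""

    window: int = 5            # hypothetical: number of recent samples to keep
    max_rise_c: float = 10.0   # hypothetical: rise (in °C) that counts as a spike
    samples: deque = field(default_factory=deque)

    def update(self, temp_c: float) -> bool:
        """Record a reading and return True if it looks like a spike."""
        self.samples.append(temp_c)
        # Drop the oldest reading once the window is full.
        if len(self.samples) > self.window:
            self.samples.popleft()
        # Compare the newest reading against the coolest one still in the window.
        return temp_c - min(self.samples) >= self.max_rise_c
```

A detector like this would complement the static thresholds: a jump from 60°C to 73°C in a few samples can be worth an alert even though 73°C is still in the normal band.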

📊 Advanced Analytics

  • Historical performance data
  • Time-series analysis
  • Usage pattern recognition
  • Power efficiency metrics

🛠 Enterprise Integration

  • RESTful API
  • Supabase time-series storage
  • Zero-config deployment
  • Multi-user access

Visual Intelligence

Traditional Output vs. GPU Sentinel Pro

Figure: Traditional vs Modern. Transform dense metrics into intuitive patterns.
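To make the "dense output to structured data" step concrete, the sketch below shows one way to collect per-GPU metrics from `nvidia-smi` using its CSV query mode and turn them into plain dictionaries that a dashboard could render. This is an assumption about how such a collector might work, not the project's actual implementation; the function names `parse_gpu_csv` and `query_gpus` and the dictionary keys are hypothetical.

```python
import csv
import io
import subprocess

# Fields requested from nvidia-smi, in the order they will appear in each CSV row.
QUERY_FIELDS = [
    "index",
    "name",
    "temperature.gpu",
    "utilization.gpu",
    "memory.used",
    "memory.total",
]


def _to_int(value: str):
    """Convert a numeric field, returning None for values such as '[N/A]'."""
    try:
        return int(value)
    except ValueError:
        return None


def parse_gpu_csv(text: str) -> list[dict]:
    """Parse `--format=csv,noheader,nounits` output into one dict per GPU."""
    gpus = []
    for row in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not row:
            continue  # skip blank lines
        index, name, temp, util, mem_used, mem_total = row
        gpus.append({
            "index": int(index),
            "name": name,
            "temperature_c": _to_int(temp),
            "utilization_pct": _to_int(util),
            "memory_used_mib": _to_int(mem_used),
            "memory_total_mib": _to_int(mem_total),
        })
    return gpus


def query_gpus() -> list[dict]:
    """Run nvidia-smi (must be on PATH) and return parsed metrics."""
    result = subprocess.run(
        [
            "nvidia-smi",
            f"--query-gpu={','.join(QUERY_FIELDS)}",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_csv(result.stdout)
```

Keeping the parser separate from the subprocess call means the parsing logic can be exercised on captured output without a GPU present.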

Real-world Usage

Machine Learning Workloads

Figure: ML Monitoring. Clear resource utilization during ML model training.

Critical Temperature Monitoring

Figure: Temperature Alerts. Immediate visual alerts during intensive workloads.

Temperature Monitoring

Intuitive color-coding for instant recognition:

  • 🔴 ≥85°C: Critical
  • 🟠 75-84°C: Warning
  • 🟡 65-74°C: Normal
  • 🟢 50-64°C: Optimal
  • 🔵 <50°C: Cool
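The bands above can be expressed as a small lookup function. The following Python sketch is illustrative only: the function name `classify_temperature` and the lowercase band labels are hypothetical, and the bands are treated as half-open ranges so that a fractional reading such as 84.5°C falls into Warning rather than between two bands.

```python
def classify_temperature(temp_c: float) -> str:
    """Map a GPU temperature in °C to one of the color bands listed above."""
    if temp_c >= 85:
        return "critical"  # red
    if temp_c >= 75:
        return "warning"   # orange
    if temp_c >= 65:
        return "normal"    # yellow
    if temp_c >= 50:
        return "optimal"   # green
    return "cool"          # blue
```

Checking from the hottest band downward keeps each branch to a single comparison and makes the boundaries easy to adjust.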

Quick Start

# Clone the repository and enter it
git clone https://github.com/jackccrawford/gpu-sentinel-pro.git
cd gpu-sentinel-pro

# Start the backend and frontend services
./backend/src/service/run_service.sh
./frontend/run_frontend.sh

# Then open the dashboard in a browser:
# http://localhost:3055

See Installation Guide for detailed setup instructions.

Documentation

Tech Stack

Python, FastAPI, React, TypeScript, Supabase

License

MIT License - Free for personal and commercial use.


Built for Orphaned NVIDIAs