Monitoring

Dugite provides two complementary monitoring tools: a terminal dashboard (dugite-monitor) for quick at-a-glance status, and a Prometheus-compatible metrics endpoint for production alerting and dashboards.

Terminal Dashboard (dugite-monitor)

dugite-monitor is a standalone binary that renders a real-time status dashboard in the terminal by polling the node's Prometheus endpoint. It requires no external infrastructure and works over SSH.

dugite-monitor TUI walkthrough

# Monitor a local node (default: http://localhost:12798/metrics)
dugite-monitor

# Monitor a remote node
dugite-monitor --metrics-url http://192.168.1.100:12798/metrics

# Custom refresh interval (default: 2 seconds)
dugite-monitor --refresh-interval 5

The dashboard displays four panels:

Chain Status — sync progress, current slot/block/epoch, tip age, GSM state
Peers — out/in/total connection counts, hot/warm/cold breakdown, EWMA latency
Performance — block rate sparkline, replay throughput, transaction counts
Governance — treasury balance, DRep count, active proposals, pool count

Color-coded health indicators (green/yellow/red) reflect tip age and sync progress. The block rate sparkline shows the last 30 data points so you can spot throughput trends at a glance.

Keyboard navigation: q to quit, Tab to cycle panels, j/k (vim-style) to scroll within a panel.

Prometheus Metrics Endpoint

Dugite exposes a Prometheus-compatible metrics endpoint for monitoring node health and sync progress.

Metrics Endpoint

The metrics server runs on port 12798 by default and responds to any HTTP request with Prometheus exposition format metrics:

http://localhost:12798/metrics

Example response:

# HELP dugite_blocks_received_total Total blocks received from peers
# TYPE dugite_blocks_received_total gauge
dugite_blocks_received_total 1523847

# HELP dugite_blocks_applied_total Total blocks applied to ledger
# TYPE dugite_blocks_applied_total gauge
dugite_blocks_applied_total 1523845

# HELP dugite_slot_number Current slot number
# TYPE dugite_slot_number gauge
dugite_slot_number 142857392

# HELP dugite_block_number Current block number
# TYPE dugite_block_number gauge
dugite_block_number 11283746

# HELP dugite_epoch_number Current epoch number
# TYPE dugite_epoch_number gauge
dugite_epoch_number 512

# HELP dugite_sync_progress_percent Chain sync progress (0-10000, divide by 100 for %)
# TYPE dugite_sync_progress_percent gauge
dugite_sync_progress_percent 9542

# HELP dugite_utxo_count Number of entries in the UTxO set
# TYPE dugite_utxo_count gauge
dugite_utxo_count 15234892

# HELP dugite_mempool_tx_count Number of transactions in the mempool
# TYPE dugite_mempool_tx_count gauge
dugite_mempool_tx_count 42

# HELP dugite_peers_connected Number of connected peers
# TYPE dugite_peers_connected gauge
dugite_peers_connected 8

Health Endpoint

The metrics server exposes a /health endpoint for monitoring node status:

GET http://localhost:12798/health

Returns JSON with three possible statuses:

healthy: Sync progress >= 99.9%
syncing: Actively catching up to chain tip
stalled: No blocks received for > 5 minutes AND sync < 99%

{
  "status": "healthy",
  "uptime_seconds": 3421,
  "slot": 142857392,
  "block": 11283746,
  "epoch": 512,
  "sync_progress": 99.95,
  "peers": 8,
  "last_block_received": "2026-03-14T12:34:56.789Z"
}

Readiness Endpoint

For Kubernetes readiness probes:

GET http://localhost:12798/ready

Returns 200 OK when sync_progress >= 99.9%, 503 Service Unavailable otherwise:

{"ready": true}

or:

{"ready": false, "sync_progress": 75.42}

Available Metrics

Counters

Metric	Description
`dugite_blocks_received_total`	Total blocks received from peers
`dugite_blocks_applied_total`	Total blocks successfully applied to the ledger
`dugite_transactions_received_total`	Total transactions received
`dugite_transactions_validated_total`	Total transactions validated
`dugite_transactions_rejected_total`	Total transactions rejected
`dugite_rollback_count_total`	Total number of chain rollbacks
`dugite_blocks_forged_total`	Total blocks forged by this node
`dugite_leader_checks_total`	Total VRF leader checks performed
`dugite_leader_checks_not_elected_total`	Leader checks where node was not elected
`dugite_forge_failures_total`	Block forge attempts that failed
`dugite_blocks_announced_total`	Blocks successfully announced to peers
`dugite_n2n_connections_total`	Total N2N (peer-to-peer) connections accepted
`dugite_n2c_connections_total`	Total N2C (client) connections accepted
`dugite_validation_errors_total{error="..."}`	Transaction validation errors, broken down by error type
`dugite_protocol_errors_total{error="..."}`	Protocol-level errors by type (e.g. handshake failures, connection errors)

Gauges

Metric	Description
`dugite_peers_connected`	Number of connected peers
`dugite_peers_cold`	Number of cold (known but unconnected) peers
`dugite_peers_warm`	Number of warm (connected, not syncing) peers
`dugite_peers_hot`	Number of hot (actively syncing) peers
`dugite_sync_progress_percent`	Chain sync progress (0-10000; divide by 100 for percentage)
`dugite_slot_number`	Current slot number
`dugite_block_number`	Current block number
`dugite_epoch_number`	Current epoch number
`dugite_utxo_count`	Number of entries in the UTxO set
`dugite_mempool_tx_count`	Number of transactions in the mempool
`dugite_mempool_bytes`	Size of the mempool in bytes
`dugite_delegation_count`	Number of active stake delegations
`dugite_treasury_lovelace`	Total lovelace in the treasury
`dugite_drep_count`	Number of registered DReps
`dugite_proposal_count`	Number of active governance proposals
`dugite_pool_count`	Number of registered stake pools
`dugite_uptime_seconds`	Seconds since node startup
`dugite_disk_available_bytes`	Available disk space on the database volume
`dugite_n2n_connections_active`	Currently active N2N connections
`dugite_n2c_connections_active`	Currently active N2C connections
`dugite_p2p_enabled`	Whether P2P governance is active (0 or 1)
`dugite_diffusion_mode`	Current diffusion mode (0=InitiatorOnly, 1=InitiatorAndResponder)
`dugite_peer_sharing_enabled`	Whether peer sharing is active (0 or 1)
`dugite_tip_age_seconds`	Seconds since the tip slot time
`dugite_chainsync_idle_seconds`	Seconds since last ChainSync RollForward event
`dugite_ledger_replay_duration_seconds`	Duration of last ledger replay in seconds
`dugite_mem_resident_bytes`	Resident set size (RSS) in bytes

Histograms

Metric	Buckets (ms)	Description
`dugite_peer_handshake_rtt_ms`	1, 5, 10, 25, 50, 100, 250, 500, 1000, 2500, 5000, 10000	Peer N2N handshake round-trip time
`dugite_peer_block_fetch_ms`	(same)	Per-block fetch latency

Histograms expose _bucket, _count, and _sum suffixes for standard Prometheus histogram queries.

Prometheus Configuration

Add the Dugite node as a scrape target in your prometheus.yml:

scrape_configs:
  - job_name: 'dugite'
    scrape_interval: 15s
    static_configs:
      - targets: ['localhost:12798']
        labels:
          network: 'mainnet'
          node: 'relay-1'

Grafana Dashboard

Dugite ships with a pre-built Grafana dashboard at config/monitoring/grafana-dashboard.json. The dashboard covers all node metrics organized into nine sections:

Overview — Sync progress gauge, block height, epoch, slot, connected peers, blocks forged
Node Health — Uptime, disk available (stat + time series)
Sync & Throughput — Sync progress over time, block apply/receive rate (blk/s), block height, rollbacks
Peers — Connected peer count over time, peer state breakdown (hot/warm/cold stacked)
Mempool & Transactions — Mempool tx count, mempool size (bytes), transaction rate (received/validated/rejected)
Ledger State — UTxO set size, stake delegations, treasury balance (ADA), registered stake pools
Governance — Registered DReps, active governance proposals
Block Production — Total blocks forged, block forge rate (blk/h)
Network Latency — Handshake RTT and block fetch latency percentiles (p50/p95/p99), request counts
Validation Errors — Error breakdown by type (stacked bars), error totals (bar chart)

Quick Start (Docker)

The fastest way to start a local monitoring stack is with the included script:

# Start Prometheus + Grafana
just monitor-start         # or: ./scripts/monitoring/start.sh

# Open the dashboard (admin/admin)
open http://localhost:3000/d/dugite-node/dugite-node

# Check status
just monitor-status        # or: ./scripts/monitoring/start.sh status

# Stop
just monitor-stop          # or: ./scripts/monitoring/start.sh stop

The script starts Prometheus (port 9090) and Grafana (port 3000) as Docker containers, auto-configures the Prometheus datasource, and imports the Dugite dashboard. Prometheus data is persisted in .monitoring-data/ so metrics survive restarts.

Environment variables for port customization:

Variable	Default	Description
`PROMETHEUS_PORT`	9090	Prometheus web UI port
`GRAFANA_PORT`	3000	Grafana web UI port
`DUGITE_METRICS_PORT`	12798	Port where Dugite exposes metrics

Importing the Dashboard

Open Grafana and go to Dashboards > Import
Click Upload JSON file and select config/monitoring/grafana-dashboard.json
Select your Prometheus data source when prompted
Click Import

The dashboard includes an instance template variable so you can monitor multiple Dugite nodes (relays + block producer) from a single dashboard. It auto-refreshes every 30 seconds.

Provisioning

To auto-provision the dashboard, copy it into your Grafana provisioning directory:

cp config/monitoring/grafana-dashboard.json /etc/grafana/provisioning/dashboards/dugite.json

Add a dashboard provider in /etc/grafana/provisioning/dashboards/dugite.yaml:

apiVersion: 1
providers:
  - name: Dugite
    folder: Cardano
    type: file
    options:
      path: /etc/grafana/provisioning/dashboards
      foldersFromFilesStructure: false

Quick Start (macOS)

To quickly preview the dashboard locally with Homebrew:

# Install Prometheus and Grafana
brew install prometheus grafana

# Configure Prometheus to scrape Dugite
cat > /opt/homebrew/etc/prometheus.yml << 'EOF'
global:
  scrape_interval: 5s

scrape_configs:
  - job_name: dugite
    static_configs:
      - targets: ['localhost:12798']
EOF

# Provision the datasource
cat > "$(brew --prefix)/opt/grafana/share/grafana/conf/provisioning/datasources/dugite.yaml" << 'EOF'
apiVersion: 1
datasources:
  - name: Prometheus
    type: prometheus
    access: proxy
    url: http://localhost:9090
    isDefault: true
    uid: DS_PROMETHEUS
EOF

# Provision the dashboard
cat > "$(brew --prefix)/opt/grafana/share/grafana/conf/provisioning/dashboards/dugite.yaml" << 'EOF'
apiVersion: 1
providers:
  - name: Dugite
    folder: Cardano
    type: file
    options:
      path: /opt/homebrew/var/lib/grafana/dashboards
EOF

mkdir -p /opt/homebrew/var/lib/grafana/dashboards
sed 's/${DS_PROMETHEUS}/DS_PROMETHEUS/g' config/monitoring/grafana-dashboard.json \
  > /opt/homebrew/var/lib/grafana/dashboards/dugite.json

# Start services
brew services start prometheus
brew services start grafana

# Open the dashboard (default login: admin/admin)
open "http://localhost:3000/d/dugite-node/dugite-node"

To stop:

brew services stop prometheus grafana

Key Queries

Panel	PromQL
Sync progress	`dugite_sync_progress_percent / 100`
Block throughput	`rate(dugite_blocks_applied_total[5m])`
Transaction rejection rate	`rate(dugite_transactions_rejected_total[5m])`
Treasury balance (ADA)	`dugite_treasury_lovelace / 1e6`
Block forge rate (per hour)	`rate(dugite_blocks_forged_total[1h]) * 3600`
Handshake RTT p95	`histogram_quantile(0.95, rate(dugite_peer_handshake_rtt_ms_bucket[5m]))`
Block fetch latency p95	`histogram_quantile(0.95, rate(dugite_peer_block_fetch_ms_bucket[5m]))`
Validation errors by type	`rate(dugite_validation_errors_total[5m])`
Protocol errors by type	`rate(dugite_protocol_errors_total[5m])`
Leader election rate	`rate(dugite_leader_checks_total[5m])`
Active N2N connections	`dugite_n2n_connections_active`
Disk available	`dugite_disk_available_bytes`

Console Logging

In addition to the Prometheus endpoint, Dugite logs sync progress to the console every 5 seconds. The log output includes:

Current slot and block number
Epoch number
UTxO count
Sync percentage
Blocks-per-second throughput

Example log line:

2026-03-12T12:34:56.789Z  INFO dugite_node::node: Syncing progress="95.42%" epoch=512 block=11283746 tip=11300000 remaining=16254 speed="312 blk/s" utxos=15234892

Log output can be directed to stdout, file, or systemd journal. See Logging for full details on output targets, file rotation, and log level configuration.

Dugite Documentation