Monitoring
The Operator Node can be configured to send Opentelemetry metrics, logs and traces to an OpenTelemetry Collector. The OpenTelemetry Collector can then be configured to send the metrics to a variety of backends such as Prometheus, Grafana, or other monitoring tools.
To configure the Operator Node to send metrics to an OpenTelemetry Collector, you can use the following CLI arguments:
drosera-operator node --otel-export-endpoint <endpoint> --otel-export-metadata <metadata> --otel-resource-attributes <attributes>
Description of the CLI arguments can be found on the configuration section of the Run the Node page.
Logging
If running the Operator Node as a systemd service, all logs will be persisted on the machine and will consume storage space. To avoid this, you can configure the otel export endpoint to send logs to a remote logging service and send the stdout logs to /dev/null
when starting the the Operator Node. This will prevent logs from being stored on the machine.
drosera-operator node --otel-export-endpoint <endpoint> --log-output stdout > /dev/null
Metrics
The Operator Node collects the following metrics:
System Metrics
drosera_process_cpu_usage
: The CPU usage of the Operator Node processdrosera_process_disk_space_usage
: The disk space usage of the Operator Node processdrosera_process_memory_usage
: The memory usage by the Operator Node processtotal_memory
: The total memory available on the system
Drosera Metrics
execute_trap_duration
: The duration of time it takes to execute a Trapattestation_consensus_duration
: The duration of time it takes to reach consensus on an attestation of a Trap result
More metrics will be added in the future.