Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feedback for node dashboard #2

Open
packetstracer opened this issue Aug 7, 2024 · 0 comments
Open

Feedback for node dashboard #2

packetstracer opened this issue Aug 7, 2024 · 0 comments

Comments

@packetstracer
Copy link

packetstracer commented Aug 7, 2024

Blockscope.net would like to provide in this issue, our vision from our experience implementing and using grafana dashboards, in case it is useful for creating next versions of the dashboard.

  • 1.- In our experience hardware and system usage metrics are nice to have, but usually node operators rely on their own "standard" grafana dashboards. These dashboards are already configured by node operators with custom alerts and metrics. So hardware metrics are good to have but not mandatory IMO. Below the ones we use for our servers.

https://grafana.com/grafana/dashboards/15172-node-exporter-for-prometheus-dashboard-based-on-11074/
https://grafana.com/grafana/dashboards/1860-node-exporter-full/

  • 2.- Thus we think most important metrics are the ones related to consensus, blocks, transactions, rounds, epochs, and other interesting Supra network related metrics. Creating widgets with this information could expose sincronization, connectivity, network congestion, etc. issues. This would allow also to implement specific alerts for Supra network on the Grafana dashboard itself. Below a dashboard we've built for Sui network that exposes interesting network metrics.

image

  • 3.- Probably would be good to publish / OS the grafana dashboard code for node operators that already have their own prometheus/grafana stack in place (most of node operators do). So N.O. can tailor and modify the dashboard to their needs besides having an official one. Previous Sui dashboard below.

https://gitlab.com/blockscope-net/sui-tools/-/blob/main/README.md?ref_type=heads

  • 4.- Another idea, would be to implement a network generic metrics dashboard in https://monitoring.services.supra.com and accessible for NO to check issues that may arise, or to correlate observations looking to other nodes charts (such as DC or geolocation related issues). Below an example of this kind of dashboard.

image

I've seen that these consensus/network metrics are already in the todo list as "Application level" metrics (https://github.com/orgs/Entropy-Foundation/projects/13/views/1), so waiting for those to be implemented. We'll be willing to help build the next dashboard version to include these metrics, and also explore the possibility to build a monitoring CLI tool around these metrics as we've done in other networks.

Hope this information can help you on evolving this dashboard, and feel free to discuss anything related to it and our vision.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant