Skip to main content

ONES 3.1 Enhances SONiC Features for Better Monitoring

The latest release of ONES 3.1 introduces enhancements that enable improved monitoring capabilities for systems utilizing SONiC. IT decision-makers can utilize these features to maintain optimized network operations.

System Health Monitoring

The platform offers real-time insights into system health monitoring, focusing on critical performance metrics.

CPU-Intensive Services

Operators can identify the top CPU-consuming services, which is essential for managing performance and resource allocation effectively.

Unhealthy Devices with Failure Codes

The update provides mechanisms for tracking devices that exhibit failure codes, allowing for quicker diagnostics and repairs.

SONiC Docker Transitions

There are new features that streamline Docker transitions within managed networks, enhancing operational efficiency.

Automatic IP Detection, Alerting and Rediscovery

The IP Transition Widget tracks IP changes within devices over time, generating alerts for any conflicts. This capability is supported by the ONES Rule Engine, which notifies operators of management IP transitions to ensure operational continuity.

Detailed summaries of device IP transitions are also included, helping operators manage network modifications effectively.

Rule Engine: Enhanced Alerts

The ONES Rule Engine facilitates customized alert configurations to automate monitoring processes. When preset conditions are triggered, alerts are generated, enabling immediate responses to network issues.

New metrics include:

  • Monitoring Docker CPU/Memory utilization to prevent overload.
  • Issuing alerts when any Docker container goes down.
  • Identifying failures in unhealthy devices.
  • Tracking IP transitions for proactive management.

This version also enhances capabilities for SONiC network monitoring with real-time failure detection and automated IP management.

FAQs

ONES 3.1 provides in-depth SONiC observability, giving teams the ability to track system health comprehensively. This enhances troubleshooting effectiveness.

New alerts for Docker Down Status help maintain service continuity by promptly alerting operators to issues.

Automatic IP rediscovery facilitates seamless management IP updates without requiring manual changes, maintaining consistent monitoring capabilities.

Operators gain visibility into Central Processing Unit (CPU) utilization, enabling prompt identification and resolution of performance-related issues.

Advanced observability tools in ONES 3.1 further support proactive management of system operations.