Skip to main content

Monitoring

Monitoring is the continuous collection, measurement, and analysis of data about systems, networks, applications, or processes to assess their state, detect deviations from expected behavior, and support operational, security, and compliance objectives.

Expanded Explanation

1. Technical Function and Core Characteristics

Monitoring collects telemetry such as logs, metrics, traces, and events from infrastructure, applications, networks, and security controls. It evaluates this data against defined thresholds, policies, or baselines to identify performance, availability, integrity, or security deviations.

Monitoring systems often provide alerting, visualization, and reporting capabilities that support real-time and historical analysis. They integrate with automation and incident management workflows so that detected conditions can trigger responses, escalation, or further investigation.

2. Enterprise Usage and Architectural Context

Enterprises use monitoring to observe distributed systems, cloud services, on-premises (on-prem) infrastructure, and business applications in production and preproduction environments. Monitoring architectures commonly include data collectors, time-series or log storage, analytics engines, and dashboards.

In enterprise architectures, monitoring spans multiple domains, including IT operations, Security Operations (SecOps), network operations, and DevOps practices such as Site Reliability Engineering (SRE). Organizations define monitoring strategies and service-level objectives so that observability data aligns with operational and business requirements.

3. Related or Adjacent Technologies

Monitoring relates closely to observability, which focuses on understanding internal system state from external outputs, and to logging, metrics collection, and distributed tracing. It also relates to application performance monitoring, infrastructure monitoring, and Network Performance Monitoring (NPMO).

Security monitoring intersects with Security Information and Event Management (SIEM), intrusion detection systems, Endpoint Detection And Response (EDR), and security orchestration and automated response tools. In regulated environments, compliance monitoring aligns with frameworks and standards that specify control effectiveness and audit requirements.

4. Business and Operational Significance

Monitoring supports service availability, performance, and reliability objectives by enabling early detection of faults, capacity issues, and configuration errors. It helps operations teams reduce mean time to detect and mean time to respond during incidents.

Monitoring also supports risk management and regulatory compliance by evidencing control operation, detecting policy violations, and providing audit trails. It supplies data that decision-makers use for capacity planning, cost management, and evaluation of technology investments.