Metro network management monitoring

Metro network management monitoring

Overview

Challenges Addressed:

  • The network infrastructure comprises multi-vendor devices from Cisco, Juniper, Tellabs, ALU, and Ciena, resulting in interoperability issues where the management systems (MMS) of one vendor are incompatible with devices from another.
  • Lack of centralized aggregation and correlation of operational data (e.g., fault incidents, bandwidth utilization, user traffic) from disparate systems impedes comprehensive analysis and predictive forecasting of network quality.
  • Absence of a unified, panoramic view of the entire network’s operational status, device connectivity, and alert conditions complicates real-time monitoring and impedes efficient incident resolution.
  • Inability to correlate data across different systems to identify root causes and provide effective troubleshooting guidance.

The Metro Network Management System is designed for a telecom operator to deliver end-to-end visibility, monitoring, and management of multi-vendor network equipment, including devices from Cisco, Juniper, Tellabs, ALU, and Ciena. The system leverages SNMP (Simple Network Management Protocol) to capture and consolidate telemetry data from geographically dispersed nodes, providing granular insights into device health and performance. All telemetry is centralized in a Network Operations Center (NOC) for streamlined real-time surveillance, anomaly detection, and incident management.

Functional Modules:

  • Multi-Tenant User Access Management: Manages and assigns access rights for different user groups across multiple tenants, ensuring data security and operational independence between customer groups.
  • Site Device Management: Monitors device-specific metrics such as traffic sensors, CPU utilization, RAM usage, response time (ping time), temperature sensors, and transmission power at each site.
  • Automated Device Configuration: Automates device configuration and updates according to predefined scenarios, minimizing errors and reducing deployment time.
  • Network Topology Management: Displays device status, connections, and disconnection alerts, providing a comprehensive overview of device connectivity and network topology.
  • Full Network Digital Map Management: Provides a digital map representing the entire network, supporting device localization and status monitoring across different geographical areas.
  • Network-Wide Traffic Monitoring: Continuously monitors and analyzes network traffic to detect anomalies, optimize performance, and prevent congestion.
  • Network Connectivity Monitoring: Tracks the connectivity status of network devices, ensuring continuous operation and network stability.
  • Network Service Monitoring: Monitors and manages key network services such as DNS, DHCP, and critical web applications, ensuring uninterrupted service availability.
  • Incident Management (Ticketing): Creates, tracks, and resolves incident tickets to streamline issue resolution and ensure efficient fault management.
  • Network-Wide Alert Management: Manages and displays real-time alerts across the network, allowing operators to quickly identify and address potential issues.
  • Statistical Reporting: Generates periodic reports on network status, device and service performance, supporting system evaluation and future upgrade planning.

Resources and Timeline

This project presents a considerable level of complexity, as it requires managing a diverse array of OID (Object Identifier) metrics from multiple vendors. To tackle these challenges, we not only engaged a team of skilled developers but also brought in experienced network and integration experts to collaboratively design the system architecture and optimize its performance.

Team size

15

Duration

5 months

Requirement Stability

70 %

Customer Satisfaction

95 %

Achievements

Total metro routers managed
10.000
System uptime rate
99
.95%
Incident detection and response time
60
seconds
Data collection and SNMP message processing capacity
60.000
Scalability
30.000