Umbrella Fault Management (IBM Netcool / Impact)

In S&T Ukraine’s solutions, Umbrella Fault Management functions are automated at the levels shown in the picture below.

The following levels of fault / event management are distinguished:

• Event collection, consolidation and normalization;
• Automation of network and service event and fault management;
• Information presentation (Inform), including Alert Management.

At the event collection and consolidation level, special software modules called Netcool probes (the terms “connector”, “mediator” and “adapter” are also used) receive raw data from the EMS / NMS (Element / Network Management System) via northbound interfaces (NBIs). CORBA, SNMP, Q3 and other NBIs are supported.

Received events are consolidated and transformed into a unified representation. In addition, unified problem-resolution-clearance logic is applied: an event that reports the resolution of a problem clears the matching problem event. These transformations are performed in real time. Events are also stored in a historical database for reporting and for the analysis of quality and availability issues.
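
The consolidation step described above can be illustrated with a minimal Python sketch. It is not Netcool probe rules or ObjectServer automation syntax; the field names (`agent`, `trap`, `managedObject` and so on), the severity scale and the functions `normalize` and `consolidate` are assumptions made purely for illustration.

```python
from dataclasses import dataclass

# Hypothetical severity scale for the unified representation (0 means "clear").
SEVERITY = {"cleared": 0, "warning": 2, "minor": 3, "major": 4, "critical": 5}


@dataclass
class Event:
    node: str        # network element that raised the event
    alarm: str       # normalized alarm name
    severity: int    # unified severity, 0 = resolution (clear) event
    count: int = 1   # how many times the same problem was reported

    @property
    def key(self):
        # Events with the same node and alarm describe the same problem.
        return (self.node, self.alarm)


def normalize(source, raw):
    """Map a raw record from one source into the unified Event form."""
    if source == "snmp":   # assumed field names for an SNMP-based source
        return Event(raw["agent"], raw["trap"].lower(),
                     SEVERITY[raw["sev"].lower()])
    if source == "corba":  # assumed field names for a CORBA-based source
        return Event(raw["managedObject"], raw["probableCause"].lower(),
                     SEVERITY[raw["perceivedSeverity"].lower()])
    raise ValueError(f"unknown source: {source}")


def consolidate(active, event):
    """Apply deduplication and problem-resolution to the active event list."""
    if event.severity == 0:
        # A resolution event clears the matching problem, if there is one.
        active.pop(event.key, None)
    elif event.key in active:
        # Same problem reported again: count it and keep the highest severity.
        existing = active[event.key]
        existing.count += 1
        existing.severity = max(existing.severity, event.severity)
    else:
        active[event.key] = event
    return active
```

In this sketch, two reports of the same alarm from the same node end up as a single entry with `count == 2`, and a later "cleared" event for that node and alarm removes the entry, whichever source delivered it.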

Key outcomes that communication service providers receive from the S&T Ukraine Fault Management Solution at the Collection and Consolidation level:

• Events are available to NOC engineers from a single console, with no need to gather data from fragmented management systems.
• The number of events to analyze is reduced by configuring basic correlations: deduplication and problem-resolution.
• A unified naming convention and event representation allow engineers to understand a fault situation at a glance.
• Events are available not only to NOC staff, but to all operations and maintenance engineers.
• Reports are automated, and historical data is available for analysis.

S&T Ukraine has held many discussions with communication service providers from Central and Eastern Europe and Asia. All of them agree that a Fault Management System brings the most valuable results at the next level: automation of network and service event and fault management.

Key outcomes that communication service providers receive from the S&T Ukraine Fault Management Solution at the Automation level:

1. Event categorization according to impact on services:
- Events that impact services;
- Events that do not currently affect services but could do so if the situation remains unresolved (e.g. batteries are drained) or if additional faults occur (e.g. a protection circuit is broken);
- Events that will not affect services.
2. Root-cause analysis.
3. Fault prioritization (the time within which a fault has to be resolved).
4. Intelligent trouble ticket creation and dispatching (manual and automatic).
5. Run-book automation.
6. Site Access automation.
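
Items 1 and 3 of the list above can be sketched in Python as follows. This is a hedged illustration only, not the logic of the Correlation Engine module: the `Fault` fields, the `categorize` and `prioritize` functions and the restoration targets are hypothetical, and real targets would come from the operator's own service level agreements.

```python
from dataclasses import dataclass
from datetime import timedelta
from enum import Enum


class Impact(Enum):
    SERVICE_AFFECTING = 1      # services are impacted now
    POTENTIALLY_AFFECTING = 2  # at risk if unresolved or if more faults occur
    NON_AFFECTING = 3          # services will not be affected


# Hypothetical restoration targets per category (assumed values).
RESTORE_WITHIN = {
    Impact.SERVICE_AFFECTING: timedelta(hours=4),
    Impact.POTENTIALLY_AFFECTING: timedelta(hours=24),
    Impact.NON_AFFECTING: timedelta(days=7),
}


@dataclass
class Fault:
    alarm: str
    service_down: bool = False     # a service is already unavailable
    redundancy_lost: bool = False  # e.g. a protection circuit is broken
    on_backup_power: bool = False  # e.g. the site is running on batteries


def categorize(fault):
    """Assign one of the three service-impact categories to a fault."""
    if fault.service_down:
        return Impact.SERVICE_AFFECTING
    if fault.redundancy_lost or fault.on_backup_power:
        return Impact.POTENTIALLY_AFFECTING
    return Impact.NON_AFFECTING


def prioritize(faults):
    """Order faults so that the shortest restoration target comes first."""
    return sorted(faults, key=lambda f: RESTORE_WITHIN[categorize(f)])
```

With these assumptions, a fault on a site that is running on batteries falls into the second category and is ordered after service-affecting faults but ahead of faults that will not affect services.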

Examples of scenarios for automating engineers’ tasks are provided in Operational Process Automation Scenarios.
To make these outcomes possible, S&T Ukraine has developed the Correlation Engine and Site Access modules.

At the Inform level, information is presented in a form that is convenient for engineers:

• NOC (Network Operations Centre) engineers: Event List, BTS statuses on GIS, etc.;
• Field engineers: OSS Mobile Client.

As a result of Fault Management Automation:

1. All routine, repetitive network monitoring and fault management tasks are performed automatically.
2. NOC engineers can focus on the analysis of network anomalies and on preventive activities.