Monitoring Reachability to OT Systems on Edge Nodes During the FIFA Event

Overview

Arpatech faced a significant challenge in monitoring the reachability and status of more than 40,000 Operational Technology (OT) systems at each stadium during the FIFA event. The existing data pipeline passed through multiple hops, which delayed the identification of OT connectivity issues; pinpointing the source of a failure often took several hours. Arpatech addressed this by developing an agent that probes endpoints directly and reports the results using the Azure Application Insights SDK.

Challenges

1. Complex Data Pipeline

The existing data pipeline, with multiple hops, introduced significant delays in detecting connectivity issues.

2. Distributed Infrastructure

Because the OT systems were geographically dispersed, monitoring had to be both centralized and accurate.

3. Scalability

Handling data traffic peaks during the event was essential for reliability.

4. Real-time Monitoring

Immediate identification of connectivity issues was crucial to minimize downtime.

5. Operational Efficiency

The hours spent diagnosing connectivity problems reduced the effectiveness of the support team.

Solution Approach

Arpatech developed a monitoring agent with integrated Azure Application Insights SDK to streamline monitoring, eliminate delays, and simplify data aggregation.

Implementation

Development of Agent

  • The agent was designed to probe endpoints directly, reducing dependency on the intermediate hops in the data pipeline.
  • Each OT endpoint was monitored for availability, latency, and other connectivity metrics.
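
The direct probing described above can be illustrated with a minimal sketch. It assumes each OT endpoint exposes a TCP port and that a successful connection counts as "reachable"; the case study does not state which protocol the agent actually used. The names `ProbeResult` and `probe_endpoint` and the 2-second timeout are hypothetical, chosen only for illustration.

```python
import socket
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProbeResult:
    """Outcome of one reachability probe (hypothetical structure)."""
    host: str
    port: int
    reachable: bool
    latency_ms: Optional[float]  # None when the connection failed
    error: Optional[str] = None


def probe_endpoint(host: str, port: int, timeout_s: float = 2.0) -> ProbeResult:
    """Open one TCP connection to the endpoint and time how long it takes.

    Assumption: a completed TCP handshake is treated as "reachable".
    """
    start = time.perf_counter()
    try:
        # create_connection raises OSError on refusal, timeout, or DNS failure.
        with socket.create_connection((host, port), timeout=timeout_s):
            latency_ms = (time.perf_counter() - start) * 1000.0
            return ProbeResult(host, port, True, latency_ms)
    except OSError as exc:
        return ProbeResult(host, port, False, None, str(exc))
```

Calling `probe_endpoint("10.0.0.5", 502)` (an illustrative address and port) returns one `ProbeResult`. Because the probe talks to the endpoint itself, a failure points at that endpoint or its network path, not at an intermediate pipeline hop.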

Integration with Azure Application Insights

  • Azure Application Insights provided unified telemetry storage, anomaly detection, and visualization.
  • Automated alerts enabled rapid issue detection and timely response by the operations team.
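
As a rough sketch of the reporting step, the snippet below shapes one probe outcome into an availability-style record and hands it to a sender. The case study does not show the SDK calls the agent made, so `send` is a hypothetical stand-in for whatever Application Insights client was used, and the field names are illustrative, not the SDK's schema.

```python
from datetime import datetime, timezone
from typing import Callable, Dict, Iterable, Optional

Record = Dict[str, object]


def to_availability_record(
    host: str,
    port: int,
    reachable: bool,
    latency_ms: Optional[float],
    stadium: str,
    error: Optional[str] = None,
) -> Record:
    """Build one telemetry record for a probe outcome (illustrative fields)."""
    return {
        "name": f"ot-reachability/{host}:{port}",
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "success": reachable,
        # Failed probes carry no latency; report 0.0 so the field stays numeric.
        "duration_ms": latency_ms if latency_ms is not None else 0.0,
        "run_location": stadium,
        "message": error or "ok",
    }


def report(records: Iterable[Record], send: Callable[[Record], None]) -> int:
    """Pass each record to the injected sender and return how many were sent."""
    sent = 0
    for record in records:
        send(record)  # hypothetical: would wrap the real SDK call
        sent += 1
    return sent
```

Injecting `send` keeps the record-building logic testable without network access; in a real agent it would wrap the SDK client, and alert rules could then be defined on the `success` field.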

Alerting

  • Alerts from Azure Monitor were configured and integrated with MS Teams so that the responsible teams were notified promptly of each issue.

Results

Improved Monitoring Accuracy

Direct endpoint probing minimized false positives and sped up the identification of genuine connectivity issues.

Reduced Detection Time

Centralized monitoring through Application Insights allowed rapid identification and isolation of connectivity issues, reducing detection time from hours to minutes.

Operational Efficiency

Automated alerts ensured the operations team could promptly address problems, improving the efficiency of the SRE team.

Enhanced Insights

With comprehensive telemetry data, Arpatech gained deeper insights into OT system performance and reachability trends, aiding in proactive maintenance.

Conclusion

By developing an agent built on the Azure Application Insights SDK, Arpatech eliminated the delays in identifying connectivity issues during the 2022 FIFA event. The improved monitoring framework enabled near-instantaneous detection, streamlined data reporting, and improved overall SRE efficiency.
