Modeling, monitoring and scheduling techniques for network recovery from massive failures

Diman Zad Tootaghaj, Thomas La Porta, Ting He

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Large-scale failures in communication networks due to natural disasters or malicious attacks can severely affect critical communications and threaten lives of people in the affected area. In the absence of a proper communication infrastructure, rescue operation becomes extremely difficult. Progressive and timely network recovery is, therefore, a key to minimizing losses and facilitating rescue missions. To this end, we focus on network recovery assuming partial and uncertain knowledge of the failure locations. We proposed a progressive multi-stage recovery approach that uses the incomplete knowledge of failure to find a feasible recovery schedule. Next, we focused on failure recovery of multiple interconnected networks. In particular, we focused on the interaction between a power grid and a communication network. Then, we focused on network monitoring techniques that can be used for diagnosing the performance of individual links for localizing soft failures (e.g. highly congested links) in a communication network. We studied the optimal selection of the monitoring paths to balance identifiability and probing cost. Finally, we addressed, a minimum disruptive routing framework in software defined networks. Extensive experimental and simulation results show that our proposed recovery approaches have a lower disruption cost compared to the state-of-the-art while we can configure our choice of trade-off between the identifiability, execution time, the repair/probing cost, congestion and the demand loss.

Original languageEnglish (US)
Title of host publication2019 IFIP/IEEE Symposium on Integrated Network and Service Management, IM 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages695-700
Number of pages6
ISBN (Electronic)9783903176157
StatePublished - May 16 2019
Event2019 IFIP/IEEE Symposium on Integrated Network and Service Management, IM 2019 - Arlington, United States
Duration: Apr 8 2019Apr 12 2019

Publication series

Name2019 IFIP/IEEE Symposium on Integrated Network and Service Management, IM 2019

Conference

Conference2019 IFIP/IEEE Symposium on Integrated Network and Service Management, IM 2019
CountryUnited States
CityArlington
Period4/8/194/12/19

All Science Journal Classification (ASJC) codes

  • Information Systems and Management
  • Management Science and Operations Research
  • Information Systems
  • Computer Networks and Communications

Fingerprint Dive into the research topics of 'Modeling, monitoring and scheduling techniques for network recovery from massive failures'. Together they form a unique fingerprint.

Cite this