TY - JOUR
T1 - Fusing heterogeneous data
T2 - A case for remote sensing and social media
AU - Wang, Han
AU - Skau, Erik
AU - Krim, Hamid
AU - Cervone, Guido
N1 - Funding Information:
Manuscript received April 23, 2017; revised October 9, 2017, March 5, 2018, and April 23, 2018; accepted May 9, 2018. Date of publication July 17, 2018; date of current version November 22, 2018. This work was supported by the Department of Energy National Nuclear Security Administration’s Office of Defense Nuclear Nonproliferation R&D through the Consortium for Nonproliferation Enabling Capabilities at North Carolina State University, under Grant DE-NA0002576. (Corresponding author: Han Wang.) H. Wang is with the Department of Computer Science, Institute for Cyber Security, The University of Texas, San Antonio, TX 78249 USA (e-mail: uniwanghan@gmail.com).
Publisher Copyright:
© 2018 IEEE.
PY - 2018/12
Y1 - 2018/12
N2 - Data heterogeneity can pose a great challenge to process and systematically fuse low-level data from different modalities with no recourse to heuristics and manual adjustments and refinements. In this paper, a new methodology is introduced for the fusion of measured data for detecting and predicting weather-driven natural hazards. The proposed research introduces a robust theoretical and algorithmic framework for the fusion of heterogeneous data in near real time. We establish a flexible information-based fusion framework with a target optimality criterion of choice, which for illustration, is specialized to a maximum entropy principle and a least effort principle for semisupervised learning with noisy labels. We develop a methodology to account for multimodality data and a solution for addressing inherent sensor limitations. In our case study of interest, namely, that of flood density estimation, we further show that by fusing remote sensing and social media data, we can develop well founded and actionable flood maps. This capability is valuable in situations where environmental hazards, such as hurricanes or severe weather, affect very large areas. Relative to the state of the art working with such data, our proposed information-theoretic solution is principled and systematic, while offering a joint exploitation of any set of heterogeneous sensor modalities with minimally assuming priors. This flexibility is coupled with the ability to quantitatively and clearly state the fusion principles with very reasonable computational costs. The proposed method is tested and substantiated with the multimodality data of a 2013 Boulder Colorado flood event.
AB - Data heterogeneity can pose a great challenge to process and systematically fuse low-level data from different modalities with no recourse to heuristics and manual adjustments and refinements. In this paper, a new methodology is introduced for the fusion of measured data for detecting and predicting weather-driven natural hazards. The proposed research introduces a robust theoretical and algorithmic framework for the fusion of heterogeneous data in near real time. We establish a flexible information-based fusion framework with a target optimality criterion of choice, which for illustration, is specialized to a maximum entropy principle and a least effort principle for semisupervised learning with noisy labels. We develop a methodology to account for multimodality data and a solution for addressing inherent sensor limitations. In our case study of interest, namely, that of flood density estimation, we further show that by fusing remote sensing and social media data, we can develop well founded and actionable flood maps. This capability is valuable in situations where environmental hazards, such as hurricanes or severe weather, affect very large areas. Relative to the state of the art working with such data, our proposed information-theoretic solution is principled and systematic, while offering a joint exploitation of any set of heterogeneous sensor modalities with minimally assuming priors. This flexibility is coupled with the ability to quantitatively and clearly state the fusion principles with very reasonable computational costs. The proposed method is tested and substantiated with the multimodality data of a 2013 Boulder Colorado flood event.
UR - http://www.scopus.com/inward/record.url?scp=85056144998&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85056144998&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2018.2846199
DO - 10.1109/TGRS.2018.2846199
M3 - Article
AN - SCOPUS:85056144998
VL - 56
SP - 6956
EP - 6968
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
SN - 0196-2892
IS - 12
M1 - 8412269
ER -