TY - JOUR
T1 - Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data
AU - Parsafard, Mohsen
AU - Chi, Guangqing
AU - Qu, Xiaobo
AU - Li, Xiaopeng
AU - Wang, Haizhong
N1 - Funding Information:
This work was supported in part by the U.S. National Science Foundation under Grant CMMI 1558889, Grant CMMI 1541130, Grant CMMI 1453949, Grant CMMI 1541136, and Grant SES 1823633, and in part by the Eunice Kennedy Shriver National Institute of Child Health and Human Development under Grant P2C HD041025
Funding Information:
Manuscript received May 18, 2017; revised March 27, 2018; accepted August 28, 2018. Date of publication November 20, 2018; date of current version June 26, 2019. This work was supported in part by the U.S. National Science Foundation under Grant CMMI 1558889, Grant CMMI 1541130, Grant CMMI 1453949, Grant CMMI 1541136, and Grant SES 1823633, and in part by the Eunice Kennedy Shriver National Institute of Child Health and Human Development under Grant P2C HD041025. The Associate Editor for this paper was W. Lin. (Corresponding author: Xiaopeng Li.) M. Parsafard is with Coyote Logistics, Chicago, IL 60647 USA (e-mail: mohsen.parsafard@coyote.com).
Publisher Copyright:
© 2000-2011 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.
AB - Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.
UR - http://www.scopus.com/inward/record.url?scp=85057168182&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057168182&partnerID=8YFLogxK
U2 - 10.1109/TITS.2018.2868182
DO - 10.1109/TITS.2018.2868182
M3 - Article
C2 - 32699534
AN - SCOPUS:85057168182
SN - 1524-9050
VL - 20
SP - 2566
EP - 2583
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 7
M1 - 8541110
ER -