Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data

Mohsen Parsafard, Guangqing Chi, Xiaobo Qu, Xiaopeng Li, Haizhong Wang

Research output: Contribution to journalArticle

Abstract

Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.

Original languageEnglish (US)
Article number8541110
Pages (from-to)2566-2583
Number of pages18
JournalIEEE Transactions on Intelligent Transportation Systems
Volume20
Issue number7
DOIs
StatePublished - Jul 1 2019

Fingerprint

Trajectories
Error analysis
Table lookup
Interpolation
Bandwidth
Travel time

All Science Journal Classification (ASJC) codes

  • Automotive Engineering
  • Mechanical Engineering
  • Computer Science Applications

Cite this

Parsafard, Mohsen ; Chi, Guangqing ; Qu, Xiaobo ; Li, Xiaopeng ; Wang, Haizhong. / Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data. In: IEEE Transactions on Intelligent Transportation Systems. 2019 ; Vol. 20, No. 7. pp. 2566-2583.
@article{05ec47332deb45f8858661db8e11e35c,
title = "Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data",
abstract = "Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.",
author = "Mohsen Parsafard and Guangqing Chi and Xiaobo Qu and Xiaopeng Li and Haizhong Wang",
year = "2019",
month = "7",
day = "1",
doi = "10.1109/TITS.2018.2868182",
language = "English (US)",
volume = "20",
pages = "2566--2583",
journal = "IEEE Transactions on Intelligent Transportation Systems",
issn = "1524-9050",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "7",

}

Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data. / Parsafard, Mohsen; Chi, Guangqing; Qu, Xiaobo; Li, Xiaopeng; Wang, Haizhong.

In: IEEE Transactions on Intelligent Transportation Systems, Vol. 20, No. 7, 8541110, 01.07.2019, p. 2566-2583.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Error Measures for Trajectory Estimations with Geo-Tagged Mobility Sample Data

AU - Parsafard, Mohsen

AU - Chi, Guangqing

AU - Qu, Xiaobo

AU - Li, Xiaopeng

AU - Wang, Haizhong

PY - 2019/7/1

Y1 - 2019/7/1

N2 - Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.

AB - Although geo-tagged mobility data (e.g., cell phone data and social media data) can be potentially used to estimate individual space-time travel trajectories, they often have low sample rates that only tell travelers' whereabouts at the sparse sample times while leaving the remaining activities to be estimated with interpolation. This paper proposes a set of time geography-based measures to quantify the accuracy of the trajectory estimation in a robust manner. A series of measures including activity bandwidth and normalized activity bandwidth are proposed to quantify the possible absolute and relative error ranges between the estimated and the ground truth trajectories that cannot be observed. These measures can be used to evaluate the suitability of the estimated individual trajectories from sparsely sampled geo-tagged mobility data for travel mobility analysis. We suggest cutoff values of these measures to separate useful data with low estimation errors and noisy data with high estimation errors. We conduct theoretical analysis to show that these error measures decrease with sample rates and peoples' activity ranges. We also propose a lookup table-based interpolation method to expedite the computational time. The proposed measures have been applied to 2013 geo-tagged tweet data in New York City, USA, and 2014 cell-phone data in Shenzhen, China. The results illustrate that the proposed measures can provide estimation error ranges for exceptionally large datasets in much shorter times than the benchmark method without using lookup tables. These results also reveal managerial results into the quality of these data for human mobility studies, including their distribution patterns.

UR - http://www.scopus.com/inward/record.url?scp=85057168182&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057168182&partnerID=8YFLogxK

U2 - 10.1109/TITS.2018.2868182

DO - 10.1109/TITS.2018.2868182

M3 - Article

AN - SCOPUS:85057168182

VL - 20

SP - 2566

EP - 2583

JO - IEEE Transactions on Intelligent Transportation Systems

JF - IEEE Transactions on Intelligent Transportation Systems

SN - 1524-9050

IS - 7

M1 - 8541110

ER -