TY - GEN
T1 - Actively learning to distinguish suspicious from innocuous anomalies in a batch of vehicle tracks
AU - Qiu, Zhicong
AU - Miller, David Jonathan
AU - Stieber, Brian
AU - Fair, Tim
PY - 2014/1/1
Y1 - 2014/1/1
N2 - We investigate the problem of actively learning to distinguish between two sets of anomalous vehicle tracks, innocuous and suspicious, starting from scratch, without any initial examples of suspicious and with no prior knowledge of what an operator would deem suspicious. This two-class problem is challenging because it is a priori unknown which track features may characterize the suspicious class. Furthermore, there is inherent imbalance in the sizes of the labeled innocuous and suspicious sets, even after some suspicious examples are identified. We present a comprehensive solution wherein a classifier learns to discriminate suspicious from innocuous based on derived p-value track features. Through active learning, our classifier thus learns the types of anomalies on which to base its discrimination. Our solution encompasses: i) judicious choice of kinematic p-value based features conditioned on the road of origin, along with more explicit features that capture unique vehicle behavior (e.g. U-turns); ii) novel semi-supervised learning that exploits information in the unlabeled (test batch) tracks, and iii) evaluation of several classifier models (logistic regression, SVMs). We find that two active labeling streams are necessary in practice in order to have efficient classifier learning while also forwarding (for labeling) the most actionable tracks. Experiments on wide-area motion imagery (WAMI) tracks, extracted via a system developed by Toyon Research Corporation, demonstrate the strong ROC AUC performance of our system, with sparing use of operator-based active labeling.
AB - We investigate the problem of actively learning to distinguish between two sets of anomalous vehicle tracks, innocuous and suspicious, starting from scratch, without any initial examples of suspicious and with no prior knowledge of what an operator would deem suspicious. This two-class problem is challenging because it is a priori unknown which track features may characterize the suspicious class. Furthermore, there is inherent imbalance in the sizes of the labeled innocuous and suspicious sets, even after some suspicious examples are identified. We present a comprehensive solution wherein a classifier learns to discriminate suspicious from innocuous based on derived p-value track features. Through active learning, our classifier thus learns the types of anomalies on which to base its discrimination. Our solution encompasses: i) judicious choice of kinematic p-value based features conditioned on the road of origin, along with more explicit features that capture unique vehicle behavior (e.g. U-turns); ii) novel semi-supervised learning that exploits information in the unlabeled (test batch) tracks, and iii) evaluation of several classifier models (logistic regression, SVMs). We find that two active labeling streams are necessary in practice in order to have efficient classifier learning while also forwarding (for labeling) the most actionable tracks. Experiments on wide-area motion imagery (WAMI) tracks, extracted via a system developed by Toyon Research Corporation, demonstrate the strong ROC AUC performance of our system, with sparing use of operator-based active labeling.
UR - http://www.scopus.com/inward/record.url?scp=84905739424&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905739424&partnerID=8YFLogxK
U2 - 10.1117/12.2052778
DO - 10.1117/12.2052778
M3 - Conference contribution
AN - SCOPUS:84905739424
SN - 9781628410167
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Ground/Air Multisensor Interoperability, Integration, and Networking for Persistent ISR V
PB - SPIE
T2 - Ground/Air Multisensor Interoperability, Integration, and Networking for Persistent ISR V
Y2 - 5 May 2014 through 6 May 2014
ER -