Order-sensitive imputation for clustered missing values (Extended Abstract)

Qian Ma, Yu Gu, Wang-chien Lee, Ge Yu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

To address the problem of missing values (MVs), we propose the Order-Sensitive Imputation for Clustered Missing values (OSICM) framework, in which missing values are imputed sequentially so that values filled earlier in the process are also used when imputing later MVs. The order of imputation is therefore critical to the effectiveness and efficiency of the OSICM framework. We formulate the search for the optimal imputation order as an optimization problem and show that it is NP-hard. Furthermore, we devise an algorithm that finds the exact optimal solution and propose two approximate/heuristic algorithms that trade effectiveness for efficiency. Finally, we conduct extensive experiments on real and synthetic datasets to demonstrate the superiority of the OSICM framework.
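The order sensitivity described in the abstract can be illustrated with a toy sketch. The neighbour-mean rule, the function name `impute_in_order`, and the sample series below are assumptions made purely for illustration; they are not the OSICM imputation method or its ordering algorithms, which the abstract does not specify.

```python
from itertools import permutations


def impute_in_order(values, order):
    """Fill the missing entries (None) of `values` one at a time, in `order`.

    Hypothetical rule: each missing entry becomes the mean of its adjacent
    entries that are currently known, so values filled earlier feed into
    later imputations.
    """
    filled = list(values)
    for i in order:
        neighbours = [
            filled[j]
            for j in (i - 1, i + 1)
            if 0 <= j < len(filled) and filled[j] is not None
        ]
        if not neighbours:
            raise ValueError(f"no known neighbour for index {i}")
        filled[i] = sum(neighbours) / len(neighbours)
    return filled


# A cluster of two adjacent missing values between two observed ones.
series = [1.0, None, None, 7.0]
missing = [i for i, v in enumerate(series) if v is None]

# Try every imputation order and compare the completed series.
results = {
    order: impute_in_order(series, order) for order in permutations(missing)
}
for order, filled in results.items():
    print(order, filled)
# (1, 2) [1.0, 1.0, 4.0, 7.0]
# (2, 1) [1.0, 4.0, 7.0, 7.0]
```

Enumerating all permutations, as done here, grows factorially with the number of missing values, which is consistent with the abstract's motivation for approximate and heuristic ordering algorithms.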

Original language: English (US)
Title of host publication: Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019
Publisher: IEEE Computer Society
Pages: 2147-2148
Number of pages: 2
ISBN (Electronic): 9781538674741
DOI: https://doi.org/10.1109/ICDE.2019.00268
State: Published - Apr 1 2019
Event: 35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, China
Duration: Apr 8 2019 to Apr 11 2019

Publication series

Name: Proceedings - International Conference on Data Engineering
Volume: 2019-April
ISSN (Print): 1084-4627

Conference

Conference: 35th IEEE International Conference on Data Engineering, ICDE 2019
Country: China
City: Macau
Period: 4/8/19 to 4/11/19

Fingerprint

Heuristic algorithms
Hardness
Experiments

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Information Systems

Cite this

Ma, Q., Gu, Y., Lee, W., & Yu, G. (2019). Order-sensitive imputation for clustered missing values (Extended Abstract). In Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019 (pp. 2147-2148). [8731477] (Proceedings - International Conference on Data Engineering; Vol. 2019-April). IEEE Computer Society. https://doi.org/10.1109/ICDE.2019.00268