TY - JOUR
T1 - IMatch2
T2 - Compound identification using retention index for analysis of gas chromatography-mass spectrometry data
AU - Koo, Imhoi
AU - Shi, Xue
AU - Kim, Seongho
AU - Zhang, Xiang
N1 - Funding Information:
This work was supported by National Science Foundation (NSF) grant DMS-1312603 , and National Institute of Health (NIH) grants 1RO1GM087735 through the National Institute of General Medical Sciences (NIGMS) and R21ES021311 through the National Institute of Environmental Health Sciences (NIEHS) .
PY - 2014/4/11
Y1 - 2014/4/11
N2 - We developed a method, iMatch2, for compound identification using retention indices (RI) in NIST11 library. Three-way ANOVA test and Kruskal-Wallis test respectively demonstrate that column class and temperature program type defined by the NIST library are the most dominant factors affecting the magnitude of retention index while the retention index data type does not cause significant difference. The developed linear regression transformation for merging retention indices with different data types, but the same column class and temperature program type, reduces the standard deviation of retention index up to 8%, compared to the simple union approach used in the original iMatch. As for outlier detection methods to remove retention indices having large difference with the remaining data of the same compound, Tietjen-Moore test and generalized extreme studentized deviate test are the strictest methods, while methods such as Dixon's test, Thompson tau approach, and Grubbs' test are more conservative. To improve the accuracy of retention index window, a concept of compound specific retention index window is introduced for compounds with a large number of retention indices in the NIST11 library, while the retention index window is calculated from empirical distributions for the compounds with a small number of retention indices. Analysis of the experimental data of a mixture of compound standards and the metabolite extract from mouse liver show significant improvement of retention index quality in the NIST11 library and the new data analysis methods.
AB - We developed a method, iMatch2, for compound identification using retention indices (RI) in NIST11 library. Three-way ANOVA test and Kruskal-Wallis test respectively demonstrate that column class and temperature program type defined by the NIST library are the most dominant factors affecting the magnitude of retention index while the retention index data type does not cause significant difference. The developed linear regression transformation for merging retention indices with different data types, but the same column class and temperature program type, reduces the standard deviation of retention index up to 8%, compared to the simple union approach used in the original iMatch. As for outlier detection methods to remove retention indices having large difference with the remaining data of the same compound, Tietjen-Moore test and generalized extreme studentized deviate test are the strictest methods, while methods such as Dixon's test, Thompson tau approach, and Grubbs' test are more conservative. To improve the accuracy of retention index window, a concept of compound specific retention index window is introduced for compounds with a large number of retention indices in the NIST11 library, while the retention index window is calculated from empirical distributions for the compounds with a small number of retention indices. Analysis of the experimental data of a mixture of compound standards and the metabolite extract from mouse liver show significant improvement of retention index quality in the NIST11 library and the new data analysis methods.
UR - http://www.scopus.com/inward/record.url?scp=84900628789&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84900628789&partnerID=8YFLogxK
U2 - 10.1016/j.chroma.2014.02.049
DO - 10.1016/j.chroma.2014.02.049
M3 - Article
C2 - 24630063
AN - SCOPUS:84900628789
SN - 0021-9673
VL - 1337
SP - 202
EP - 210
JO - Journal of Chromatography
JF - Journal of Chromatography
ER -