Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data

Seongho Kim, Imhoi Koo, Jaesik Jeong, Shiwen Wu, Xue Shi, Xiang Zhang

Research output: Contribution to journalArticle

33 Citations (Scopus)

Abstract

Compound identification is a key component of data analysis in the applications of gas chromatography-mass spectrometry (GC-MS). Currently, the most widely used compound identification is mass spectrum matching, in which the dot product and its composite version are employed as spectral similarity measures. Several forms of transformations for fragment ion intensities have also been proposed to increase the accuracy of compound identification. In this study, we introduced partial and semipartial correlations as mass spectral similarity measures and applied them to identify compounds along with different transformations of peak intensity. The mixture versions of the proposed method were also developed to further improve the accuracy of compound identification. To demonstrate the performance of the proposed spectral similarity measures, the National Institute of Standards and Technology (NIST) mass spectral library and replicate spectral library were used as the reference library and the query spectra, respectively. Identification results showed that the mixture partial and semipartial correlations always outperform both the dot product and its composite measure. The mixture similarity with semipartial correlation has the highest accuracy of 84.6% in compound identification with a transformation of (0.53,1.3) for fragment ion intensity and m/z value, respectively.

Original languageEnglish (US)
Pages (from-to)6477-6487
Number of pages11
JournalAnalytical chemistry
Volume84
Issue number15
DOIs
StatePublished - Aug 7 2012

Fingerprint

Gas chromatography
Mass spectrometry
Ions
Composite materials

All Science Journal Classification (ASJC) codes

  • Analytical Chemistry

Cite this

Kim, Seongho ; Koo, Imhoi ; Jeong, Jaesik ; Wu, Shiwen ; Shi, Xue ; Zhang, Xiang. / Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data. In: Analytical chemistry. 2012 ; Vol. 84, No. 15. pp. 6477-6487.
@article{7daa58ddddd44a4ca6f8f6c9c4962fa0,
title = "Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data",
abstract = "Compound identification is a key component of data analysis in the applications of gas chromatography-mass spectrometry (GC-MS). Currently, the most widely used compound identification is mass spectrum matching, in which the dot product and its composite version are employed as spectral similarity measures. Several forms of transformations for fragment ion intensities have also been proposed to increase the accuracy of compound identification. In this study, we introduced partial and semipartial correlations as mass spectral similarity measures and applied them to identify compounds along with different transformations of peak intensity. The mixture versions of the proposed method were also developed to further improve the accuracy of compound identification. To demonstrate the performance of the proposed spectral similarity measures, the National Institute of Standards and Technology (NIST) mass spectral library and replicate spectral library were used as the reference library and the query spectra, respectively. Identification results showed that the mixture partial and semipartial correlations always outperform both the dot product and its composite measure. The mixture similarity with semipartial correlation has the highest accuracy of 84.6{\%} in compound identification with a transformation of (0.53,1.3) for fragment ion intensity and m/z value, respectively.",
author = "Seongho Kim and Imhoi Koo and Jaesik Jeong and Shiwen Wu and Xue Shi and Xiang Zhang",
year = "2012",
month = "8",
day = "7",
doi = "10.1021/ac301350n",
language = "English (US)",
volume = "84",
pages = "6477--6487",
journal = "Analytical Chemistry",
issn = "0003-2700",
publisher = "American Chemical Society",
number = "15",

}

Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data. / Kim, Seongho; Koo, Imhoi; Jeong, Jaesik; Wu, Shiwen; Shi, Xue; Zhang, Xiang.

In: Analytical chemistry, Vol. 84, No. 15, 07.08.2012, p. 6477-6487.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data

AU - Kim, Seongho

AU - Koo, Imhoi

AU - Jeong, Jaesik

AU - Wu, Shiwen

AU - Shi, Xue

AU - Zhang, Xiang

PY - 2012/8/7

Y1 - 2012/8/7

N2 - Compound identification is a key component of data analysis in the applications of gas chromatography-mass spectrometry (GC-MS). Currently, the most widely used compound identification is mass spectrum matching, in which the dot product and its composite version are employed as spectral similarity measures. Several forms of transformations for fragment ion intensities have also been proposed to increase the accuracy of compound identification. In this study, we introduced partial and semipartial correlations as mass spectral similarity measures and applied them to identify compounds along with different transformations of peak intensity. The mixture versions of the proposed method were also developed to further improve the accuracy of compound identification. To demonstrate the performance of the proposed spectral similarity measures, the National Institute of Standards and Technology (NIST) mass spectral library and replicate spectral library were used as the reference library and the query spectra, respectively. Identification results showed that the mixture partial and semipartial correlations always outperform both the dot product and its composite measure. The mixture similarity with semipartial correlation has the highest accuracy of 84.6% in compound identification with a transformation of (0.53,1.3) for fragment ion intensity and m/z value, respectively.

AB - Compound identification is a key component of data analysis in the applications of gas chromatography-mass spectrometry (GC-MS). Currently, the most widely used compound identification is mass spectrum matching, in which the dot product and its composite version are employed as spectral similarity measures. Several forms of transformations for fragment ion intensities have also been proposed to increase the accuracy of compound identification. In this study, we introduced partial and semipartial correlations as mass spectral similarity measures and applied them to identify compounds along with different transformations of peak intensity. The mixture versions of the proposed method were also developed to further improve the accuracy of compound identification. To demonstrate the performance of the proposed spectral similarity measures, the National Institute of Standards and Technology (NIST) mass spectral library and replicate spectral library were used as the reference library and the query spectra, respectively. Identification results showed that the mixture partial and semipartial correlations always outperform both the dot product and its composite measure. The mixture similarity with semipartial correlation has the highest accuracy of 84.6% in compound identification with a transformation of (0.53,1.3) for fragment ion intensity and m/z value, respectively.

UR - http://www.scopus.com/inward/record.url?scp=84864590019&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84864590019&partnerID=8YFLogxK

U2 - 10.1021/ac301350n

DO - 10.1021/ac301350n

M3 - Article

C2 - 22794294

AN - SCOPUS:84864590019

VL - 84

SP - 6477

EP - 6487

JO - Analytical Chemistry

JF - Analytical Chemistry

SN - 0003-2700

IS - 15

ER -