Regularized f-measure maximization for feature selection and classification

Zhenqiu Liu, Ming Tan, Feng Jiang

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

Receiver Operating Characteristic (ROC) analysis is a common tool for assessing the performance of various classifications. It gained much popularity in medical and other fields including biological markers and, diagnostic test. This is particularly due to the fact that in real-world problems misclassification costs are not known, and thus, ROC curve and related utility functions such as F-measure can be more meaningful performance measures. F-measure combines recall and precision into a global measure. In this paper, we propose a novel method through regularized F-measure maximization. The proposed method assigns different costs to positive and negative samples and does simultaneous feature selection and prediction with L 1 penalty. This method is useful especially when data set is highly unbalanced, or the labels for negative (positive) samples are missing. Our experiments with the benchmark, methylation, and high dimensional microarray data show that the performance of proposed algorithm is better or equivalent compared with the other popular classifiers in limited experiments.

Original languageEnglish (US)
Article number617946
JournalJournal of Biomedicine and Biotechnology
Volume2009
DOIs
StatePublished - 2009

All Science Journal Classification (ASJC) codes

  • Biotechnology
  • Molecular Medicine
  • Molecular Biology
  • Genetics
  • Health, Toxicology and Mutagenesis

Fingerprint Dive into the research topics of 'Regularized f-measure maximization for feature selection and classification'. Together they form a unique fingerprint.

Cite this