Learning classifiers from large databases using statistical queries

Neeraj Koul, Cornelia Caragea, Vasant Honavar, Vikas Bahirwani, Doina Caragea

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Citations (Scopus)

Abstract

We describe an approach to learning predictive models from large databases in settings where direct access to data is not available because of massive size of data, access restrictions, or bandwidth requirements. We outline some techniques for minimizing the number of statistical queries needed and for efficiently coping with missing values in the data. We provide an open source implementation of the decision tree and Naive Bayes algorithms to demonstrate the feasibility of the proposed approach.
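To make the idea of learning from statistical queries concrete, here is a minimal Python sketch of a Naive Bayes learner that never reads individual records and only asks a data source for counts. It is an illustration under stated assumptions, not the authors' implementation: the `CountOracle` class, its `count` method, the Laplace smoothing, and the toy weather data are all hypothetical stand-ins chosen for this example. The in-memory list simply plays the role of a remote database that answers count queries.

```python
from math import log


class CountOracle:
    """Hypothetical stand-in for a remote database that answers only count queries."""

    def __init__(self, rows):
        self._rows = rows   # hidden from the learner; only count() is exposed
        self.queries = 0    # number of statistical queries issued so far

    def count(self, **conditions):
        """Return the number of rows matching every attribute=value condition."""
        self.queries += 1
        return sum(
            all(row.get(attr) == val for attr, val in conditions.items())
            for row in self._rows
        )


def learn_naive_bayes(oracle, attributes, class_attr, classes):
    """Estimate Naive Bayes parameters from counts alone (Laplace-smoothed)."""
    total = oracle.count()  # one query for the total number of rows
    model = {"prior": {}, "cond": {}}
    for c in classes:
        n_c = oracle.count(**{class_attr: c})
        model["prior"][c] = (n_c + 1) / (total + len(classes))
        for attr, values in attributes.items():
            for v in values:
                n_cv = oracle.count(**{class_attr: c, attr: v})
                model["cond"][(attr, v, c)] = (n_cv + 1) / (n_c + len(values))
    return model


def predict(model, instance):
    """Return the class with the highest log-posterior score for the instance."""
    scores = {
        c: log(p) + sum(log(model["cond"][(a, v, c)]) for a, v in instance.items())
        for c, p in model["prior"].items()
    }
    return max(scores, key=scores.get)


# Toy data (invented for illustration only).
rows = [
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "no", "play": "yes"},
]
attributes = {"outlook": ["sunny", "rainy"], "windy": ["yes", "no"]}

oracle = CountOracle(rows)
model = learn_naive_bayes(oracle, attributes, "play", ["yes", "no"])
label = predict(model, {"outlook": "sunny", "windy": "no"})
print(label, oracle.queries)
```

In this sketch the learner issues one query for the total, one per class, and one per (class, attribute, value) combination, so the query count depends on the number of classes and attribute values rather than on the number of rows. The same kind of counts would also be enough to compute the information gain used when growing a decision tree.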

Original language: English (US)
Title of host publication: Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008
Pages: 923-926
Number of pages: 4
DOIs: https://doi.org/10.1109/WIIAT.2008.366
State: Published - Dec 1 2008
Event: 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008 - Sydney, NSW, Australia
Duration: Dec 9 2008 to Dec 12 2008

Publication series

Name: Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008

Other

Other: 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008
Country: Australia
City: Sydney, NSW
Period: 12/9/08 to 12/12/08

Fingerprint

Decision trees
Classifiers
Bandwidth

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Electrical and Electronic Engineering

Cite this

Koul, N., Caragea, C., Honavar, V., Bahirwani, V., & Caragea, D. (2008). Learning classifiers from large databases using statistical queries. In Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008 (pp. 923-926). [4740577] (Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008). https://doi.org/10.1109/WIIAT.2008.366
Koul, Neeraj ; Caragea, Cornelia ; Honavar, Vasant ; Bahirwani, Vikas ; Caragea, Doina. / Learning classifiers from large databases using statistical queries. Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008. 2008. pp. 923-926 (Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008).
@inproceedings{fa31580319994b299074b21227413c40,
title = "Learning classifiers from large databases using statistical queries",
abstract = "We describe an approach to learning predictive models from large databases in settings where direct access to data is not available because of massive size of data, access restrictions, or bandwidth requirements. We outline some techniques for minimizing the number of statistical queries needed and for efficiently coping with missing values in the data. We provide an open source implementation of the decision tree and Naive Bayes algorithms to demonstrate the feasibility of the proposed approach.",
author = "Neeraj Koul and Cornelia Caragea and Vasant Honavar and Vikas Bahirwani and Doina Caragea",
year = "2008",
month = "12",
day = "1",
doi = "10.1109/WIIAT.2008.366",
language = "English (US)",
isbn = "9780769534961",
series = "Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008",
pages = "923--926",
booktitle = "Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008",

}

Koul, N, Caragea, C, Honavar, V, Bahirwani, V & Caragea, D 2008, Learning classifiers from large databases using statistical queries. in Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008., 4740577, Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008, pp. 923-926, 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008, Sydney, NSW, Australia, 12/9/08. https://doi.org/10.1109/WIIAT.2008.366

Learning classifiers from large databases using statistical queries. / Koul, Neeraj; Caragea, Cornelia; Honavar, Vasant; Bahirwani, Vikas; Caragea, Doina.

Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008. 2008. p. 923-926 4740577 (Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008).

TY - GEN
T1 - Learning classifiers from large databases using statistical queries
AU - Koul, Neeraj
AU - Caragea, Cornelia
AU - Honavar, Vasant
AU - Bahirwani, Vikas
AU - Caragea, Doina
PY - 2008/12/1
Y1 - 2008/12/1
AB - We describe an approach to learning predictive models from large databases in settings where direct access to data is not available because of massive size of data, access restrictions, or bandwidth requirements. We outline some techniques for minimizing the number of statistical queries needed and for efficiently coping with missing values in the data. We provide an open source implementation of the decision tree and Naive Bayes algorithms to demonstrate the feasibility of the proposed approach.
UR - http://www.scopus.com/inward/record.url?scp=62949221087&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=62949221087&partnerID=8YFLogxK
U2 - 10.1109/WIIAT.2008.366
DO - 10.1109/WIIAT.2008.366
M3 - Conference contribution
AN - SCOPUS:62949221087
SN - 9780769534961
T3 - Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008
SP - 923
EP - 926
BT - Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008
ER -

Koul N, Caragea C, Honavar V, Bahirwani V, Caragea D. Learning classifiers from large databases using statistical queries. In Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008. 2008. p. 923-926. 4740577. (Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008). https://doi.org/10.1109/WIIAT.2008.366