Designing a value based niche search engine using evolutionary strategies

Sourav Sengupta, Bernard J. Jansen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

Original languageEnglish (US)
Title of host publicationProceedings ITCC 2005 - International Conference on Information Technology
Subtitle of host publicationCoding and Computing
EditorsH. Selvaraj, P.K. Srimani
Pages800-805
Number of pages6
Volume1
StatePublished - Sep 21 2005
EventITCC 2005 - International Conference on Information Technology: Coding and Computing - Las Vegas, NV, United States
Duration: Apr 4 2005Apr 6 2005

Other

OtherITCC 2005 - International Conference on Information Technology: Coding and Computing
CountryUnited States
CityLas Vegas, NV
Period4/4/054/6/05

Fingerprint

Search engines
Intranets
Evolutionary algorithms

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Sengupta, S., & Jansen, B. J. (2005). Designing a value based niche search engine using evolutionary strategies. In H. Selvaraj, & P. K. Srimani (Eds.), Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing (Vol. 1, pp. 800-805)
Sengupta, Sourav ; Jansen, Bernard J. / Designing a value based niche search engine using evolutionary strategies. Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing. editor / H. Selvaraj ; P.K. Srimani. Vol. 1 2005. pp. 800-805
@inproceedings{a5bbf204e0524e1290ff08dd116eaf45,
title = "Designing a value based niche search engine using evolutionary strategies",
abstract = "The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40{\%} in generation 1 of the algorithm to nearly 90{\%} in generation 10,000.",
author = "Sourav Sengupta and Jansen, {Bernard J.}",
year = "2005",
month = "9",
day = "21",
language = "English (US)",
isbn = "0769523153",
volume = "1",
pages = "800--805",
editor = "H. Selvaraj and P.K. Srimani",
booktitle = "Proceedings ITCC 2005 - International Conference on Information Technology",

}

Sengupta, S & Jansen, BJ 2005, Designing a value based niche search engine using evolutionary strategies. in H Selvaraj & PK Srimani (eds), Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing. vol. 1, pp. 800-805, ITCC 2005 - International Conference on Information Technology: Coding and Computing, Las Vegas, NV, United States, 4/4/05.

Designing a value based niche search engine using evolutionary strategies. / Sengupta, Sourav; Jansen, Bernard J.

Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing. ed. / H. Selvaraj; P.K. Srimani. Vol. 1 2005. p. 800-805.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Designing a value based niche search engine using evolutionary strategies

AU - Sengupta, Sourav

AU - Jansen, Bernard J.

PY - 2005/9/21

Y1 - 2005/9/21

N2 - The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

AB - The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

UR - http://www.scopus.com/inward/record.url?scp=24744467371&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=24744467371&partnerID=8YFLogxK

M3 - Conference contribution

SN - 0769523153

SN - 9780769523156

VL - 1

SP - 800

EP - 805

BT - Proceedings ITCC 2005 - International Conference on Information Technology

A2 - Selvaraj, H.

A2 - Srimani, P.K.

ER -

Sengupta S, Jansen BJ. Designing a value based niche search engine using evolutionary strategies. In Selvaraj H, Srimani PK, editors, Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing. Vol. 1. 2005. p. 800-805