IR-tree: An efficient index for geographic document search

Zhisheng Li, Ken C.K. Lee, Baihua Zheng, Wang-chien Lee, Dik Lee, Xufa Wang

Research output: Contribution to journal > Article

168 Citations (Scopus)

Abstract

Given a geographic query that is composed of query keywords and a location, a geographic search engine retrieves documents that are the most textually and spatially relevant to the query keywords and the location, respectively, and ranks the retrieved documents according to their joint textual and spatial relevances to the query. The lack of an efficient index that can simultaneously handle both the textual and spatial aspects of the documents makes existing geographic search engines inefficient in answering geographic queries. In this paper, we propose an efficient index, called IR-tree, that together with a top-k document search algorithm facilitates four major tasks in document searches, namely, 1) spatial filtering, 2) textual filtering, 3) relevance computation, and 4) document ranking in a fully integrated manner. In addition, IR-tree allows searches to adopt different weights on textual and spatial relevance of documents at the runtime and thus caters for a wide variety of applications. A set of comprehensive experiments over a wide range of scenarios has been conducted and the experiment results demonstrate that IR-tree outperforms the state-of-the-art approaches for geographic document searches.
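As a rough illustration of the ranking described in the abstract, the sketch below scores each document by a weighted sum of textual and spatial relevance and returns the top k. The linear combination, the weight `alpha`, the term-frequency text score, and the distance normalization by `max_dist` are all assumptions made for illustration. The abstract does not state the paper's scoring formula, and this brute-force scan does not use the IR-tree index itself.

```python
import heapq
import math
from dataclasses import dataclass


@dataclass
class Doc:
    doc_id: str
    text: str
    x: float
    y: float


def textual_relevance(doc, keywords):
    """Fraction of the document's words that match a query keyword (assumed measure)."""
    words = doc.text.lower().split()
    if not words:
        return 0.0
    wanted = {k.lower() for k in keywords}
    return sum(1 for w in words if w in wanted) / len(words)


def spatial_relevance(doc, location, max_dist):
    """1 at the query location, falling linearly to 0 at max_dist (assumed measure)."""
    dist = math.hypot(doc.x - location[0], doc.y - location[1])
    return max(0.0, 1.0 - dist / max_dist)


def top_k(docs, keywords, location, k, alpha=0.5, max_dist=10.0):
    """Rank by alpha * textual + (1 - alpha) * spatial; alpha is chosen per query."""
    def score(doc):
        return (alpha * textual_relevance(doc, keywords)
                + (1.0 - alpha) * spatial_relevance(doc, location, max_dist))
    return heapq.nlargest(k, docs, key=score)


# Hypothetical sample documents with 2-D coordinates.
docs = [
    Doc("a", "pizza pizza restaurant", 0.0, 0.0),
    Doc("b", "pizza delivery", 8.0, 0.0),
    Doc("c", "book store", 1.0, 0.0),
]
```

Calling `top_k(docs, ["pizza"], (0.0, 0.0), k=3, alpha=1.0)` ranks purely by text, while `alpha=0.0` ranks purely by distance, which mirrors the runtime-adjustable weighting mentioned in the abstract.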

Original language: English (US)
Article number: 5560653
Pages (from-to): 585-599
Number of pages: 15
Journal: IEEE Transactions on Knowledge and Data Engineering
Volume: 23
Issue number: 4
DOI: 10.1109/TKDE.2010.149
State: Published - Feb 28 2011

Fingerprint

Search engines
Experiments

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

Li, Zhisheng ; Lee, Ken C.K. ; Zheng, Baihua ; Lee, Wang-chien ; Lee, Dik ; Wang, Xufa. / IR-tree : An efficient index for geographic document search. In: IEEE Transactions on Knowledge and Data Engineering. 2011 ; Vol. 23, No. 4. pp. 585-599.
@article{b6d59af56f094218b1f6962e1630ffee,
title = "IR-tree: An efficient index for geographic document search",
abstract = "Given a geographic query that is composed of query keywords and a location, a geographic search engine retrieves documents that are the most textually and spatially relevant to the query keywords and the location, respectively, and ranks the retrieved documents according to their joint textual and spatial relevances to the query. The lack of an efficient index that can simultaneously handle both the textual and spatial aspects of the documents makes existing geographic search engines inefficient in answering geographic queries. In this paper, we propose an efficient index, called IR-tree, that together with a top-k document search algorithm facilitates four major tasks in document searches, namely, 1) spatial filtering, 2) textual filtering, 3) relevance computation, and 4) document ranking in a fully integrated manner. In addition, IR-tree allows searches to adopt different weights on textual and spatial relevance of documents at the runtime and thus caters for a wide variety of applications. A set of comprehensive experiments over a wide range of scenarios has been conducted and the experiment results demonstrate that IR-tree outperforms the state-of-the-art approaches for geographic document searches.",
author = "Zhisheng Li and Lee, {Ken C.K.} and Baihua Zheng and Wang-chien Lee and Dik Lee and Xufa Wang",
year = "2011",
month = "2",
day = "28",
doi = "10.1109/TKDE.2010.149",
language = "English (US)",
volume = "23",
pages = "585--599",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "4",
}

IR-tree : An efficient index for geographic document search. / Li, Zhisheng; Lee, Ken C.K.; Zheng, Baihua; Lee, Wang-chien; Lee, Dik; Wang, Xufa.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 23, No. 4, 5560653, 28.02.2011, p. 585-599.

TY - JOUR

T1 - IR-tree

T2 - An efficient index for geographic document search

AU - Li, Zhisheng

AU - Lee, Ken C.K.

AU - Zheng, Baihua

AU - Lee, Wang-chien

AU - Lee, Dik

AU - Wang, Xufa

PY - 2011/2/28

Y1 - 2011/2/28

N2 - Given a geographic query that is composed of query keywords and a location, a geographic search engine retrieves documents that are the most textually and spatially relevant to the query keywords and the location, respectively, and ranks the retrieved documents according to their joint textual and spatial relevances to the query. The lack of an efficient index that can simultaneously handle both the textual and spatial aspects of the documents makes existing geographic search engines inefficient in answering geographic queries. In this paper, we propose an efficient index, called IR-tree, that together with a top-k document search algorithm facilitates four major tasks in document searches, namely, 1) spatial filtering, 2) textual filtering, 3) relevance computation, and 4) document ranking in a fully integrated manner. In addition, IR-tree allows searches to adopt different weights on textual and spatial relevance of documents at the runtime and thus caters for a wide variety of applications. A set of comprehensive experiments over a wide range of scenarios has been conducted and the experiment results demonstrate that IR-tree outperforms the state-of-the-art approaches for geographic document searches.

UR - http://www.scopus.com/inward/record.url?scp=79951934075&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79951934075&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2010.149

DO - 10.1109/TKDE.2010.149

M3 - Article

AN - SCOPUS:79951934075

VL - 23

SP - 585

EP - 599

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 4

M1 - 5560653

ER -