Improving search ranking of geospatial data based on deep learning using user behavior data

Yun Li, Yongyao Jiang, Chaowei Yang, Manzhu Yu, Lara Kamal, Edward M. Armstrong, Thomas Huang, David Moroni, Lewis J. McGibbney

Research output: Contribution to journalArticle

Abstract

Finding geospatial data has been a big challenge regarding the data size and heterogeneity across various domains. Previous work has explored using machine learning to improve geospatial data search ranking, but it usually relies on training data labelled by subject matter experts, which makes it laborious and costly to apply to scenarios in which data relevancy to a query can change over time. When a user interacts with a search engine, plenteous information is recorded in the log file, which is essentially free, sustainable and up-to-the-minute. In this research, we propose a deep learning-based search ranking framework that can expeditiously update the ranking model through capturing real-time user clickstream data. The contributions of the proposed framework consist of 1) a log parser that can ingest and parse Web logs that record users’ behavior in a real-time manner; 2) a set of hypotheses of modelling the relative relevance of data; and 3) a deep learning based ranking model which can be updated dynamically with the increment of user behavior data. Quantitative comparison with a few other machine learning algorithms suggests substantial improvement.

Original languageEnglish (US)
Article number104520
JournalComputers and Geosciences
Volume142
DOIs
StatePublished - Sep 2020

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computers in Earth Sciences

Fingerprint Dive into the research topics of 'Improving search ranking of geospatial data based on deep learning using user behavior data'. Together they form a unique fingerprint.

  • Cite this

    Li, Y., Jiang, Y., Yang, C., Yu, M., Kamal, L., Armstrong, E. M., Huang, T., Moroni, D., & McGibbney, L. J. (2020). Improving search ranking of geospatial data based on deep learning using user behavior data. Computers and Geosciences, 142, [104520]. https://doi.org/10.1016/j.cageo.2020.104520