Abstract
Many web databases are hidden behind restrictive form-like interfaces which may or may not provide domain information for an attribute. When attribute domains are not available, domain discovery becomes a critical challenge facing the application of a broad range of existing techniques on third-party analytical and mash-up applications over hidden databases. In this paper, we consider the problem of domain discovery over a hidden database through its web interface. We prove that for any database schema, an achievability guarantee on domain discovery can be made based solely upon the interface design. We also develop novel techniques which provide effective guarantees on the comprehensiveness of domain discovery. We present theoretical analysis and extensive experiments to illustrate the effectiveness of our approach.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of SIGMOD 2011 and PODS 2011 |
Pages | 553-564 |
Number of pages | 12 |
DOIs | |
State | Published - Jul 11 2011 |
Event | 2011 ACM SIGMOD and 30th PODS 2011 Conference - Athens, Greece Duration: Jun 12 2011 → Jun 16 2011 |
Other
Other | 2011 ACM SIGMOD and 30th PODS 2011 Conference |
---|---|
Country | Greece |
City | Athens |
Period | 6/12/11 → 6/16/11 |
All Science Journal Classification (ASJC) codes
- Software
- Information Systems