Publication Type |
Journal Article |
School or College |
College of Engineering |
Department |
Computing, School of |
Creator |
Freire, Juliana |
Other Author |
Barbosa, Luciano |
Title |
Siphoning hidden-web data through keyword-based interfaces |
Date |
2004-01-01 |
Description |
In this paper, we study the problem of automating the retrieval of data hidden behind simple search interfaces that accept keyword-based queries. Our goal is to automatically retrieve all available results (or, as many as possible). We propose a new approach to siphon hidden data that automatically generates a small set of representative keywords and builds queries which lead to high coverage. We evaluate our algorithms over several real Web sites. Preliminary results indicate our approach is effective: coverage of over 90% is obtained for most of the sites considered. |
Type |
Text |
Publisher |
Elsevier |
Volume |
1 |
Issue |
1 |
First Page |
309 |
Last Page |
321 |
Language |
eng |
Bibliographic Citation |
Barbosa, L., & Freire, J. (2004). Siphoning hidden-web data through keyword-based interfaces, 1(1), 309-21. Proceedings of Brazilian symposium on databases (SBBD). |
Rights Management |
(c) Elsevier ; Authors manuscript from Barbosa, L., & Freire, J. (2004). Siphoning hidden-web data through keyword-based interfaces, 1(1), 309-21. Proceedings of Brazilian symposium on databases (SBBD). http://libra.msra.cn/Publication/1813716/siphoning-hidden-web-data-through-keyword-based-interfaces |
Format Medium |
application/pdf |
Format Extent |
901,907 bytes |
Identifier |
uspace,12354 |
ARK |
ark:/87278/s6v703fr |
Setname |
ir_uspace |
ID |
709267 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6v703fr |