Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/JCDL.2004.240416
- Scopus: eid_2-s2.0-4944246916
Conference Paper: Building domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling method
Title | Building domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling method |
---|---|
Authors | Qin, J; Zhou, Y; Chau, M |
Keywords | Digital libraries; Domain-specific collection building; Focused crawling; Meta-search; Web search algorithm |
Issue Date | 2004 |
Publisher | IEEE. |
Citation | ACM / IEEE Joint Conference on Digital Libraries Proceedings, Tucson, Arizona, USA, 7-11 June 2004, p. 135-141 |
Abstract | Collecting domain-specific documents from the Web using focused crawlers has been considered one of the most important strategies to build digital libraries that serve the scientific community. However, because most focused crawlers use local search algorithms to traverse the Web space, they could be easily trapped within a limited sub-graph of the Web that surrounds the starting URLs and build domain-specific collections that are not comprehensive and diverse enough to scientists and researchers. In this study, we investigated the problems of traditional focused crawlers caused by local search algorithms and proposed a new crawling approach, meta-search enhanced focused crawling, to address the problems. We conducted two user evaluation experiments to examine the performance of our proposed approach and the results showed that our approach could build domain-specific collections with higher quality than traditional focused crawling techniques. |
Persistent Identifier | http://hdl.handle.net/10722/47075 |
ISSN | 1552-5996 (2020 SCImago Journal Rankings: 0.264) |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Qin, J | en_HK |
dc.contributor.author | Zhou, Y | en_HK |
dc.contributor.author | Chau, M | en_HK |
dc.date.accessioned | 2007-10-30T07:06:25Z | - |
dc.date.available | 2007-10-30T07:06:25Z | - |
dc.date.issued | 2004 | en_HK |
dc.identifier.citation | ACM / IEEE Joint Conference on Digital Libraries Proceedings, Tucson, Arizona, USA, 7-11 June 2004, p. 135-141 | en_HK |
dc.identifier.issn | 1552-5996 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/47075 | - |
dc.description.abstract | Collecting domain-specific documents from the Web using focused crawlers has been considered one of the most important strategies to build digital libraries that serve the scientific community. However, because most focused crawlers use local search algorithms to traverse the Web space, they could be easily trapped within a limited sub-graph of the Web that surrounds the starting URLs and build domain-specific collections that are not comprehensive and diverse enough to scientists and researchers. In this study, we investigated the problems of traditional focused crawlers caused by local search algorithms and proposed a new crawling approach, meta-search enhanced focused crawling, to address the problems. We conducted two user evaluation experiments to examine the performance of our proposed approach and the results showed that our approach could build domain-specific collections with higher quality than traditional focused crawling techniques. | en_HK |
dc.format.extent | 296016 bytes | - |
dc.format.extent | 2605 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | text/plain | - |
dc.language | eng | en_HK |
dc.publisher | IEEE. | en_HK |
dc.relation.ispartof | Proceedings of the ACM IEEE International Conference on Digital Libraries, JCDL 2004 | en_HK |
dc.rights | ©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | en_HK |
dc.subject | Digital libraries | en_HK |
dc.subject | Domain-specific collection building | en_HK |
dc.subject | Focused crawling | en_HK |
dc.subject | Meta-search | en_HK |
dc.subject | Web search algorithm | en_HK |
dc.title | Building domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling method | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Chau, M: mchau@hkucc.hku.hk | en_HK |
dc.identifier.authority | Chau, M=rp01051 | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1109/JCDL.2004.240416 | en_HK |
dc.identifier.scopus | eid_2-s2.0-4944246916 | en_HK |
dc.identifier.hkuros | 91965 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-4944246916&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 135 | en_HK |
dc.identifier.epage | 141 | en_HK |
dc.identifier.scopusauthorid | Qin, J=7402896547 | en_HK |
dc.identifier.scopusauthorid | Zhou, Y=7405368400 | en_HK |
dc.identifier.scopusauthorid | Chau, M=7006073763 | en_HK |
dc.identifier.issnl | 1552-5996 | - |