File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Building domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling method

TitleBuilding domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling method
Authors
KeywordsDigital libraries
Domain-specific collection building
Focused crawling
Meta-search
Web search algorithm
Issue Date2004
PublisherIEEE.
Citation
ACM / IEEE Joint Conference on Digital Libraries Proceedings, Tuscon, Arizona, USA, 7-11 June 2004, p. 135-141 How to Cite?
AbstractCollecting domain-specific documents from the Web using focused crawlers has been considered one of the most important strategies to build digital libraries that serve the scientific community. However, because most focused crawlers use local search algorithms to traverse the Web space, they could be easily trapped within a limited sub-graph of the Web that surrounds the starting URLs and build domain-specific collections that are not comprehensive and diverse enough to scientists and researchers. In this study, we investigated the problems of traditional focused crawlers caused by local search algorithms and proposed a new crawling approach, meta-search enhanced focused crawling, to address the problems. We conducted two user evaluation experiments to examine the performance of our proposed approach and the results showed that our approach could build domain-specific collections with higher quality than traditional focused crawling techniques.
Persistent Identifierhttp://hdl.handle.net/10722/47075
ISSN
2020 SCImago Journal Rankings: 0.264
References

 

DC FieldValueLanguage
dc.contributor.authorQin, Jen_HK
dc.contributor.authorZhou, Yen_HK
dc.contributor.authorChau, Men_HK
dc.date.accessioned2007-10-30T07:06:25Z-
dc.date.available2007-10-30T07:06:25Z-
dc.date.issued2004en_HK
dc.identifier.citationACM / IEEE Joint Conference on Digital Libraries Proceedings, Tuscon, Arizona, USA, 7-11 June 2004, p. 135-141en_HK
dc.identifier.issn1552-5996en_HK
dc.identifier.urihttp://hdl.handle.net/10722/47075-
dc.description.abstractCollecting domain-specific documents from the Web using focused crawlers has been considered one of the most important strategies to build digital libraries that serve the scientific community. However, because most focused crawlers use local search algorithms to traverse the Web space, they could be easily trapped within a limited sub-graph of the Web that surrounds the starting URLs and build domain-specific collections that are not comprehensive and diverse enough to scientists and researchers. In this study, we investigated the problems of traditional focused crawlers caused by local search algorithms and proposed a new crawling approach, meta-search enhanced focused crawling, to address the problems. We conducted two user evaluation experiments to examine the performance of our proposed approach and the results showed that our approach could build domain-specific collections with higher quality than traditional focused crawling techniques.en_HK
dc.format.extent296016 bytes-
dc.format.extent2605 bytes-
dc.format.mimetypeapplication/pdf-
dc.format.mimetypetext/plain-
dc.languageengen_HK
dc.publisherIEEE.en_HK
dc.relation.ispartofProceedings of the ACM IEEE International Conference on Digital Libraries, JCDL 2004en_HK
dc.rights©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.en_HK
dc.subjectDigital librariesen_HK
dc.subjectDomain-specific collection buildingen_HK
dc.subjectFocused crawlingen_HK
dc.subjectMeta-searchen_HK
dc.subjectWeb search algorithmen_HK
dc.titleBuilding domain-specific Web collections for scientific digital libraries: A meta-search enhanced focused crawling methoden_HK
dc.typeConference_Paperen_HK
dc.identifier.emailChau, M: mchau@hkucc.hku.hken_HK
dc.identifier.authorityChau, M=rp01051en_HK
dc.description.naturepublished_or_final_versionen_HK
dc.identifier.doi10.1109/JCDL.2004.240416en_HK
dc.identifier.scopuseid_2-s2.0-4944246916en_HK
dc.identifier.hkuros91965-
dc.relation.referenceshttp://www.scopus.com/mlt/select.url?eid=2-s2.0-4944246916&selection=ref&src=s&origin=recordpageen_HK
dc.identifier.spage135en_HK
dc.identifier.epage141en_HK
dc.identifier.scopusauthoridQin, J=7402896547en_HK
dc.identifier.scopusauthoridZhou, Y=7405368400en_HK
dc.identifier.scopusauthoridChau, M=7006073763en_HK
dc.identifier.issnl1552-5996-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats