Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.dss.2007.07.006
- Scopus: eid_2-s2.0-44849099980
- WOS: WOS:000257566900021
Article: SpidersRUs: Creating specialized search engines in multiple languages
Title | SpidersRUs: Creating specialized search engines in multiple languages |
---|---|
Authors | Chau, M; Qin, J; Zhou, Y; Tseng, C; Chen, H |
Keywords | Information retrieval; Multilingual search engines; Search engine development |
Issue Date | 2008 |
Publisher | Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/dss |
Citation | Decision Support Systems, 2008, v. 45 n. 3, p. 621-640 |
Abstract | While small-scale search engines in specific domains and languages are increasingly used by Web users, most existing search engine development tools do not support the development of search engines in languages other than English, cannot be integrated with other applications, or rely on proprietary software. A tool that supports search engine creation in multiple languages is thus highly desired. To study the research issues involved, we review related literature and suggest the criteria for an ideal search tool. We present the design of a toolkit, called SpidersRUs, developed for multilingual search engine creation. The design and implementation of the tool, consisting of a Spider module, an Indexer module, an Index Structure, a Search module, and a Graphical User Interface module, are discussed in detail. A sample user session and a case study on using the tool to develop a medical search engine in Chinese are also presented. The technical issues involved and the lessons learned in the project are then discussed. This study demonstrates that the proposed architecture is feasible in developing search engines easily in different languages such as Chinese, Spanish, Japanese, and Arabic. © 2007 Elsevier B.V. All rights reserved. |
Persistent Identifier | http://hdl.handle.net/10722/85965 |
ISSN | 0167-9236 (2023 Impact Factor: 6.7; 2023 SCImago Journal Rankings: 2.211) |
ISI Accession Number ID | WOS:000257566900021 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chau, M | en_HK |
dc.contributor.author | Qin, J | en_HK |
dc.contributor.author | Zhou, Y | en_HK |
dc.contributor.author | Tseng, C | en_HK |
dc.contributor.author | Chen, H | en_HK |
dc.date.accessioned | 2010-09-06T09:11:16Z | - |
dc.date.available | 2010-09-06T09:11:16Z | - |
dc.date.issued | 2008 | en_HK |
dc.identifier.citation | Decision Support Systems, 2008, v. 45 n. 3, p. 621-640 | en_HK |
dc.identifier.issn | 0167-9236 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/85965 | - |
dc.description.abstract | While small-scale search engines in specific domains and languages are increasingly used by Web users, most existing search engine development tools do not support the development of search engines in languages other than English, cannot be integrated with other applications, or rely on proprietary software. A tool that supports search engine creation in multiple languages is thus highly desired. To study the research issues involved, we review related literature and suggest the criteria for an ideal search tool. We present the design of a toolkit, called SpidersRUs, developed for multilingual search engine creation. The design and implementation of the tool, consisting of a Spider module, an Indexer module, an Index Structure, a Search module, and a Graphical User Interface module, are discussed in detail. A sample user session and a case study on using the tool to develop a medical search engine in Chinese are also presented. The technical issues involved and the lessons learned in the project are then discussed. This study demonstrates that the proposed architecture is feasible in developing search engines easily in different languages such as Chinese, Spanish, Japanese, and Arabic. © 2007 Elsevier B.V. All rights reserved. | en_HK |
dc.language | eng | en_HK |
dc.publisher | Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/dss | en_HK |
dc.relation.ispartof | Decision Support Systems | en_HK |
dc.rights | Decision Support Systems. Copyright © Elsevier BV. | en_HK |
dc.subject | Information retrieval | en_HK |
dc.subject | Multilingual search engines | en_HK |
dc.subject | Search engine development | en_HK |
dc.title | SpidersRUs: Creating specialized search engines in multiple languages | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0167-9236&volume=45&issue=3&spage=621&epage=640&date=2008&atitle=SpidersRUs:+Creating+Specialized+Search+Engines+in+Multiple+Languages | en_HK |
dc.identifier.email | Chau, M: mchau@hkucc.hku.hk | en_HK |
dc.identifier.authority | Chau, M=rp01051 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.dss.2007.07.006 | en_HK |
dc.identifier.scopus | eid_2-s2.0-44849099980 | en_HK |
dc.identifier.hkuros | 148564 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-44849099980&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 45 | en_HK |
dc.identifier.issue | 3 | en_HK |
dc.identifier.spage | 621 | en_HK |
dc.identifier.epage | 640 | en_HK |
dc.identifier.isi | WOS:000257566900021 | - |
dc.publisher.place | Netherlands | en_HK |
dc.identifier.scopusauthorid | Chau, M=7006073763 | en_HK |
dc.identifier.scopusauthorid | Qin, J=7402896547 | en_HK |
dc.identifier.scopusauthorid | Zhou, Y=7405368400 | en_HK |
dc.identifier.scopusauthorid | Tseng, C=8965979100 | en_HK |
dc.identifier.scopusauthorid | Chen, H=8871373800 | en_HK |
dc.identifier.issnl | 0167-9236 | - |