File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1002/asi.20329
- Scopus: eid_2-s2.0-33645018819
- WOS: WOS:000235870100008
- Find via
Supplementary
-
Bookmarks:
- CiteULike: 1
- Citations:
- Appears in Collections:
Article: Multilingual web retrieval: An experiment in English-Chinese business intelligence
Title | Multilingual web retrieval: An experiment in English-Chinese business intelligence |
---|---|
Authors | |
Issue Date | 2006 |
Publisher | John Wiley & Sons, Inc. The Journal's web site is located at http://www.asis.org/Publications/JASIS/jasis.html |
Citation | Journal Of The American Society For Information Science And Technology, 2006, v. 57 n. 5, p. 671-683 How to Cite? |
Abstract | As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIR), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIR techniques for use in the business domain. A dictionary-based approach was adopted and combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-by-word translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise. © 2006 Wiley Periodicals, Inc. |
Persistent Identifier | http://hdl.handle.net/10722/86047 |
ISSN | 2015 Impact Factor: 2.452 |
ISI Accession Number ID | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Qin, J | en_HK |
dc.contributor.author | Zhou, Y | en_HK |
dc.contributor.author | Chau, M | en_HK |
dc.contributor.author | Chen, H | en_HK |
dc.date.accessioned | 2010-09-06T09:12:12Z | - |
dc.date.available | 2010-09-06T09:12:12Z | - |
dc.date.issued | 2006 | en_HK |
dc.identifier.citation | Journal Of The American Society For Information Science And Technology, 2006, v. 57 n. 5, p. 671-683 | en_HK |
dc.identifier.issn | 1532-2882 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/86047 | - |
dc.description.abstract | As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIR), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIR techniques for use in the business domain. A dictionary-based approach was adopted and combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-by-word translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise. © 2006 Wiley Periodicals, Inc. | en_HK |
dc.language | eng | en_HK |
dc.publisher | John Wiley & Sons, Inc. The Journal's web site is located at http://www.asis.org/Publications/JASIS/jasis.html | en_HK |
dc.relation.ispartof | Journal of the American Society for Information Science and Technology | en_HK |
dc.rights | Journal of the American Society for Information Science and Technology. Copyright © John Wiley & Sons, Inc. | en_HK |
dc.title | Multilingual web retrieval: An experiment in English-Chinese business intelligence | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1532-2882&volume=57&issue=5&spage=671&epage=683&date=2006&atitle=Multilingual+Web+Retrieval:+An+Experiment+in+English-Chinese+Business+Intelligence%27 | en_HK |
dc.identifier.email | Chau, M: mchau@hkucc.hku.hk | en_HK |
dc.identifier.authority | Chau, M=rp01051 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1002/asi.20329 | en_HK |
dc.identifier.scopus | eid_2-s2.0-33645018819 | en_HK |
dc.identifier.hkuros | 121072 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-33645018819&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 57 | en_HK |
dc.identifier.issue | 5 | en_HK |
dc.identifier.spage | 671 | en_HK |
dc.identifier.epage | 683 | en_HK |
dc.identifier.isi | WOS:000235870100008 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Qin, J=7402896547 | en_HK |
dc.identifier.scopusauthorid | Zhou, Y=7405368400 | en_HK |
dc.identifier.scopusauthorid | Chau, M=7006073763 | en_HK |
dc.identifier.scopusauthorid | Chen, H=8871373800 | en_HK |
dc.identifier.citeulike | 819073 | - |
dc.identifier.issnl | 1532-2882 | - |