Article: Automatic expansion of abbreviations in Chinese news text
Field | Value |
---|---|
Title | Automatic expansion of abbreviations in Chinese news text |
Authors | Fu, G; Luke, KK; Zhou, G; Xu, R |
Issue Date | 2006 |
Publisher | Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/ |
Citation | Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2006, v. 4182 LNCS, p. 530-536 |
Abstract | This paper presents an n-gram based approach to Chinese abbreviation expansion. In this study, we distinguish reduced abbreviations from non-reduced abbreviations that are created by elimination or generalization. For a reduced abbreviation, a mapping table is compiled to map each short-word in it to a set of long-words, and a bigram based Viterbi algorithm is thus applied to decode an appropriate combination of long-words as its full-form. For a non-reduced abbreviation, a dictionary of non-reduced abbreviation/full-form pairs is used to generate its expansion candidates, and a disambiguation technique is further employed to select a proper expansion based on bigram word segmentation. The evaluation on an abbreviation-expanded corpus built from the PKU corpus showed that the proposed system achieved a recall of 82.9% and a precision of 85.5% on average for different types of abbreviations in Chinese news text. © Springer-Verlag Berlin Heidelberg 2006. |
Persistent Identifier | http://hdl.handle.net/10722/75074 |
ISSN | 0302-9743 (2023 SCImago Journal Rankings: 0.606) |
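
The abstract describes decoding a reduced abbreviation by mapping each short-word to a set of long-words and then running a bigram-based Viterbi search over the candidates. The following is a minimal sketch of that idea only, not the authors' implementation: the mapping table `MAPPING`, the bigram probabilities `BIGRAM`, the smoothing floor `FLOOR`, and the function names `log_p` and `expand` are all hypothetical, and the numbers are made up for illustration.

```python
import math

# Hypothetical mapping table: each short-word maps to candidate long-words.
MAPPING = {
    "北": ["北京", "北方"],
    "大": ["大学", "大会"],
}

# Hypothetical bigram probabilities P(word | prev); "<s>" marks the start.
BIGRAM = {
    ("<s>", "北京"): 0.6,
    ("<s>", "北方"): 0.4,
    ("北京", "大学"): 0.7,
    ("北京", "大会"): 0.1,
    ("北方", "大学"): 0.2,
    ("北方", "大会"): 0.1,
}

FLOOR = 1e-6  # crude floor for unseen bigrams (an assumption, not from the paper)


def log_p(prev, word):
    """Log bigram probability, falling back to FLOOR for unseen pairs."""
    return math.log(BIGRAM.get((prev, word), FLOOR))


def expand(short_words):
    """Return the highest-scoring long-word sequence under the bigram model."""
    # best[w] = (log score, path) of the best partial expansion ending in w
    best = {"<s>": (0.0, [])}
    for short in short_words:
        # Short-words missing from the table are passed through unchanged.
        candidates = MAPPING.get(short, [short])
        new_best = {}
        for cand in candidates:
            # Viterbi step: keep only the best predecessor for each candidate.
            score, path = max(
                ((s + log_p(prev, cand), p) for prev, (s, p) in best.items()),
                key=lambda item: item[0],
            )
            new_best[cand] = (score, path + [cand])
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]
```

With this toy data, `"".join(expand(["北", "大"]))` yields `北京大学`. Working in log space avoids underflow on longer abbreviations, and keeping one best path per candidate at each position is what makes the search linear in the number of short-words rather than exponential in the number of candidate combinations.
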
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Fu, G | en_HK |
dc.contributor.author | Luke, KK | en_HK |
dc.contributor.author | Zhou, G | en_HK |
dc.contributor.author | Xu, R | en_HK |
dc.date.accessioned | 2010-09-06T07:07:36Z | - |
dc.date.available | 2010-09-06T07:07:36Z | - |
dc.date.issued | 2006 | en_HK |
dc.identifier.citation | Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2006, v. 4182 LNCS, p. 530-536 | en_HK |
dc.identifier.issn | 0302-9743 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/75074 | - |
dc.description.abstract | This paper presents an n-gram based approach to Chinese abbreviation expansion. In this study, we distinguish reduced abbreviations from non-reduced abbreviations that are created by elimination or generalization. For a reduced abbreviation, a mapping table is compiled to map each short-word in it to a set of long-words, and a bigram based Viterbi algorithm is thus applied to decode an appropriate combination of long-words as its full-form. For a non-reduced abbreviation, a dictionary of non-reduced abbreviation/full-form pairs is used to generate its expansion candidates, and a disambiguation technique is further employed to select a proper expansion based on bigram word segmentation. The evaluation on an abbreviation-expanded corpus built from the PKU corpus showed that the proposed system achieved a recall of 82.9% and a precision of 85.5% on average for different types of abbreviations in Chinese news text. © Springer-Verlag Berlin Heidelberg 2006. | en_HK |
dc.language | eng | en_HK |
dc.publisher | Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/ | en_HK |
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en_HK |
dc.title | Automatic expansion of abbreviations in Chinese news text | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0302-9743&volume=4182&spage=530&epage=536&date=2006&atitle=Automatic+expansion+of+abbreviations+in+Chinese+news+text | en_HK |
dc.identifier.email | Luke, KK:kkluke@hkusua.hku.hk | en_HK |
dc.identifier.authority | Luke, KK=rp01201 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.scopus | eid_2-s2.0-33751359259 | en_HK |
dc.identifier.hkuros | 138526 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-33751359259&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 4182 LNCS | en_HK |
dc.identifier.spage | 530 | en_HK |
dc.identifier.epage | 536 | en_HK |
dc.publisher.place | Germany | en_HK |
dc.identifier.scopusauthorid | Fu, G=7202721096 | en_HK |
dc.identifier.scopusauthorid | Luke, KK=7003697439 | en_HK |
dc.identifier.scopusauthorid | Zhou, G=7403686010 | en_HK |
dc.identifier.scopusauthorid | Xu, R=35520467000 | en_HK |
dc.identifier.issnl | 0302-9743 | - |