
Conference Paper: A hybrid approach to Chinese abbreviation expansion

Title: A hybrid approach to Chinese abbreviation expansion
Authors: Fu, G; Luke, KK; Zhang, M; Zhou, G
Keywords: Abbreviation disambiguation; Chinese abbreviation expansion; Hidden Markov models (HMMs)
Issue Date: 2006
Publisher: Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/
Citation: 21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006, Singapore, 17-19 December 2006. In Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2006, v. 4285 LNAI, p. 277-287
Abstract: This paper presents a hybrid approach to Chinese abbreviation expansion. In this study, each short-form in Chinese text is assumed to be created by the method of reduction and the method of elimination or generalization, respectively. A mapping table between short words and long words and a dictionary of non-reduced short-form/full-form pairs are thus applied to generate the respective expansion candidates. Then, a hidden Markov model (HMM) based disambiguation is employed to rank these candidates and select a proper expansion for each ambiguous abbreviation. In order to improve expansion accuracy, some linguistic knowledge like discourse information and abbreviation patterns are further employed to double-check the expanded results and revise some error expansions if any. The proposed approach was evaluated on an abbreviation-expanded corpus built from the Peking University Corpus. The results showed that a recall of 83.8% and a precision of 86.3% can be achieved on average for different types of Chinese abbreviations. © 2006 Springer-Verlag.
Persistent Identifier: http://hdl.handle.net/10722/90275
ISSN: 0302-9743
2023 SCImago Journal Rankings: 0.606
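
The two-stage pipeline described in the abstract (candidate generation from a mapping table and a pair dictionary, followed by HMM-based ranking) could be sketched roughly as below. This is only an illustrative sketch, not the authors' implementation: the table entries, the bigram probabilities, and the names `MAPPING_TABLE`, `PAIR_DICTIONARY`, `generate_candidates`, and `expand` are all hypothetical. Emission probabilities are assumed uniform, so only transition scores between full forms influence the Viterbi search, and the later double-checking step with discourse information is not modelled.

```python
import math
from itertools import product

# Hypothetical toy resources; the paper's actual tables are not reproduced here.
# Reduction: each short character maps to possible long words.
MAPPING_TABLE = {"北": ["北京", "北方"], "大": ["大学", "大会"]}
# Elimination/generalization: whole short-form -> full-form pairs.
PAIR_DICTIONARY = {"清华": ["清华大学"]}

# Hypothetical bigram log-probabilities over full forms (HMM transitions).
BIGRAM_LOGPROB = {
    ("<s>", "北京大学"): math.log(0.02),
    ("北京大学", "教授"): math.log(0.05),
}
UNSEEN_LOGPROB = math.log(1e-6)  # crude floor for unseen bigrams


def generate_candidates(short_form):
    """Collect expansion candidates from the dictionary and the mapping table."""
    candidates = list(PAIR_DICTIONARY.get(short_form, []))
    if short_form and all(ch in MAPPING_TABLE for ch in short_form):
        # Combine one long word per character of the short form.
        for parts in product(*(MAPPING_TABLE[ch] for ch in short_form)):
            full = "".join(parts)
            if full not in candidates:
                candidates.append(full)
    # Tokens with no candidates are treated as ordinary words.
    return candidates or [short_form]


def transition(prev, cur):
    """Log-probability of moving from one full form to the next."""
    return BIGRAM_LOGPROB.get((prev, cur), UNSEEN_LOGPROB)


def expand(tokens):
    """Viterbi search for the best-scoring sequence of full forms."""
    # best[state] = (log score of the best path ending in state, that path)
    best = {"<s>": (0.0, [])}
    for token in tokens:
        new_best = {}
        for cand in generate_candidates(token):
            score, path = max(
                ((s + transition(prev, cand), p) for prev, (s, p) in best.items()),
                key=lambda item: item[0],
            )
            new_best[cand] = (score, path + [cand])
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]


if __name__ == "__main__":
    print(expand(["北大", "教授"]))
```

With these toy numbers, the short form 北大 yields four reduction candidates, and the transition scores favour 北京大学 when it is followed by 教授. A real system would estimate both transition and emission probabilities from an annotated corpus.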

DC Field | Value | Language
dc.contributor.author | Fu, G | en_HK
dc.contributor.author | Luke, KK | en_HK
dc.contributor.author | Zhang, M | en_HK
dc.contributor.author | Zhou, G | en_HK
dc.date.accessioned | 2010-09-06T10:08:05Z | -
dc.date.available | 2010-09-06T10:08:05Z | -
dc.date.issued | 2006 | en_HK
dc.identifier.citation | 21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006, Singapore, 17-19 December 2006. In Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2006, v. 4285 LNAI, p. 277-287 | en_HK
dc.identifier.issn | 0302-9743 | en_HK
dc.identifier.uri | http://hdl.handle.net/10722/90275 | -
dc.description.abstract | This paper presents a hybrid approach to Chinese abbreviation expansion. In this study, each short-form in Chinese text is assumed to be created by the method of reduction and the method of elimination or generalization, respectively. A mapping table between short words and long words and a dictionary of non-reduced short-form/full-form pairs are thus applied to generate the respective expansion candidates. Then, a hidden Markov model (HMM) based disambiguation is employed to rank these candidates and select a proper expansion for each ambiguous abbreviation. In order to improve expansion accuracy, some linguistic knowledge like discourse information and abbreviation patterns are further employed to double-check the expanded results and revise some error expansions if any. The proposed approach was evaluated on an abbreviation-expanded corpus built from the Peking University Corpus. The results showed that a recall of 83.8% and a precision of 86.3% can be achieved on average for different types of Chinese abbreviations. © 2006 Springer-Verlag. | en_HK
dc.language | eng | en_HK
dc.publisher | Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/ | en_HK
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en_HK
dc.subject | Abbreviation disambiguation | en_HK
dc.subject | Chinese abbreviation expansion | en_HK
dc.subject | Hidden Markov models (HMMs) | en_HK
dc.title | A hybrid approach to Chinese abbreviation expansion | en_HK
dc.type | Conference_Paper | en_HK
dc.identifier.email | Luke, KK: kkluke@hkusua.hku.hk | en_HK
dc.identifier.authority | Luke, KK=rp01201 | en_HK
dc.description.nature | link_to_subscribed_fulltext | -
dc.identifier.doi | 10.1007/11940098_29 | en_HK
dc.identifier.scopus | eid_2-s2.0-77049117554 | en_HK
dc.identifier.hkuros | 153287 | en_HK
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-77049117554&selection=ref&src=s&origin=recordpage | en_HK
dc.identifier.volume | 4285 LNAI | en_HK
dc.identifier.spage | 277 | en_HK
dc.identifier.epage | 287 | en_HK
dc.publisher.place | Germany | en_HK
dc.description.other | 21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006, Singapore, 17-19 December 2006. In Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2006, v. 4285 LNAI, p. 277-287 | -
dc.identifier.scopusauthorid | Fu, G=7202721096 | en_HK
dc.identifier.scopusauthorid | Luke, KK=7003697439 | en_HK
dc.identifier.scopusauthorid | Zhang, M=36041252700 | en_HK
dc.identifier.scopusauthorid | Zhou, G=7403686010 | en_HK
dc.identifier.issnl | 0302-9743 | -
