Conference Paper: A continuous Putonghua recognizer
Title | A continuous Putonghua recognizer |
---|---|
Authors | Wong, PK; Chan, C |
Issue Date | 1997 |
Publisher | IEEE. |
Citation | The 13th International Conference on Digital Signal Processing, Santorini, Greece, 2-4 July 1997, v. 2, p. 889-892 |
Abstract | A multi-speaker continuous Putonghua recognizer has been developed, composed of 20 speaker-dependent recognizers as sub-systems. Each sub-system is a network of hidden Markov models (HMMnet) that models triphones as the fundamental speech units. Over 3 GB of speech data have been collected for training from twenty native Putonghua speakers reading carefully designed tests intended to include all phone-to-phone transitions in Putonghua. For each unknown input utterance, a Viterbi path search over the HMMnet yields the best speech unit sequence, which is then passed to a language model for post-processing. The most suitable word sequence is determined by means of the bigram statistics of 470 word classes covering a vocabulary of over 80,000 words. An enrollment process is required for each new user: the most suitable speaker-dependent system is selected from the 20 sub-systems according to their recognition performance on a small quantity of speech data collected from the user. |
Persistent Identifier | http://hdl.handle.net/10722/45600 |
ISBN | 0-7803-4137-6 |
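
The abstract mentions a Viterbi path search that yields the best speech unit sequence for an utterance. As a rough illustration only, the sketch below runs the textbook Viterbi recursion over a small discrete HMM in the log domain. It is not the paper's implementation: the function name `viterbi`, the dictionary-based tables, and the discrete observation symbols are all assumptions made for this example, and the paper's HMMnet of triphone models is far larger.

```python
import math


def viterbi(obs, states, log_init, log_trans, log_emit):
    """Return (best_log_prob, best_state_path) for an observation sequence.

    All tables hold log-probabilities (hypothetical layout for this sketch):
      log_init[s], log_trans[prev][s], log_emit[s][symbol]
    """
    # score[s] = best log-probability of any path that ends in state s
    score = {s: log_init[s] + log_emit[s][obs[0]] for s in states}
    backptrs = []  # one dict per time step: state -> best predecessor

    for symbol in obs[1:]:
        new_score, ptr = {}, {}
        for s in states:
            # Pick the predecessor that maximises the path score into s
            prev = max(states, key=lambda p: score[p] + log_trans[p][s])
            new_score[s] = score[prev] + log_trans[prev][s] + log_emit[s][symbol]
            ptr[s] = prev
        backptrs.append(ptr)
        score = new_score

    # Trace back from the best final state
    last = max(states, key=lambda s: score[s])
    path = [last]
    for ptr in reversed(backptrs):
        path.append(ptr[path[-1]])
    path.reverse()
    return score[last], path
```

Working in log-probabilities avoids numerical underflow on long utterances, which is the usual reason for this choice in speech decoders.
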
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wong, PK | en_HK |
dc.contributor.author | Chan, C | en_HK |
dc.date.accessioned | 2007-10-30T06:30:02Z | - |
dc.date.available | 2007-10-30T06:30:02Z | - |
dc.date.issued | 1997 | en_HK |
dc.identifier.citation | The 13th International Conference on Digital Signal Processing, Santorini, Greece, 2-4 July 1997, v. 2, p. 889-892 | en_HK |
dc.identifier.isbn | 0-7803-4137-6 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/45600 | - |
dc.description.abstract | A multi-speaker continuous Putonghua recognizer has been developed, composed of 20 speaker-dependent recognizers as sub-systems. Each sub-system is a network of hidden Markov models (HMMnet) that models triphones as the fundamental speech units. Over 3 GB of speech data have been collected for training from twenty native Putonghua speakers reading carefully designed tests intended to include all phone-to-phone transitions in Putonghua. For each unknown input utterance, a Viterbi path search over the HMMnet yields the best speech unit sequence, which is then passed to a language model for post-processing. The most suitable word sequence is determined by means of the bigram statistics of 470 word classes covering a vocabulary of over 80,000 words. An enrollment process is required for each new user: the most suitable speaker-dependent system is selected from the 20 sub-systems according to their recognition performance on a small quantity of speech data collected from the user. | en_HK |
dc.format.extent | 437347 bytes | - |
dc.format.extent | 3669 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | text/plain | - |
dc.language | eng | en_HK |
dc.publisher | IEEE. | en_HK |
dc.rights | ©1997 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.title | A continuous Putonghua recognizer | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0-7803-4137-6&volume=2&spage=889&epage=892&date=1997&atitle=A+continuous+Putonghua+recognizer | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1109/ICDSP.1997.628503 | en_HK |
dc.identifier.hkuros | 38209 | - |
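
The enrollment step described in the abstract selects, for a new user, the speaker-dependent sub-system that performs best on a small amount of that user's speech. The sketch below shows one plausible way to do this, assuming performance is measured as a token-level error rate based on edit distance. The record does not state which metric the authors used, so the metric, the names `edit_distance` and `select_subsystem`, and the representation of recognizers as plain callables are all hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution (or match, cost 0)
            ))
        prev = cur
    return prev[-1]


def select_subsystem(recognizers, enrollment_data):
    """Pick the sub-system with the lowest error rate on enrollment data.

    recognizers:     dict mapping a name to a callable(audio) -> list of tokens
    enrollment_data: non-empty list of (audio, reference_tokens) pairs
    Both structures are assumptions for this sketch.
    """
    total = sum(len(ref) for _, ref in enrollment_data)

    def error_rate(recognize):
        errors = sum(edit_distance(ref, recognize(audio))
                     for audio, ref in enrollment_data)
        return errors / total

    return min(recognizers, key=lambda name: error_rate(recognizers[name]))
```

With 20 sub-systems, `recognizers` would hold 20 entries, and the function returns the name of the one to use for the enrolled speaker.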