Dependency parsing with energy-based reinforcement learning

Zhang, L; Chan, KP

File Download

Content.pdf

Supplementary

Citations:
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Dependency parsing with energy-based reinforcement learning

Title	Dependency parsing with energy-based reinforcement learning
Authors	Zhang, L Chan, KP
Issue Date	2009
Citation	The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237 How to Cite?
Abstract	We present a model which integrates dependency parsing with reinforcement learning based on Markov decision process. At each time step, a transition is picked up to construct the dependency tree in terms of the long-run reward. The optimal policy for choosing transitions can be found with the SARSA algorithm. In SARSA, an approximation of the state-action function can be obtained by calculating the negative free energies for the Restricted Boltzmann Machine. The experimental results on CoNLL-X multilingual data show that the proposed model achieves comparable results with the current state-of-the-art methods.
Description	Short Paper Session VII
Persistent Identifier	http://hdl.handle.net/10722/142602

DC Field	Value	Language
dc.contributor.author	Zhang, L	en_US
dc.contributor.author	Chan, KP	en_US
dc.date.accessioned	2011-10-28T02:52:51Z	-
dc.date.available	2011-10-28T02:52:51Z	-
dc.date.issued	2009	en_US
dc.identifier.citation	The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237	en_US
dc.identifier.uri	http://hdl.handle.net/10722/142602	-
dc.description	Short Paper Session VII	-
dc.description.abstract	We present a model which integrates dependency parsing with reinforcement learning based on Markov decision process. At each time step, a transition is picked up to construct the dependency tree in terms of the long-run reward. The optimal policy for choosing transitions can be found with the SARSA algorithm. In SARSA, an approximation of the state-action function can be obtained by calculating the negative free energies for the Restricted Boltzmann Machine. The experimental results on CoNLL-X multilingual data show that the proposed model achieves comparable results with the current state-of-the-art methods.	-
dc.language	eng	en_US
dc.relation.ispartof	Proceedings of the 11th International Conference on Parsing Technology, IWPT'09	en_US
dc.title	Dependency parsing with energy-based reinforcement learning	en_US
dc.type	Conference_Paper	en_US
dc.identifier.email	Zhang, L: lzhang@cs.hku.hk	en_US
dc.identifier.email	Chan, KP: kpchan@cs.hku.hk	-
dc.identifier.authority	Chan, KP=rp00092	en_US
dc.description.nature	postprint	-
dc.identifier.hkuros	184437	en_US
dc.identifier.spage	234	-
dc.identifier.epage	237	-
dc.description.other	The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237	-

File Download

Supplementary

Conference Paper: Dependency parsing with energy-based reinforcement learning

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats