File Download
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: Dependency parsing with energy-based reinforcement learning
Title | Dependency parsing with energy-based reinforcement learning |
---|---|
Authors | |
Issue Date | 2009 |
Citation | The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237 How to Cite? |
Abstract | We present a model which integrates dependency parsing with reinforcement learning based on Markov decision process. At each time step, a transition is picked up to construct the dependency tree in terms of the long-run reward. The optimal policy for choosing transitions can be found with the SARSA algorithm. In SARSA, an approximation of the state-action function can be obtained by calculating the negative free energies for the Restricted Boltzmann Machine. The experimental results on CoNLL-X multilingual data show that the proposed model achieves comparable results with the current state-of-the-art methods. |
Description | Short Paper Session VII |
Persistent Identifier | http://hdl.handle.net/10722/142602 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang, L | en_US |
dc.contributor.author | Chan, KP | en_US |
dc.date.accessioned | 2011-10-28T02:52:51Z | - |
dc.date.available | 2011-10-28T02:52:51Z | - |
dc.date.issued | 2009 | en_US |
dc.identifier.citation | The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/142602 | - |
dc.description | Short Paper Session VII | - |
dc.description.abstract | We present a model which integrates dependency parsing with reinforcement learning based on Markov decision process. At each time step, a transition is picked up to construct the dependency tree in terms of the long-run reward. The optimal policy for choosing transitions can be found with the SARSA algorithm. In SARSA, an approximation of the state-action function can be obtained by calculating the negative free energies for the Restricted Boltzmann Machine. The experimental results on CoNLL-X multilingual data show that the proposed model achieves comparable results with the current state-of-the-art methods. | - |
dc.language | eng | en_US |
dc.relation.ispartof | Proceedings of the 11th International Conference on Parsing Technology, IWPT'09 | en_US |
dc.title | Dependency parsing with energy-based reinforcement learning | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Zhang, L: lzhang@cs.hku.hk | en_US |
dc.identifier.email | Chan, KP: kpchan@cs.hku.hk | - |
dc.identifier.authority | Chan, KP=rp00092 | en_US |
dc.description.nature | postprint | - |
dc.identifier.hkuros | 184437 | en_US |
dc.identifier.spage | 234 | - |
dc.identifier.epage | 237 | - |
dc.description.other | The 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7-9 October 2009. In Proceedings of the 11th IWPT, 2009, p. 234-237 | - |