
Conference Paper: CoNT: Contrastive Neural Text Generation

Title: CoNT: Contrastive Neural Text Generation
Authors: An, C; Feng, J; Lv, K; Kong, L; Qiu, X; Huang, X
Issue Date: 2022
Publisher: Curran Associates, Inc.
Citation: Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) (Hybrid), New Orleans, Louisiana, United States of America, November 28-December 9, 2022. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022)
Abstract: Recently, contrastive learning has attracted increasing interest in neural text generation as a new solution to alleviate the exposure bias problem. It introduces a sequence-level training signal, which is crucial for generation tasks that rely on auto-regressive decoding. However, previous methods using contrastive learning in neural text generation usually lead to inferior performance. In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. CoNT addresses the bottlenecks that prevent contrastive learning from being widely adopted in generation tasks from three aspects: the construction of contrastive examples, the choice of the contrastive loss, and the decoding strategy. We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text generation and commonsense generation. Experimental results show that CoNT clearly outperforms its baseline on all ten benchmarks by a convincing margin. In particular, CoNT surpasses the previous most competitive contrastive learning method for text generation by 1.50 BLEU on machine translation and 1.77 ROUGE-1 on summarization. It achieves a new state of the art on summarization, code comment generation (without external data) and data-to-text generation.
Persistent Identifier: http://hdl.handle.net/10722/317986
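The abstract names a sequence-level contrastive loss over candidate generations as one of CoNT's three ingredients, but this record does not define it. As a rough, hypothetical illustration only (the function names, cosine scoring, and rank-scaled margin are assumptions for the sketch, not CoNT's actual formulation), a pairwise margin ranking loss over candidates ordered best-first might look like:

```python
import math

def cosine(u, v):
    # Cosine similarity between two dense sequence embeddings.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def pairwise_margin_loss(src, target, candidates, margin=0.1):
    """Hypothetical sequence-level contrastive loss (not CoNT's exact form).

    `candidates` are generated sequences embedded as vectors, ordered
    best-first (e.g. by BLEU against the reference). The gold `target`
    should score higher (cosine similarity to `src`) than every candidate,
    and each better candidate should beat each worse one by a margin
    scaled by their rank gap.
    """
    sims = [cosine(src, c) for c in candidates]
    tgt_sim = cosine(src, target)
    loss = 0.0
    # Gold target vs. every candidate.
    for s in sims:
        loss += max(0.0, margin + s - tgt_sim)
    # Candidate vs. candidate, rank-scaled margin.
    n = len(sims)
    for i in range(n):
        for j in range(i + 1, n):
            loss += max(0.0, (j - i) * margin + sims[j] - sims[i])
    return loss
```

With a well-ordered candidate list the hinge terms vanish and the loss is zero; swapping two candidates out of order produces a positive penalty, which is the training signal a ranking-style contrastive objective provides on top of token-level likelihood.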


DC Field | Value | Language
dc.contributor.author | An, C | -
dc.contributor.author | Feng, J | -
dc.contributor.author | Lv, K | -
dc.contributor.author | Kong, L | -
dc.contributor.author | Qiu, X | -
dc.contributor.author | Huang, X | -
dc.date.accessioned | 2022-10-07T10:30:35Z | -
dc.date.available | 2022-10-07T10:30:35Z | -
dc.date.issued | 2022 | -
dc.identifier.citation | Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) (Hybrid), New Orleans, Louisiana, United States of America, November 28-December 9, 2022. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022) | -
dc.identifier.uri | http://hdl.handle.net/10722/317986 | -
dc.description.abstract | Recently, contrastive learning has attracted increasing interest in neural text generation as a new solution to alleviate the exposure bias problem. It introduces a sequence-level training signal, which is crucial for generation tasks that rely on auto-regressive decoding. However, previous methods using contrastive learning in neural text generation usually lead to inferior performance. In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. CoNT addresses the bottlenecks that prevent contrastive learning from being widely adopted in generation tasks from three aspects: the construction of contrastive examples, the choice of the contrastive loss, and the decoding strategy. We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text generation and commonsense generation. Experimental results show that CoNT clearly outperforms its baseline on all ten benchmarks by a convincing margin. In particular, CoNT surpasses the previous most competitive contrastive learning method for text generation by 1.50 BLEU on machine translation and 1.77 ROUGE-1 on summarization. It achieves a new state of the art on summarization, code comment generation (without external data) and data-to-text generation. | -
dc.language | eng | -
dc.publisher | Curran Associates, Inc. | -
dc.relation.ispartof | Advances in Neural Information Processing Systems 35 (NeurIPS 2022) | -
dc.title | CoNT: Contrastive Neural Text Generation | -
dc.type | Conference_Paper | -
dc.identifier.email | Kong, L: lpk@cs.hku.hk | -
dc.identifier.authority | Kong, L=rp02775 | -
dc.identifier.hkuros | 337877 | -
dc.publisher.place | United States | -
