Conference Paper: INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation

Title: INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
Authors: Zhu, Wenhao; Xu, Jingjing; Huang, Shujian; Kong, Lingpeng; Chen, Jiajun
Issue Date: 11-Jul-2023
Abstract

Neural machine translation has achieved promising results on many translation tasks. However, previous studies have shown that neural models induce a non-smooth representation space, which harms their generalization performance. Recently, kNN-MT has provided an effective paradigm for smoothing predictions with neighbor representations during inference. Despite promising results, kNN-MT usually incurs a large inference overhead. We propose INK, an effective training framework that directly smooths the representation space by adjusting the representations of kNN neighbors with a small number of new parameters. The new parameters are then used to asynchronously refresh the whole representation datastore and obtain updated kNN knowledge. This loop continues until convergence. Experiments on four benchmark datasets show that INK achieves average gains of 1.99 COMET and 1.0 BLEU, outperforming the state-of-the-art kNN-MT system while using 0.02x the memory space and running 1.9x faster at inference.
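
For readers unfamiliar with the kNN-MT step that INK builds on, the sketch below illustrates the standard neighbor-based prediction smoothing: the decoder's hidden state retrieves nearby cached representations from a datastore, and the target tokens of those neighbors form a distribution that is interpolated with the model's own softmax output. This is a minimal, generic sketch under assumed names (knn_smoothed_distribution, temperature, knn_lambda); it is not the authors' implementation, and a real system would use an approximate index such as faiss rather than brute-force search.

import numpy as np

def knn_smoothed_distribution(hidden, datastore_keys, datastore_values,
                              model_probs, k=8, temperature=10.0, knn_lambda=0.4):
    """Interpolate the NMT model distribution with a kNN distribution.

    hidden: (d,) decoder representation for the current decoding step
    datastore_keys: (N, d) cached decoder representations
    datastore_values: (N,) target-token ids paired with each key
    model_probs: (V,) softmax output of the NMT model over the vocabulary
    """
    # Retrieve the k nearest neighbors by L2 distance
    # (illustrative brute-force search; real systems use an ANN index).
    dists = np.linalg.norm(datastore_keys - hidden, axis=1)
    nn_idx = np.argsort(dists)[:k]

    # Turn negative distances into a distribution over the retrieved target tokens.
    weights = np.exp(-dists[nn_idx] / temperature)
    weights /= weights.sum()
    knn_probs = np.zeros_like(model_probs)
    np.add.at(knn_probs, datastore_values[nn_idx], weights)

    # Smooth the model prediction with the neighbor-based distribution.
    return knn_lambda * knn_probs + (1.0 - knn_lambda) * model_probs

The interpolation weight knn_lambda controls how strongly neighbor evidence overrides the parametric model; INK's contribution, per the abstract, is to move this smoothing into training by adjusting neighbor representations with a small number of new parameters and asynchronously refreshing the datastore, so the large retrieval overhead is not paid at inference time.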


Persistent Identifier: http://hdl.handle.net/10722/333813

 

DC Field / Value

dc.contributor.author: Zhu, Wenhao
dc.contributor.author: Xu, Jingjing
dc.contributor.author: Huang, Shujian
dc.contributor.author: Kong, Lingpeng
dc.contributor.author: Chen, Jiajun
dc.date.accessioned: 2023-10-06T08:39:17Z
dc.date.available: 2023-10-06T08:39:17Z
dc.date.issued: 2023-07-11
dc.identifier.uri: http://hdl.handle.net/10722/333813
dc.description.abstract: Neural machine translation has achieved promising results on many translation tasks. However, previous studies have shown that neural models induce a non-smooth representation space, which harms their generalization performance. Recently, kNN-MT has provided an effective paradigm for smoothing predictions with neighbor representations during inference. Despite promising results, kNN-MT usually incurs a large inference overhead. We propose INK, an effective training framework that directly smooths the representation space by adjusting the representations of kNN neighbors with a small number of new parameters. The new parameters are then used to asynchronously refresh the whole representation datastore and obtain updated kNN knowledge. This loop continues until convergence. Experiments on four benchmark datasets show that INK achieves average gains of 1.99 COMET and 1.0 BLEU, outperforming the state-of-the-art kNN-MT system while using 0.02x the memory space and running 1.9x faster at inference.
dc.language: eng
dc.relation.ispartof: Annual Meeting of the Association for Computational Linguistics (ACL 2023) (11/07/2023-18/07/2023)
dc.title: INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
dc.type: Conference_Paper
dc.identifier.doi: 10.18653/v1/2023.acl-long.888
