File Download
Supplementary

Conference Paper: Learning distributed sentence vectors with bi-directional 3D convolutions

TitleLearning distributed sentence vectors with bi-directional 3D convolutions
Authors
Issue Date2020
PublisherInternational Committee on Computational Linguistics.
Citation
Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Virtual Conference, Barcelona, Spain, 8-13 December 2020, p. 6820–6830 How to Cite?
AbstractWe propose to learn distributed sentence representation using text’s visual features as input. Different from the existing methods that render the words or characters of a sentence into images separately, we further fold these images into a 3-dimensional sentence tensor. Then, multiple 3-dimensional convolutions with different lengths (the third dimension) are applied to the sentence tensor, which act as bi-gram, tri-gram, quad-gram, and even five-gram detectors jointly. Similar to the Bi-LSTM, these n-gram detectors learn both forward and backward distributional semantic knowledge from the sentence tensor. That is, the proposed model using bi-directional convolutions to learn text embedding according to the semantic order of words. The feature maps from the two directions are concatenated for final sentence embedding learning. Our model involves only a single-layer of convolution which makes it easy and fast to train. Finally, we evaluate the sentence embeddings on several downstream Natural Language Processing (NLP) tasks, which demonstrate a surprisingly excellent performance of the proposed model.
DescriptionSession Poster25 - Machine Learning and Language Modelling.
Persistent Identifierhttp://hdl.handle.net/10722/294918

 

DC FieldValueLanguage
dc.contributor.authorLiu, B-
dc.contributor.authorWang, L-
dc.contributor.authorYin, G-
dc.date.accessioned2020-12-21T11:50:24Z-
dc.date.available2020-12-21T11:50:24Z-
dc.date.issued2020-
dc.identifier.citationProceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Virtual Conference, Barcelona, Spain, 8-13 December 2020, p. 6820–6830-
dc.identifier.urihttp://hdl.handle.net/10722/294918-
dc.descriptionSession Poster25 - Machine Learning and Language Modelling.-
dc.description.abstractWe propose to learn distributed sentence representation using text’s visual features as input. Different from the existing methods that render the words or characters of a sentence into images separately, we further fold these images into a 3-dimensional sentence tensor. Then, multiple 3-dimensional convolutions with different lengths (the third dimension) are applied to the sentence tensor, which act as bi-gram, tri-gram, quad-gram, and even five-gram detectors jointly. Similar to the Bi-LSTM, these n-gram detectors learn both forward and backward distributional semantic knowledge from the sentence tensor. That is, the proposed model using bi-directional convolutions to learn text embedding according to the semantic order of words. The feature maps from the two directions are concatenated for final sentence embedding learning. Our model involves only a single-layer of convolution which makes it easy and fast to train. Finally, we evaluate the sentence embeddings on several downstream Natural Language Processing (NLP) tasks, which demonstrate a surprisingly excellent performance of the proposed model.-
dc.languageeng-
dc.publisherInternational Committee on Computational Linguistics.-
dc.relation.ispartofThe 28th International Conference on Computational Linguistics (COLING 2020)-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleLearning distributed sentence vectors with bi-directional 3D convolutions-
dc.typeConference_Paper-
dc.identifier.emailYin, G: gyin@hku.hk-
dc.identifier.authorityYin, G=rp00831-
dc.description.naturepublished_or_final_version-
dc.identifier.hkuros320598-
dc.identifier.spage6820-
dc.identifier.epage6830-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats