File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Domain Adapted Word Embeddings for Improved Sentiment Classification
Title | Domain Adapted Word Embeddings for Improved Sentiment Classification |
---|---|
Authors | |
Issue Date | 2018 |
Citation | Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2018, p. 51-59 How to Cite? |
Abstract | Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by first aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA (KCCA) and then combining them via convex optimization. Results from evaluation on sentiment classification tasks show that the DA embeddings substantially outperform both generic, DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification. |
Persistent Identifier | http://hdl.handle.net/10722/341283 |
ISSN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sarma, Prathusha K. | - |
dc.contributor.author | Liang, Yingyu | - |
dc.contributor.author | Sethares, William A. | - |
dc.date.accessioned | 2024-03-13T08:41:36Z | - |
dc.date.available | 2024-03-13T08:41:36Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2018, p. 51-59 | - |
dc.identifier.issn | 0736-587X | - |
dc.identifier.uri | http://hdl.handle.net/10722/341283 | - |
dc.description.abstract | Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by first aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA (KCCA) and then combining them via convex optimization. Results from evaluation on sentiment classification tasks show that the DA embeddings substantially outperform both generic, DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification. | - |
dc.language | eng | - |
dc.relation.ispartof | Proceedings of the Annual Meeting of the Association for Computational Linguistics | - |
dc.title | Domain Adapted Word Embeddings for Improved Sentiment Classification | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.scopus | eid_2-s2.0-85089289412 | - |
dc.identifier.spage | 51 | - |
dc.identifier.epage | 59 | - |