A Comparison of Pre-trained Word Embeddings for Sentiment Analysis Using Deep Learning

Santosh Kumar, P.; Yadav, Rakesh Bahadur; Dhavale, Sunita Vikrant

doi:10.1007/978-981-15-5113-0_41

P. Santosh Kumar²⁰,
Rakesh Bahadur Yadav²⁰ &
Sunita Vikrant Dhavale²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1165))

1496 Accesses
5 Citations

Abstract

The public opinion expressed on review or blogging sites and social networking platforms can be the source for the extraction of very critical information related to feelings and emotions of mass towards the subject matter in the field of commerce and governance. Natural Language Processing (NLP) and Artificial Intelligence can be used for sentiment analysis of this textual information. For text processing, NLP applications nowadays rely on pre-trained embeddings derived from large corpora such as news collection and web crawlers. There are many pre-trained word embeddings available. However, no study found which compares the accuracy achieved using these embeddings. In this paper, we worked on different kinds of word embeddings (pre-trained and untrained) and derived a comparison concerning accuracy for sentiment analysis applications using Deep Learning (DL) models. We found that the deep learning models perform better with pre-trained embeddings compared to Keras default (untrained) embedding.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Sentiment Analysis of News Articles Using Deep Learning Methodologies

Sentiment Analysis: Choosing the Right Word Embedding for Deep Learning Model

A supervised deep learning-based sentiment analysis by the implementation of Word2Vec and GloVe Embedding techniques

Article 09 April 2024

Notes

1.
https://keras.io.
2.
https://www.tensorflow.org.

References

O. Kolchyna, T.T. Souza, P. Treleaven, T. Aste, Twitter sentiment analysis: Lexicon method, machine learning method and their combination. arXiv preprint arXiv:1507.00955 (2015)
P. Bojanowski, E. Grave, A. Joulin, T. Mikolov, Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017)
Article Google Scholar
A.L. Maas, R.E. Daly, P.T. Pham, D. Huang, A.Y. Ng, C. Potts, Learning word vectors for sentiment analysis, in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-vol. 1 (Association for Computational Linguistics, 2011), pp. 142–150
Google Scholar
R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Ng, C. Potts, Recursive deep models for semantic compositionality over a sentiment treebank, in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1631–1642 (2013)
Google Scholar
T. Mikolov, I. Sutskever, K. Chen, G.S. Corrado, J. Dean, Distributed representations of words and phrases and their compositionality, in Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
J. Pennington, R. Socher, C. Manning, Glove: global vectors for word representation, in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
Google Scholar
T. Mikolov, E. Grave, P. Bojanowski, C. Puhrsch, A. Joulin, Advances in pre-training distributed word representations, in Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018) (2018)
Google Scholar
A. Hassan, A. Mahmood, Convolutional recurrent deep learning model for sentence classification. IEEE Access 6, 13949–13957 (2018)
Article Google Scholar
Y. Bengio, R. Ducharme, P. Vincent, C. Jauvin, A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)
MATH Google Scholar
T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Y. Kim, Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
M. Sundermeyer, H. Ney, R. Schlüter, From feedforward to recurrent lstm neural networks for language modeling. IEEE/ACM Trans. Audio Speech Language Process. 23(3), 517–529 (2015)
Article Google Scholar
S. Hochreiter, J. Schmidhuber, Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
X. Wang, W. Jiang, Z. Luo, Combination of convolutional and recurrent neural network for sentiment analysis of short texts, in Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2428–2437 (2016)
Google Scholar
Y. Xiao, K. Cho, Efficient character-level document classification by combining convolution and recurrent layers. arXiv preprint arXiv:1602.00367 (2016)

Download references

Acknowledgements

The authors would like to acknowledge and thank NVIDIA for their support provided through the GPU grant for carrying out this research work.

Author information

Authors and Affiliations

Defence Institute of Advanced Technology, Girinagar, Pune, 411025, India
P. Santosh Kumar, Rakesh Bahadur Yadav & Sunita Vikrant Dhavale

Authors

P. Santosh Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Rakesh Bahadur Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Sunita Vikrant Dhavale
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Santosh Kumar .

Editor information

Editors and Affiliations

Maharaja Agrasen Institute of Technology, Rohini, Delhi, India
Deepak Gupta
Maharaja Agrasen Institute of Technology, Rohini, Delhi, India
Ashish Khanna
CHRIST (Deemed to be University), Bengaluru, Karnataka, India
Siddhartha Bhattacharyya
Department of Information Technology, Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien
Department of Computer Science, Shaheed Sukhdev College of Business Studies, University of Delhi, Rohini, Delhi, India
Sameer Anand
Department of Computer Science, Shaheed Sukhdev College of Business Studies, University of Delhi, Rohini, Delhi, India
Ajay Jaiswal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Santosh Kumar, P., Yadav, R.B., Dhavale, S.V. (2021). A Comparison of Pre-trained Word Embeddings for Sentiment Analysis Using Deep Learning. In: Gupta, D., Khanna, A., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1165. Springer, Singapore. https://doi.org/10.1007/978-981-15-5113-0_41

Download citation

DOI: https://doi.org/10.1007/978-981-15-5113-0_41
Published: 02 August 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5112-3
Online ISBN: 978-981-15-5113-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

A Comparison of Pre-trained Word Embeddings for Sentiment Analysis Using Deep Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Sentiment Analysis of News Articles Using Deep Learning Methodologies

Sentiment Analysis: Choosing the Right Word Embedding for Deep Learning Model

A supervised deep learning-based sentiment analysis by the implementation of Word2Vec and GloVe Embedding techniques

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Comparison of Pre-trained Word Embeddings for Sentiment Analysis Using Deep Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Sentiment Analysis of News Articles Using Deep Learning Methodologies

Sentiment Analysis: Choosing the Right Word Embedding for Deep Learning Model

A supervised deep learning-based sentiment analysis by the implementation of Word2Vec and GloVe Embedding techniques

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation