Skip to main content

Research on Short Text Classification Method Based on Convolution Neural Network

  • Conference paper
  • First Online:
Advances in Intelligent, Interactive Systems and Applications (IISA 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 885))

Abstract

Short text classification is one of the hotspots of research in Natural Language Processing. a new model of text representation is proposed in this paper (N-of-DOC), and in order to solve the problem of sparse representation in Chinese, the word2vec distributed representation is used, finally, it is applied to the improved convolution neural network model (CNN) to extract the high level features from the filter layer, the classification model is obtained by connecting the softmax classifier after the pooling layer. In the experiment, the traditional text representation model and the improved text representation model are used as the input of the original data, respectively. It acts on the model of traditional machine learning (KNN, SVM, logistic regression, naive Bayes) and the improved convolution neural network model. The results show that the proposed method can not only solve the dimension disaster and sparse problem of Chinese text vectors, but also improve the classification accuracy by 10.23% compared with traditional methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Jiang, B.: Micro-blog Automatic Classification Method Research and Application. Harbin Institute of Technology, Harbin (2012)

    Google Scholar 

  2. Zhang, Z., Miao, D., Chan, H.: Short text classification method LDA topic model. Based Comput. Appl. 33(6), 1587–1590 (2013)

    Google Scholar 

  3. Zhang, A., Liu, G., Liu, C.: Research on multi class text classification based on SVM. Inf. Mag. 23(9), 6–7 (2004)

    Google Scholar 

  4. Guo, S.: Research on Short Text Classification Algorithm Based on Bayesian Network. Chongqing University of Posts and Telecommunications, Chongqing (2010)

    Google Scholar 

  5. Zhong, W., Liu, R.: An improved KNN text classification. Comput. Eng. Appl. 48(2), 142–144 (2012)

    Google Scholar 

  6. Miaomiao, T.: A study of text classification based on decision tree. J. Jilin Norm. Univ. (Nat. Sci. Edit.) 29(1), 54–56 (2008)

    Google Scholar 

  7. Kim, Y.: Convolutional neural networks for sentence classification. Eprint Arxiv (2014)

    Google Scholar 

  8. Huang, W., Moyang, : Chinese spam filtering. Comput. Eng. Based Text Weight. KNN Algorithm 43(3), 193–199 (2017)

    Google Scholar 

  9. Chen, Y., Wu, J., Xu, K.: Development, Gini index for attribute selection of microcomputer based on decision tree. Microcomput. Dev. 14(5), 66–68 (2004)

    Google Scholar 

  10. Hu, W., He, T., Zhang, Y.: Extraction of Chinese terminology based on Chi square test. Comput. Appl. 27(12), 3019–3020 (2007)

    Google Scholar 

  11. Tan, S., Li, : Menstrual in text classification TF IDF. Improv. Method Mod. Libr. Inf. Technol. 29(10), 27–30 (2013)

    Google Scholar 

Download references

Acknowledgements

First of all, I would like to thank my tutor, Professor Chen Qiaohong, for his great care and help in my life and my studies, Chen virtuous, friendly, knowledgeable, rigorous scholarship, During my master’s study, She not only taught me the skills of learning, she also taught me the rules of being a man, which will certainly benefit me for life. Finally, I would like to thank my parents for their greatest support, and I love you.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lei Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, L., Chen, Q., Sun, Q., Jia, Y. (2019). Research on Short Text Classification Method Based on Convolution Neural Network. In: Xhafa, F., Patnaik, S., Tavana, M. (eds) Advances in Intelligent, Interactive Systems and Applications. IISA 2018. Advances in Intelligent Systems and Computing, vol 885. Springer, Cham. https://doi.org/10.1007/978-3-030-02804-6_53

Download citation

Publish with us

Policies and ethics