Skip to main content

Abstract

The aim of this paper is to compile dictionaries of slang words, abbreviations, contractions, and emoticons to help the pre-processing of texts published in social networks. The use of these dictionaries is intended to improve the results of the tasks related to data obtained from these platforms. Therefore, a hypothesis was evaluated in the task of identifying author profiles (author profiling).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Schler J, Koppel M, Argamon S, Pennebaker JW (2006) Effects of age and gender on blogging. In: Computational approaches to analyzing weblogs, papers from the 2006 AAAI spring symposium, technical report SS-06-03, Stanford, California, USA, 27–29 Mar 2006, pp 199–205

    Google Scholar 

  2. Viloria A, Lis-Gutiérrez JP, Gaitán-Angulo M, Godoy ARM, Moreno GC, Kamatkar SJ (2018) Methodology for the design of a student pattern recognition tool to facilitate the teaching—learning process through knowledge data discovery (Big Data). In: Tan Y, Shi Y, Tang Q (eds) Data mining and big data. DMBD 2018. Lecture notes in computer science, vol 10943. Springer, Cham

    Google Scholar 

  3. Tang J (2016) AMiner: mining deep knowledge from big scholar data. In: Proceedings of the 25th international conference companion on world wide web, international world wide web conferences steering committee, republic and canton of Geneva, Switzerland, pp 373–373

    Google Scholar 

  4. Obit JH, Ouelhadj D, Landa-Silva D, Vun TK, Alfred R (2011) Designing a multi-agent approach system for distributed course timetabling. pp 103–108 https://doi.org/10.1109/his.2011.6122088

  5. Lewis MRR (2006) Metaheuristics for university course timetabling. Ph.D. Thesis, Napier University

    Google Scholar 

  6. Deng X, Zhang Y, Kang B, Wu J, Sun X, Deng Y (2011) An application of genetic algorithm for university course timetabling problem. pp 2119–2122 https://doi.org/10.1109/ccdc.2011.5968555

  7. Mahiba AA, Durai CAD (2012) Genetic algorithm with search bank strategies for university course timetabling problem. Procedia Eng 38:253–263

    Article  Google Scholar 

  8. Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans Know Data Eng 17:734–749

    Article  Google Scholar 

  9. Camacho-Vázquez V, Sidorov G, Galicia-Haro SN (2016) Machine learning applied to a balanced and emotional corpus of tweets with many varieties of Spanish. Submitted

    Google Scholar 

  10. Nguyen K, Lu T, Le T, Tran N (2011) Memetic algorithm for a university course timetabling problem. pp 67–71. https://doi.org/10.1007/978-3-642-25899-2_10

  11. Haddi E, Liu X, Shi Y (2013) The role of text pre-processing in sentiment analysis. In: Procedia computer science first international conference on information technology and quantitative management vol 17, pp 26–32

    Google Scholar 

  12. Hemalatha I, Varma DGPS, Govardhan DA (2012) Preprocessing the informal text for efficient sentiment analysis. Int J Emerg Trends Technol Comput Sci (IJETTCS) 1(2):58–61

    Google Scholar 

  13. Pinto D, Vilarinõ-Ayala D, Alemán Y, Gómez-Adorno H, Loya N, Jiménez-Salazar H (2012) The soundex phonetic algorithm revisited for sms-based information retrieval. In: II Spanish conference on information retrieval CERI 2012

    Google Scholar 

  14. Torres-Samuel M, Vásquez C, Viloria A, Lis-Gutiérrez JP, Borrero TC, Varela N (2018) Web visibility profiles of Top100 Latin American universities. International conference on data mining and big data. Springer, Cham, pp 254–262

    Chapter  Google Scholar 

  15. Henao-Rodríguez C, Lis-Gutiérrez JP, Bouza C, Gaitán-Angulo M, Viloria A (2019) Citescore of publications indexed in scopus: an implementation of panel data. International conference on data mining and big data. Springer, Singapore, pp 53–60

    Chapter  Google Scholar 

  16. Peersman C, Daelemans W, Van Vaerenbergh L (2011) Predicting age and gender in online social networks. In: Proceedings of the 3rd international workshop on search and mining user-generated contents. New York, NY, USA, ACM, pp 37–44

    Google Scholar 

  17. Nguyen D, Gravel R, Trieschnigg D, Meder T (2013) how old do you think i am?: a study of language and age in twitter. In: Proceedings of the seventh international AAAI conference on weblogs and social media. ICWSM

    Google Scholar 

  18. Rangel F, Rosso P (2013) Use of language and author profiling: Identification of gender and age. In: Proceedings of the 10th workshop on natural language processing and cognitive science (NLPCS-2013)

    Google Scholar 

  19. Bedford D (2013) Evaluating classification schema and classification decisions. Bull Am Soc Inform Sci Technol 39:13–21

    Article  Google Scholar 

  20. Toutanova K, Klein D, Manning C, Singer Y (2003) Feature-rich part-of-speech tagging with a cyclic dependency network. In: Human language technology conference (HLT-NAACL 2003)

    Google Scholar 

  21. McGrail MR, Rickard CM, Jones R (2006) Publish or perish: a systematic review of interventions to increase academic publication rates. Higher Educ Res Dev 25:19–35

    Article  Google Scholar 

  22. Costas R, van Leeuwen TN, Bordons M (2010) A bibliometric classificatory approach for the study and assessment of research performance at the individual level: the effects of age on productivity and impact. J Am Soc Inf Sci 61:1564–1581

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jesús Silva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Silva, J. et al. (2021). Identification of Author Profiles Through Social Networks. In: Gunjan, V.K., Zurada, J.M. (eds) Proceedings of International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications. Advances in Intelligent Systems and Computing, vol 1245. Springer, Singapore. https://doi.org/10.1007/978-981-15-7234-0_84

Download citation

Publish with us

Policies and ethics