Abstract
Hundreds of stock market news reach the financial markets every day. In order to benefit from these movements as an investor, this work aims at developing a system with which the direction of the price fluctuations can be predicted. This paper is about building an own dataset of financial information with thousands of financial articles, and historical stock data are used to build the foundation of the following actions. This foundational data is used to build a learning algorithm. Therefore, the articles are normalized by the methods of natural language processing and converted into a matrix based on the occurrences of the individual words in the news. This matrix then serves as an endogenous variable that predicts the likely direction of market impact. In order to make that statement, the actual impact on the markets was used as an exogenous variable to train different classification algorithms. To find the best algorithm to train, we created a confusion matrix. In the end, the best algorithm gets selected to perform the prediction and as a result, our trained algorithm achieved high accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
E.F. Fama, Eficient capital markets: a review of theory and empirical work. J. Financ. 25, 383–417 (1970)
J.B. DeLong, A. Shleifer, L.H. Summers, R.J. Waldmann, Noise trader risk in financial markets. J. Polit. Econ. 98, 703–738 (1990)
U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, From data mining to knowledge discovery in databases. AI Mag. 17(3), 37 (1996)
F. Li, Do stock market investors understand the risk sentiment of corporate annual reports? (2006) 54. Working Paper
R.P. Schumaker, H. Chen, Textual analysis of stock market prediction using breaking financial news: the AZFin text system. ACM Trans. Inf. Syst. 27, 12:1–12:19 (2009)
J. Bollen, A. Pepe, H. Mao, Twitter mood predicts the stock market. J. Comput. Sci. 2, 1–8 (2011)
A. Klein, O. Altuntas, T. Hausser, W. Kessler, Extracting investor sentiment from weblog texts: a knowledge-based approach, in IEEE (2011), pp. 1–9
R.P. Schumaker, Y.L. Zhang, C.N. Huang, H. Chen, Evaluating sentiment in financial news articles. Decis. Support. Syst. 53, 458–464 (2012)
R.P. Schumaker, N. Maida, Analysis of stock price movement following financial news article release. Commun. IIMA 16(1) (2018). Article 1
A.S. Abrahams, J. Jiao, G.A. Wang, W.G. Fan, Vehicle defect discovery from social media. Decis. Support Syst. 54, 87–97 (2012)
V. Lavrenko, M. Schmill, D. Lawrie, P. Ogilvie, D. Jensen, J. Allan, Language models for financial news recommendation, in Proceedings of the 9th International Conference on Information and Knowledge Management (CIKM) (2000), pp. 389–396
T. Loughran, B. McDonald, When is a liability is not a liability? Textual analysis, dictionaries, and 10-ks. J. Financ. 66, 35–65 (2012)
M.A. Mittermayer, G.F. Knolmayer, Newscats: a news categorization and trading system, in Proceedings of the 6th International Conference on Data Mining (ICDM) (IEEE), pp. 1002–1007
R.P. Schumaker, H. Chen, Evaluating a news-aware quantitative trader: the effect of momentum and contrarian stock selection strategies. J. Am. Soc. Inform. Sci. Technol. 59, 247–255 (2008)
S. Bird, E. Klein, E. Loper, Natural language processing with python: analyzing text with the natural language toolkit (O’Reilly Media, Inc., 2009)
R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, P. Kuksa, Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)
S. Dumais, J. Platt, D. Heckerman, M. Sahami, Inductive learning algorithms and representations for text categorization (1998)
C. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. Bethard, D. McClosky, The Stanford CoreNLP natural language processing toolkit, in Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, June 2014 (2014), pp. 55–60
J. Shriwas, S. D. Sharma, Stock price prediction using hybrid approach of rule based algorithm and financial news. Int. J. Comput. Technol. Appl. 5(1), 205–211
S. Karthik, K.K. Sureshkumar, Analysis of stock market trend using integrated clustering and weighted rule mining technique. Int. J. Comput. Sci. Manag. Res, 1(5) (2012). ISSN 2278–733X
S. Evans, Data mining in financial markets (2011)
A. Rajput, S.P. Saxena, R. Prasad Aharwal, R. Soni, Rule based classification of BSE stock data with data mining. Int. J. Inf. Sci. Appl. 4(1) (2012). ISSN 0974-2255
S.Prasanna, D. Ezhilmaran, An analysis on stock market prediction using data mining techniques. Int. J. Comput. Sci. Eng. Technol. (IJCSET)
M.V. Pinto, K. Asnani, Stock price prediction using quotes and financial news. Int. J. Comput. Sci. Eng. Technol. (IJCSET) (2011)
N. Sharma, H. Om, Significant patterns for oral cancer detection: association rule on clinical examination and history data. Netw. Model. Anal. Health Inf. Bioinform. 03 (2014). Springer, Wien. Article 50, December 2013
N. Sharma, H. Om, Early detection and prevention of oral cancer: association rule mining on investigations. WSEAS Trans. Comput. 13, 1–8 (2014)
N. Sharma, H. Om, Correlation neural network model for classification of oral cancer. WSEAS Trans. Biol. Biomed. 11, 45–51 (2014)
N. Sharma, H. Om, Using MLP and SVM for predicting survival rate of oral cancer patients. Netw. Model. Anal. Health Inf. Bioinform. 3 (2014). Springer, Wien. Article 58
N. Sharma, H. Om, Usage of probabilistic and general regression neural network for early detection-prevention of oral cancer. Sci. World J. (Hindawi Publishing Corporation) 2015, 1–11 (2015). 234191
N. Sharma, H. Om, GMDH polynomial and RBF neural network for oral cancer classification. Netw. Model. Anal. Health Inf. Bioinform. 04(1) (2015). Springer, Wien
N. Sharma, H. Om, Hybrid framework using data mining techniques for early detection and prevention of oral cancer. Int. J. Adv. Intell. Paradig. Indersci. 09(5/6), 604–622 (2017)
S. Dumais, J. Platt, D. Hecherman, S. Sahami, Inductive learning algorithms and representations for text categorization, in Proceedings of the Seventh International Conference on Information and Knowledge Management 148–155. https://doi.org/10.1145/288627.288651
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Eck, M., Germani, J., Sharma, N., Seitz, J., Ramdasi, P.P. (2021). Prediction of Stock Market Performance Based on Financial News Articles and Their Classification. In: Sharma, N., Chakrabarti, A., Balas, V.E., Martinovic, J. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 1175. Springer, Singapore. https://doi.org/10.1007/978-981-15-5619-7_3
Download citation
DOI: https://doi.org/10.1007/978-981-15-5619-7_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5618-0
Online ISBN: 978-981-15-5619-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)