Introduction

Forecasting the prices and trends of the stock market is one of the most challenging and competitive domains for scientists and financial experts. Many people have lost their savings trying to time the market and make a fortune from it. The dominant advice given by financial advisors and traditional investors is to simply invest part of your income in the S&P 500 and wait; otherwise, you will probably lose your money. Moreover, according to the Efficient Market Hypothesis (EMH), every stock trades at its fair value, and an investor cannot consistently beat the market. On the other hand, many investors, hedge funds, and scientists often reject the EMH, arguing that stock market trends can be predicted using fundamental, technical, and sentiment analysis.

Predicting the price of the stock market is a multivariate equation whose components many researchers and financial experts have tried to identify. At the beginning of 2021, we observed a massive movement by private investors against hedge funds. They bought shares of companies that hedge funds had shorted, leading to a short squeeze and resulting in heavy losses for some very famous and large funds. This movement was organized mainly through Reddit and Twitter. With the help of real-time crawlers on Twitter and Reddit, such data can be obtained and then fed into a fast and highly effective real-time NLP system. Therefore, this could potentially be beneficial and profitable for stock market prediction.

So, to understand the stock market, researchers need to retrieve two different types of information: hard and soft. The closing price of a stock, a company's revenue, and its number of sales are considered hard data, as they can be represented by numeric values. On the other hand, news, articles, and tweets are considered soft data: they carry more abstract information that can also be represented numerically, but with some loss of information. Because of these intrinsic differences, the two types of data are suited to different tasks, each producing better outcomes on different problems. In [1], the authors detail what each type of information is and how the two differ in the banking and financial world.

While many people have studied the problem of forecasting stock market prices and trends, most have used only one of the two types of data, hard [2,3,4,5,6,7] or soft [11, 12]. Those who choose the latter mostly ask whether a correlation between the sentiment of textual data and the stock market exists, without applying it in an actual decision-making tool. The amount of noise in this kind of data, and the difficulty of incorporating it into a viable prediction model for the stock market, have deterred researchers from fully realizing the potential of textual analysis in building such a tool. Nevertheless, in recent years we have seen many occasions where collectives of internet users, through their synchronized actions, changed the course of a stock price. As more people join these online communities in an effort to gain an advantage in their financial and stock market decisions, sentiment analysis can become more powerful and accurate. Ideas, personal opinions, and inside information all appear in internet posts, potentially providing insight into how a stock's trend will fluctuate. These posts portray people's feelings towards companies and their stock tickers, and with sentiment analysis it is possible to assess how positive or negative these feelings are. This means that the correlation between internet text corpora and stock trends becomes more apparent, and decision-making tools based on sentiment analysis can become more efficient and trustworthy. Sentiment analysis analyzes people's feelings and moods toward an entity, such as a stock ticker, using textual data to determine how negative or positive their thoughts are [9, 10].

As technology has evolved and machine and deep learning algorithms have become more sophisticated and successful, hard data have been used to predict stock market prices or trends [2,3,4,5,6,7, 18,19,20]. The reader is also referred to [8] for a recent survey by Ferreira et al. There are also several papers in which the authors used textual data to predict stock market movements. The authors of [11] explore the link between stock market movements and Twitter sentiment using sentiment analysis and supervised machine learning approaches. The authors of [12] used sentiment data and stock price data to create an SVM model for predicting the next day's stock movements. The approach proposed in [13] first labels a stock market-related tweet dataset, then compares various deep learning models, and ultimately introduces an LSTM model that outperforms all other models.

Taking all of this into account, in this paper we propose methods that add to the direction of analyzing and applying machine/deep learning techniques to correctly anticipate stock prices and trends using both numerical/economic data and textual/sentiment data. We apply deep learning/machine learning approaches to both types of data, with the purpose of not only predicting stock market trends but also understanding how a stock's Technical Analysis may be strengthened by Sentiment Analysis. This article builds upon our previous work presented in [14], where we employed three deep/machine learning methods [15], i.e., Long Short-Term Memory (LSTM), k-nearest neighbors (KNN), and Decision Trees. In this paper, we use three more deep/machine learning approaches, namely Convolutional Neural Networks (CNN), Support Vector Classification (SVC), and Multilayer Perceptron (MLP), and we apply them to the following three different sets of historical data: (a) numerical/economic data such as stock closing prices, technical analysis indicators, labels, etc.; (b) sentiment data, e.g., scores computed using lexical methodologies on textual data collected from Twitter, and labels; (c) combined data that include all the data in sets (a) and (b).

In our tests, we used data from four stock tickers: AAPL, GOOG, NVDA, and S&P 500 Information Technology. The data consist of numerical/economic data collected over a 20-year period and textual data (about 29,000 tweets for each of the above tickers) collected over an 8-year period. Out of all six algorithms, we additionally compared the two best ones, i.e., LSTM and CNN, on new extended datasets. Specifically, we additionally used the “financial_phrasebank” dataset [21], which includes 5000 labeled sentences from financial news articles about Finnish banks.

The results demonstrate that the extended datasets improve our profits in most cases; among the extended datasets, the highest profits came from the CNN on numerical data. However, the highest profits among all datasets and methods came from the LSTM method applied to the numerical data of the original dataset, as presented in [14]. Sentiment analysis also proved promising, as it was profitable and, in most cases, a better option than a passive investment. Sentiment analysis appears to produce better results when additional high-quality data, such as news titles and articles, is included and the number of collected tweets is increased.

This paper is structured as follows: "Sentiment Analysis" describes the soft information we used, as well as how we analyzed and applied it. The technical analysis indicators used are shown in "Technical Analysis". "Application" describes the data used in the deep/machine learning methods, as well as the application's remaining parameters. "Results" summarizes the findings, and "Conclusions" presents the conclusions.

Sentiment Analysis

In the last few years, the stock market has been greatly influenced by the power of words and by mass transactions that are stimulated and coordinated by social media users. Naturally, many researchers have begun trying to understand the sentiment behind such users and their posts, so that patterns can be identified and it becomes possible to predict a stock's trend. It is evident that not only well-thought-out and well-written news articles from large news networks influence this market; a simple internet post with hastily written text can have the same impact, especially when such posts come in huge quantities. To emphasize this, the now historic stock market incident commonly referred to as the “GameStop short squeeze” was a landmark display of how internet users can collectively change the course of a stock.

As mentioned above, there have been several studies on the sentiment analysis of internet posts aimed at making future predictions of stock trends or prices. In our work these internet posts are Twitter posts, or, as they will be called from this point on, “tweets”. Twitter has been a powerful ally to researchers and a trustworthy prediction tool. Ussama et al. [16] studied the predictive power of Twitter on a very important and serious matter, the 2016 US elections. Pagolu et al. [11] and Rao and Srivastava [17] tried to prove that there is a correlation between the stock market and the sentiment of tweets and to further analyze this relationship. Researchers have repeatedly shown how Twitter, and social media platforms like it, such as Reddit, 4chan and so on, can play a huge role in shaping or predicting trends.

Given all of the above, it is safe to assume that there can be a significant percentage of accurate predictions that use textual data to track a stock’s trends. While in our work we used both numerical and textual data, in the rest of this Section we will explain how we used the latter.

Data Collection

In our previous work [14], we collected approximately 29,000 English tweets for each of three different tech giants, Google, Nvidia and Apple, using their respective stock tickers GOOG, NVDA and AAPL. We used these tweets to make predictions for each respective corporation and for the S&P Information Technology Sector as well. The tweets ranged from the 1st of January 2012 to the 31st of December 2019. For each ticker, we collected 10 tweets per day and therefore gathered about 29,000 tweets. The same tweets from our previous work were used again so that there is a valid benchmark against which to compare the new findings of this work.

To expand our research, we looked for a labeled dataset that could provide the right amount of data well focused on the topic of stock market and finance. The “financial_phrasebank” dataset which was developed and used in the work of Malo et al. [21] met the needs of our research. This dataset included 5000 labeled sentences from financial news articles about Finnish Banks. The sentiment behind each sentence was identified by people with sufficient knowledge of the financial world. These sentences were appropriate for our sentiment analysis research, as they included news on corporate finances as well as news unrelated to corporate internal affairs, focusing on external sentiments and assessments. Using this data we were able to expand and strengthen our research, especially when it was used in combination with the datasets mentioned above.

With these two types of data, the unlabeled and the labeled, we were able to move on. In the following, we discuss how all this data was used and in what ways.

Stemming and Cleaning

To use each of the datasets, both the unlabeled and the labeled one, we applied a word removal process followed by a stemming process. Unnecessary and redundant words and characters, such as stopwords and punctuation, were eliminated during the removal process. Links and user mentions were also removed, as they were not useful for our purpose. Only the words that could be valuable to us remained. After that, the stemming phase can start.

Stemming refers to the process of reducing a word to its root form. For example, derivatives such as “weakening” and “weakness” can be represented after stemming as “weak”. This is done so that the number of distinct words is limited, which improves computation times, something that can be crucial at a later stage of the research. It also makes it possible to match words with entries in a lexicon or a vector representation model, so that we can obtain a better overall analysis of the sentiment of the data. The sentiment of each word and combination of words is mostly unaffected by this process.

Therefore, after cleaning each tweet, we tokenized the words and stemmed them using the Snowball Stemmer. With this stemmer, we stripped each word of its suffixes and kept it in its base form. The order of the words was not changed, so word combinations that could substantially modify the meaning and sentiment of a sentence were preserved.
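A minimal sketch of such a cleaning and stemming pipeline is shown below, using NLTK's Snowball Stemmer; the exact regular expressions and stopword list are illustrative choices, not necessarily the ones used in our experiments.

```python
import re
import string

import nltk
from nltk.corpus import stopwords
from nltk.stem.snowball import SnowballStemmer

# One-time downloads: nltk.download("stopwords"); nltk.download("punkt")
STOPWORDS = set(stopwords.words("english"))
STEMMER = SnowballStemmer("english")


def clean_and_stem(tweet):
    """Remove links, mentions, punctuation and stopwords, then stem each word,
    preserving word order."""
    text = re.sub(r"http\S+|www\.\S+", " ", tweet)   # drop links
    text = re.sub(r"@\w+", " ", text)                # drop user mentions
    text = text.translate(str.maketrans("", "", string.punctuation)).lower()
    tokens = nltk.word_tokenize(text)
    tokens = [t for t in tokens if t not in STOPWORDS]
    return [STEMMER.stem(t) for t in tokens]
```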

Evaluation

After the sentences have been cleaned and filtered, the evaluation process can begin. The evaluation process involves labeling (categorizing) the tweets of the unlabeled dataset as either positive, negative, or neutral based on their sentiment. To categorize the data, we implemented two methods: the Lexicon method and the Machine Learning method. The Lexicon method uses three pre-defined sentiment lexicons to assign a sentiment score to each tweet; for this method, we obtain one score per lexicon that we later use for the stock prediction. The Machine Learning method, in contrast, trains machine learning models on the labeled dataset to predict the sentiment of new tweets from the unlabeled dataset; for this method, we use accuracy as the evaluation metric.

Lexical Methodology

The Lexicon method, or lexical methodology, uses a lexicon to assign a numerical score to each word of a sentence. These scores can then be aggregated to give the overall score of the sentence, and hence the sentiment behind a tweet. How well the score reflects the sentiment depends largely on the quality of the lexicon. We used this technique with three different lexicons: VADER, Loughran-McDonald and a generic lexicon.

  • The VADER Lexicon [22] is a lexicon that is mainly used for social media analysis, as it contains words and their respective sentiment score, focused around social media posts.

  • The Loughran-McDonald Lexicon [23], although limited in the number of words it contains, was developed specifically to capture the meaning of words in finance and can also score word combinations, which makes it a very powerful lexicon for our research.

  • A Generic Lexicon is a simple lexicon that does not use a certain viewpoint (finance, social media, etc.) to determine the sentiment score of a word. The number of words in such a lexicon could fill in the gaps when the other two could not provide a score.

Using these three lexicons, we created a score for each tweet. Then we calculated the average score of the 10 tweets of each day for each respective lexicon. This means that for each day we had 3 average scores, produced by the three lexicons.
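As an illustration, a per-tweet score and the daily average can be computed with the vaderSentiment package as sketched below; the column names and the use of VADER's compound score are assumptions, and the Loughran-McDonald and generic lexicons are handled analogously by aggregating per-word weights.

```python
import pandas as pd
from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer

analyzer = SentimentIntensityAnalyzer()


def vader_daily_scores(tweets_df: pd.DataFrame) -> pd.Series:
    """Score each tweet with VADER's compound score and average the 10 tweets of each day.

    `tweets_df` is assumed to have a "date" and a "text" column.
    """
    scored = tweets_df.copy()
    scored["vader"] = scored["text"].apply(
        lambda t: analyzer.polarity_scores(t)["compound"]
    )
    return scored.groupby("date")["vader"].mean()
```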

Machine Learning Method

For the Machine Learning Method, the labeled dataset is used to train models so that they can learn patterns between the features (that we introduce in the following paragraph) in the tweets and their corresponding sentiment labels (positive, negative, or neutral). Once the models have been trained on the labeled dataset, they can be used to predict the sentiment of new tweets from the unlabeled dataset. This is accomplished by feeding the new-unlabeled tweets into the model and letting it make predictions based on what it has learned from the labeled data.

Creating vector representation models We used vectors to represent each sentence from the labeled dataset and each tweet from the unlabeled datasets. These vectors consist of numbers derived from the words of the tweets and sentences, and they are used so that patterns can be identified to make predictions. Through the stemming process described above, we prepared the “financial_phrasebank” dataset to be used in vector representation models to train our models. We used two of the most common basic vector representation techniques: the Bag of Words (BOW) and the Term Frequency–Inverse Document Frequency (TF-IDF).

  • The BOW model is a very common vector representation model used in Machine Learning. Given a dataset, it builds a vocabulary of every unique word encountered and represents each sentence by the number of occurrences of each word. Because of its simplicity, it is often used as a benchmark vector representation model.

  • The TF-IDF model is also a common vector representation model used in Machine Learning. This model combines the frequency of a word in a given sentence with its frequency across the whole dataset to produce a weight. This weight essentially describes the rarity, and therefore the significance, of the word, so that sentences containing such rare words lean more towards those words' sentiment.

We gave as input to these models the clean and filtered “financial_phrasebank” dataset. When they were finalized, we split them into training and testing vectors, 80% of the dataset as training and 20% as testing, to be used later by our Machine Learning Algorithms.
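A minimal scikit-learn sketch of this vectorization and the 80/20 split is given below; the variable names and the fixed random_state are illustrative assumptions.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.model_selection import train_test_split

# `sentences` and `labels` are assumed to hold the cleaned, stemmed
# financial_phrasebank sentences and their sentiment labels.
bow = CountVectorizer()
tfidf = TfidfVectorizer()

X_bow = bow.fit_transform(sentences)      # Bag-of-Words term counts
X_tfidf = tfidf.fit_transform(sentences)  # TF-IDF weights

# 80% training / 20% testing (the same random_state keeps the splits aligned)
Xb_train, Xb_test, y_train, y_test = train_test_split(
    X_bow, labels, test_size=0.2, random_state=42
)
Xt_train, Xt_test, _, _ = train_test_split(
    X_tfidf, labels, test_size=0.2, random_state=42
)
```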

Training the models We used the BOW and TF-IDF models to train five different Machine Learning Algorithms so that we can later use them in labeling the tweets. These algorithms were:

  • Naive Bayes The Naive Bayes classifiers are a collection of supervised learning algorithms based on Bayes' theorem. The classifier we used is commonly applied in Machine Learning projects and research, as well as to real-world problems, because of its efficiency relative to its simplicity.

  • Decision Trees Decision Trees (DT) are a non-parametric supervised learning method. The model creates nodes and splits them into new ones depending on the features and their different values in the training data given as input. These nodes form a structure resembling a tree, hence the name.

  • K-Nearest Neighbors K-Nearest Neighbors (KNN) is a non-parametric supervised learning method. The KNN model uses “k” number of points from the training dataset to make an assumption about a new data point taken from the testing dataset. These k points are chosen based on how close they are locally to the data point in question, in an n-dimensional space made from the n features, so that we can identify to which “neighborhood” of data this new instance belongs, or to put it differently, to which class.

  • SVC Support Vector Classification or SVC is a supervised learning method that splits the data depending on their features into two classes, therefore solving a binary problem. While it is used for binary classification, if this process is used multiple times to solve sub-classification problems within the dataset, it can produce a multi-classification result.

  • MLP The Multilayer Perceptron or MLP is a deep learning method. The MLP is an Artificial Neural Network (ANN) consisting of an input layer, one or more hidden layers and an output layer of neurons, whose weights are learned from the training input.

For each algorithm, two different inputs were given, the BOW and the TF-IDF, with a total of ten different models.
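The training loop over the two representations and five algorithms can be sketched as follows; the classifier settings are scikit-learn defaults (plus a larger max_iter for the MLP), which are assumptions rather than reported hyperparameters.

```python
from sklearn.metrics import accuracy_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Factories so that a fresh, untrained classifier is built for each representation.
classifier_factories = {
    "NaiveBayes": lambda: MultinomialNB(),
    "DecisionTree": lambda: DecisionTreeClassifier(),
    "KNN": lambda: KNeighborsClassifier(),
    "SVC": lambda: SVC(),
    "MLP": lambda: MLPClassifier(max_iter=500),
}

# Xb_*/Xt_* and y_* come from the BOW / TF-IDF vectorization snippet above.
representations = {"BOW": (Xb_train, Xb_test), "TFIDF": (Xt_train, Xt_test)}

models, accuracies = {}, {}
for rep_name, (X_tr, X_te) in representations.items():
    for clf_name, factory in classifier_factories.items():
        model = factory().fit(X_tr, y_train)
        key = f"{clf_name}_{rep_name}"              # e.g. "SVC_TFIDF"
        models[key] = model
        accuracies[key] = accuracy_score(y_test, model.predict(X_te))
```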

Testing the models When the training process was finished, the testing input was fed to the ten models. This gives an idea of how well each model performs, so that we can act accordingly (Table 1). The accuracy of each model was:

Table 1 The accuracy of the models

As shown in Table 1, DT and SVC are the most accurate, with KNN and MLP following closely. Naive Bayes lags behind the rest at just over 50%.

Applying the models on the Tweets Once the models were trained and ready, we used the dataset of collected tweets and let each model make a decision for every tweet. As in the lexical methodology, we produced an average sentiment for each day using its 10 tweets. In this case, however, instead of an average numerical score (which in the lexical methodology was computed from the average of the word weights), we ended up with a single sentiment signal, that is, − 1, 0 or 1 for negative, neutral and positive, respectively. These signals were then exported into a dataset in which every day has 10 columns, each corresponding to one model.
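A possible aggregation, assuming the per-tweet predictions of each day are averaged and then mapped back to a −1/0/1 signal (the exact aggregation rule is an assumption), is:

```python
import numpy as np
import pandas as pd


def daily_signals(tweet_signals: pd.DataFrame) -> pd.DataFrame:
    """Collapse the 10 per-tweet predictions of each day into one signal per model.

    `tweet_signals` is assumed to have a "date" column and one column per model,
    each holding -1 / 0 / 1 predictions for a single tweet.
    """
    daily_mean = tweet_signals.groupby("date").mean()
    return np.sign(daily_mean).astype(int)   # back to -1 / 0 / 1 per model and day
```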

The Consumer Sentiment Index

To improve the predictive power of our final model, we used the Consumer Sentiment Index, a monthly index published by the University of Michigan. This index is a powerful tool and has been shown to accurately represent public sentiment. The fears or ambitions of the public greatly influence the trend of the whole market system and therefore of the stock market, and the index tries to capture these emotions and provide a score based on them. Our research, which focuses on trend forecasting rather than accurate stock price forecasting, could theoretically produce better outcomes if it were backed by an index that captures consumer sentiment. For this reason, we incorporated the Consumer Sentiment Index into our research and used it along with the other sentiment data discussed above. Because the Consumer Sentiment Index is published monthly, we used the same value for each day of the month.
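A simple way to repeat the monthly value over the trading days of each month with pandas (the variable names are illustrative) is sketched below.

```python
import pandas as pd


def csi_to_daily(csi_monthly: pd.Series, daily_index: pd.DatetimeIndex) -> pd.Series:
    """Forward-fill the monthly Consumer Sentiment Index over a daily trading calendar.

    `csi_monthly` is assumed to be indexed by month (e.g. "2018-01") and
    `daily_index` to be the trading days of the price dataset.
    """
    csi = csi_monthly.copy()
    csi.index = pd.to_datetime(csi.index)
    return csi.sort_index().reindex(daily_index, method="ffill")
```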

Final Products

The scores from the lexicons and the signals from the Machine Learning models were put into their respective datasets (CSV files) and were then used by the final model. The Consumer Sentiment Index was added alongside each of these datasets to boost their results.

Technical Analysis

The technical analysis is applied to the raw numerical data of the stocks (the opening, closing, high and low price of the stock ticker per day). Technical analysis techniques employ a number of technical indicators to forecast the stock trend/price. Typical traders use two or three indicators to predict the trend/price of the market, but the results are usually not good enough; on the other hand, using too many indicators also leads to poor results.

There are a number of indicators used in the stock market. The simplest technical indicators are the Simple Moving Average (SMA) and the Exponential Moving Average (EMA). The SMA is the unweighted mean of the previous n prices of the stock, where n is determined by the number of days considered (e.g., 10 days for the short term, 80 days for the long term). The SMA weighs all prices equally and is therefore influenced disproportionately by old prices. This is addressed by the EMA, which gives more weight to more recent prices while not completely ignoring older observations; the EMA therefore reacts more strongly to recent price fluctuations. We have a rising tendency when a short-term moving average crosses above a longer-term moving average, and a falling tendency when a short-term moving average crosses below a longer-term moving average. The information offered by moving averages has a time lag of several days, which is a disadvantage. To overcome this deficiency, more powerful indicators are used, such as the following (https://www.investopedia.com); a pandas sketch of these calculations is given after the list:

  (a) The MACD (Moving Average Convergence Divergence) is a trend-following momentum indicator based on the relationship between two exponential moving averages. The MACD line is calculated by subtracting the 26-day EMA from the 12-day EMA. The "signal line", a 9-day EMA of the MACD, can be used to trigger buy and sell signals: there is a buy signal if the MACD is above the signal line; otherwise, there is a sell signal.

  (b) The RSI (Relative Strength Index) is a momentum indicator that evaluates overbought and oversold conditions by measuring the magnitude of recent price fluctuations. The RSI is represented as an oscillator with a range of [0,100]. When the RSI is above 70, the stock is considered overbought, indicating that we should sell; when it is below 30, the stock is considered oversold, indicating that we should buy. If the RSI is between 30 and 70, we hold.

  (c) The Stochastic oscillator is a momentum indicator that attempts to predict price turning points by comparing the closing price of a stock to its price range over a period of time. It is used to generate overbought and oversold signals and spans the range [0,100]. Typically, if the stochastic oscillator is over 80 the stock is overbought, if it is under 20 it is oversold, and in the region of 20 to 80 it does not provide any additional information.

  (d) The Bollinger Bands (BB) are a technical analysis indicator defined by a set of lines plotted two standard deviations above and below a simple moving average (upper and lower bands). The belief is that the closer prices get to the upper band, the more overbought the market becomes, and the closer prices get to the lower band, the more oversold the market becomes. The majority of the price action occurs between the lower and upper bands, and a breakout above or below them is a rare occurrence.
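The indicators above can be computed from daily price data with pandas as in the following sketch; the 10-, 14- and 20-day windows (other than the 12/26/9-day MACD settings) are common defaults and assumptions here, and the RSI uses a simple rolling-mean variant.

```python
import pandas as pd


def add_basic_indicators(df: pd.DataFrame) -> pd.DataFrame:
    """`df` is assumed to have daily "Close", "High" and "Low" columns."""
    close, high, low = df["Close"], df["High"], df["Low"]

    # Moving averages
    df["SMA_10"] = close.rolling(10).mean()
    df["EMA_12"] = close.ewm(span=12, adjust=False).mean()
    df["EMA_26"] = close.ewm(span=26, adjust=False).mean()

    # MACD: 12-day EMA minus 26-day EMA, with a 9-day EMA signal line
    df["MACD"] = df["EMA_12"] - df["EMA_26"]
    df["MACD_signal"] = df["MACD"].ewm(span=9, adjust=False).mean()

    # RSI over 14 days (simple rolling-mean variant)
    delta = close.diff()
    gain = delta.clip(lower=0).rolling(14).mean()
    loss = (-delta.clip(upper=0)).rolling(14).mean()
    df["RSI"] = 100 - 100 / (1 + gain / loss)

    # Stochastic oscillator (%K) over 14 days
    low14, high14 = low.rolling(14).min(), high.rolling(14).max()
    df["Stoch_K"] = 100 * (close - low14) / (high14 - low14)

    # Bollinger Bands: 20-day SMA +/- 2 standard deviations
    sma20, std20 = close.rolling(20).mean(), close.rolling(20).std()
    df["BB_upper"], df["BB_lower"] = sma20 + 2 * std20, sma20 - 2 * std20
    return df
```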

Fiol-Roig et al. [24] successfully use the indicators MACD, EMA(C) (Exponential Moving Average of the Closing Price), EMA(V) (Exponential Moving Average of the Volume), Stochastic oscillator and BB to generate a decision tree that classifies buying-selling orders.

In our previous work [14] we employed the indicators MACD, RSI, Stochastic oscillator and BB on the raw data. In this paper, to improve our results for the LSTM and CNN models, we use three more technical indicators (a code sketch of their computation follows the list). These indicators are:

  • The Money Flow Index (MFI) is used to generate overbought and oversold signals and ranges in the interval [0,100]. If the MFI is over 80 the stock is considered overbought, if it is under 20 it is considered oversold, and it does not provide more information when it is in the range of 20 to 80. To calculate the MFI we need the high, low and closing prices as well as the volume, i.e., the number of shares traded that day, which is information we did not use in our previous approach. The RSI and MFI are quite similar indicators, but the MFI's advantage is that it takes the stock's volume into account.

  • The Average True Range (ATR) is a technical indicator developed by Welles Wilder Jr. [25]. It was created for commodities such as gold, oil and beef, but it is also used for stocks and indices. The ATR is used mainly by traders to open and close positions, and it also helps to measure the daily volatility of a stock with more precision. For this indicator we did not implement any buy or sell signal, as it is widely open to interpretation; we simply let the machine and deep learning algorithms reach their own conclusions.

  • The Williams %R is a momentum indicator developed by Larry Williams [26] that ranges in the interval [− 100,0]. A stock is considered overbought when the Williams %R is above − 20, which tells the trader to sell, and oversold when it is below − 80, which indicates a buy signal. If the Williams %R is between − 20 and − 80, it does not provide any information.
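These three indicators can be sketched in pandas as follows; the 14-day period is a common default rather than a reported setting, and the ATR uses a simple rolling mean instead of Wilder's smoothing.

```python
import pandas as pd


def add_extended_indicators(df: pd.DataFrame, period: int = 14) -> pd.DataFrame:
    """`df` is assumed to have daily "High", "Low", "Close" and "Volume" columns."""
    high, low, close, volume = df["High"], df["Low"], df["Close"], df["Volume"]

    # Money Flow Index
    tp = (high + low + close) / 3                       # typical price
    raw_mf = tp * volume
    pos_mf = raw_mf.where(tp > tp.shift(1), 0.0).rolling(period).sum()
    neg_mf = raw_mf.where(tp < tp.shift(1), 0.0).rolling(period).sum()
    df["MFI"] = 100 - 100 / (1 + pos_mf / neg_mf)

    # Average True Range (rolling-mean simplification of Wilder's smoothing)
    tr = pd.concat([high - low,
                    (high - close.shift(1)).abs(),
                    (low - close.shift(1)).abs()], axis=1).max(axis=1)
    df["ATR"] = tr.rolling(period).mean()

    # Williams %R
    hh, ll = high.rolling(period).max(), low.rolling(period).min()
    df["WilliamsR"] = -100 * (hh - close) / (hh - ll)
    return df
```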

Application

Data and Features

Firstly, we merge all the data that we discussed in "Sentiment Analysis" and "Technical Analysis" i.e.,

  • the daily historical data of the stock tickers: (a) AAPL, GOOG, NVDA and the Nasdaq Composite Index (because of the correlation between the stocks and the index) from Yahoo Finance, and (b) SPIS (S&P 500 Information Technology Sector) from investing.com, and

  • the sentimental data and the Consumer Sentiment Index.

Afterwards, the technical analysis indicators are calculated from the historical data and we add them to our dataset.

The required features are:

  (a) The Closing Price of the stock ticker

  (b) The Closing Price of the NASDAQ Composite Index

  (c) The Volume of the NASDAQ Composite Index

  (d) MACD

  (e) RSI

  (f) Stochastic Oscillator

  (g) Bollinger Bands

  (h) Consumer Sentiment Index

  (i) Score of the generic lexicon

  (j) Score of the VADER lexicon

  (k) Score of the Loughran-McDonald lexicon

  (l) Labels

For our new Extended Datasets, on which we test our two best models, LSTM and CNN, we decided to add to the existing dataset 3 more technical indicators and 10 more sentiment analysis features, explained in "Technical Analysis" and "Sentiment Analysis", respectively. These are, numbered to match the dataset descriptions that follow:

  1. Money Flow Index (MFI)

  2. Average True Range (ATR)

  3. Williams %R

  4. Naive Bayes Bag-Of-Words

  5. Naive Bayes TF-IDF

  6. Decision Trees Bag-Of-Words

  7. Decision Trees TF-IDF

  8. K-Nearest Neighbors Bag-Of-Words

  9. K-Nearest Neighbors TF-IDF

  10. SVC Bag-Of-Words

  11. SVC TF-IDF

  12. MLP Bag-Of-Words

  13. MLP TF-IDF

Creating the Labels and Scaling

The labels indicate whether the stock market trend is positive, negative, or does not change significantly, based on the price movements of a particular stock. To predict the stock ticker's trend 5 days ahead, we shift the closing price by 5 days and compare each day's closing price with the closing price 5 days later to determine whether the trend is bullish, bearish, or does not change significantly (in which case we merely hold). This is how our labels are created: if there is a bullish (positive) trend we append the number 2 for that day, the number 1 for hold, and the number 0 for a bearish (negative) trend.

The entire dataset is then modified, with the exception of the labels, using the MinMaxScaler offered by sklearn [27], so that each value in the dataset falls within the range [0,1].
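A sketch of the label construction and scaling described above, assuming a ±1% band for the "no significant change" class (the exact band is not stated here), is:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

HORIZON = 5        # predict the trend 5 trading days ahead
THRESHOLD = 0.01   # assumed +/-1% "no significant change" band

# `df` is assumed to be a pandas DataFrame holding the merged features,
# including a "Close" column.
future_return = df["Close"].shift(-HORIZON) / df["Close"] - 1
df["Label"] = np.select(
    [future_return > THRESHOLD, future_return < -THRESHOLD],
    [2, 0],        # 2 = bullish, 0 = bearish
    default=1,     # 1 = hold / no significant change
)
df = df.iloc[:-HORIZON].copy()   # the last days have no 5-day-ahead price

# Scale every column except the labels into [0, 1]
feature_cols = [c for c in df.columns if c != "Label"]
df[feature_cols] = MinMaxScaler().fit_transform(df[feature_cols])
```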

The Datasets

From the original dataset, three datasets are formed at this step. The first comprises all of the features listed above and is called the combined dataset (a–l). The second is made up entirely of numerical/economic data, such as closing prices, volume, technical analysis indicators, and labels (a–g + l). The last one is the sentimental dataset, which includes the Consumer Sentiment Index, the scores from the three lexicons, and the labels (h–l).

For the Extended datasets, the combined Extended dataset consists of the features (a–l) and (1–13), the numerical Extended dataset includes the features (a–g + l) and (1–3), and finally the sentimental Extended dataset consists of the features (h–l) and (4–13).

Training/Testing Datasets

The datasets are split into training and testing datasets to train and test our models. The training dataset consists of the days from 01.01.2000 until 31.12.2017, and the testing dataset consists of the following 2 years, namely 01.01.2018 until 31.12.2019.

Sequential Data

It is required to build sequential data from our current datasets to use the LSTM and CNN models properly. This is a critical stage because, to achieve better results, we must include data from the previous week and not only from the previous day. As a result, each data point in our scenario is formed by concatenating the data of five days. If our dataset is made up of n days, our sequential data are as follows:

$$\left\{ \left[ x_{1}, x_{2}, x_{3}, x_{4}, x_{5} \right], \left[ x_{2}, x_{3}, x_{4}, x_{5}, x_{6} \right], \ldots, \left[ x_{n-4}, x_{n-3}, x_{n-2}, x_{n-1}, x_{n} \right] \right\},$$

where x includes all the features (except the labels) of each day of the original dataset. Each new data point takes the label of the element corresponding to the last day i.e., the label of [x1, x2, x3, x4, x5] is the label of x5, the label of [x2, x3, x4, x5, x6] is the label of x6, the label of [x3, x4, x5, x6, x7] is the label of x7 and so on.
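A minimal sketch of this windowing is given below; it builds overlapping 5-day windows and assigns to each window the label of its last day.

```python
import numpy as np


def make_sequences(features: np.ndarray, labels: np.ndarray, window: int = 5):
    """Return (X, y) where X[i] = features[i : i + window] and y[i] is the label
    of the window's last day."""
    X = np.stack([features[i:i + window]
                  for i in range(len(features) - window + 1)])
    y = labels[window - 1:]
    return X, y   # X has shape (n - window + 1, window, n_features)
```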

Machine and Deep Learning Models

The last step is to feed our data into the machine and deep learning models. The machine learning models used in our experiments to predict stock market trends are k-nearest neighbours (KNN), decision trees, support vector classification (SVC), and the multilayer perceptron (MLP).

As for the deep learning models, we used an LSTM and a CNN architecture. Long Short-Term Memory (LSTM) is a type of recurrent neural network (RNN) that is commonly used for sequential data such as time series. CNNs are a type of deep neural network commonly used for image recognition tasks, but they can also be applied to time series data.

In the KNN method, we use three nearest neighbors as the k parameter, and for the Decision Trees method we set the maximum depth equal to 5. For the SVC method, we set the parameter C equal to 1, and for the MLP we use ReLU as the activation function and set hidden_layer_sizes to 100.
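With scikit-learn, these settings correspond to the following instantiations (all other parameters are left at their defaults, which is an assumption):

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

knn = KNeighborsClassifier(n_neighbors=3)                       # k = 3
dt = DecisionTreeClassifier(max_depth=5)                        # max depth 5
svc = SVC(C=1.0)                                                # C = 1
mlp = MLPClassifier(activation="relu", hidden_layer_sizes=(100,))
```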

The LSTM model consists of 3 stacked layers with ReLU as the activation function. In addition, Dropout is used to avoid overfitting. For the CNN model, 3 Conv1D layers were used with kernel sizes 7, 5 and 3, respectively, and ReLU activation. The total number of parameters of the model was about 300,000.

For the LSTM and CNN models, each of the three datasets is trained for 30 epochs with batch size 64 and a learning rate of 10^−4, using the Adam optimizer.
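A Keras sketch consistent with this description is shown below; the layer widths, dropout rate, "same" padding and the pooling layer are assumptions, since only the number of layers, kernel sizes, activation and training settings are reported above.

```python
from tensorflow import keras
from tensorflow.keras import layers

N_FEATURES = 12   # placeholder: number of feature columns in the chosen dataset
N_CLASSES = 3     # bearish / hold / bullish


def build_lstm(units=64, dropout=0.2):
    # 3 stacked LSTM layers with ReLU activation and dropout; unit counts and
    # dropout rate are illustrative assumptions.
    return keras.Sequential([
        keras.Input(shape=(5, N_FEATURES)),
        layers.LSTM(units, activation="relu", return_sequences=True),
        layers.Dropout(dropout),
        layers.LSTM(units, activation="relu", return_sequences=True),
        layers.Dropout(dropout),
        layers.LSTM(units, activation="relu"),
        layers.Dense(N_CLASSES, activation="softmax"),
    ])


def build_cnn(filters=128):
    # 3 Conv1D layers with kernel sizes 7, 5 and 3; "same" padding keeps the
    # 5-step windows from shrinking below the kernel size.
    return keras.Sequential([
        keras.Input(shape=(5, N_FEATURES)),
        layers.Conv1D(filters, 7, padding="same", activation="relu"),
        layers.Conv1D(filters, 5, padding="same", activation="relu"),
        layers.Conv1D(filters, 3, padding="same", activation="relu"),
        layers.GlobalAveragePooling1D(),
        layers.Dense(N_CLASSES, activation="softmax"),
    ])


# X_train/y_train and X_test/y_test are assumed to come from the windowing step.
model = build_lstm()
model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-4),
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(X_train, y_train, epochs=30, batch_size=64,
          validation_data=(X_test, y_test))
```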

Strategy

We apply a simple strategy in conjunction with our prediction models to deal with the stock market's high volatility in the best possible way. The strategy is the following: positions are closed (i.e., buy and sell decisions are made) either when the 5-day holding period has expired or when the stop-loss or take-profit percentage has been exceeded. These percentages are usually − 5% and 7%, respectively; however, they might vary depending on the asset. The approach could also be used as a trailing take-profit tool, although backtesting this is not possible owing to the nature of the data; it is nevertheless a very good tool for real-time use and is highly recommended.
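A simplified, long-only sketch of this strategy (one share per trade, with bearish and hold signals ignored, both of which are assumptions) is:

```python
def backtest(prices, signals, hold=5, stop_loss=-0.05, take_profit=0.07):
    """Toy backtest: open a long position of one share on a bullish signal (label 2)
    and close it after `hold` days, or earlier if the stop-loss / take-profit is hit."""
    profit = 0.0
    i = 0
    while i < len(prices) - hold:
        if signals[i] == 2:                       # predicted bullish trend
            entry = prices[i]
            exit_price = prices[i + hold]         # default: close after the holding period
            for j in range(1, hold + 1):
                r = prices[i + j] / entry - 1
                if r <= stop_loss or r >= take_profit:
                    exit_price = prices[i + j]    # stop-loss / take-profit triggered
                    break
            profit += exit_price - entry
            i += hold                             # simplification: move past the position
        else:
            i += 1
    return profit
```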

Buy and Hold Strategy

Our findings are compared to the Buy and Hold (B&H) strategy, which is a popular stock market approach. Investors purchase assets (stocks, ETFs, Indices, and so on) and hold them for the long term. Keep in mind that this strategy does not rely on technical analysis tools and is hence fairly straightforward. Hedge fund managers and many investors aim to "beat the market" as much as possible. When investors say they have "beat the market," they are referring to the fact that they have outperformed the B&H approach in terms of cumulative returns.

Results

In "Statistics", we discuss the profits and the accuracy of the methods when applied on the stock’s tickers of AAPL, GOOG, NVDA and SPIS, while in "Comparing passive investor’s and LSTM method’s returns with the original dataset" we compare the returns of the B&H strategy to the ones of the most profitable method, the LSTM method applied on the numerical data with the original dataset. In "Improvements in profit with the extended datasets", we show the improvements in profit with the Extended Datasets when applied to our algorithms instead of the original datasets. Finally, in "LSTM vs CNN with the extended datasets", we compare the profits of LSTM and CNN with the Extended Datasets.

Statistics

AAPL The profit of the Buy and Hold strategy for AAPL was 139.89$.

Table 2 [14] presents the AAPL profits of LSTM, DT and KNN with and without our strategy for each one of the three datasets in US dollars. Best case scenario for the AAPL ticker was the LSTM on numerical data.

Table 2 AAPL Profits for LSTM, DT and KNN (in US $) [14]

Table 3 [14] presents the accuracy of each method for the three datasets. Although LSTM on numerical data provided us with more profit, the accuracy of LSTM on sentimental data was slightly better (59%).

Table 3 AAPL Accuracy of LSTM, DT and KNN [14]

Figure 1 [14] provides us with the information about AAPL profits for LSTM, Decision Trees and KNN for the 3 datasets with or without our strategy. We use the suffix “N” (“S”) for each method to denote that the method is applied to numerical data (resp., sentimental data). When no suffix is used, the method is applied to the combined dataset (numerical & sentimental data). B&H is the abbreviation of the Buy-and-Hold strategy.

Fig. 1

AAPL Profits for LSTM, DT and KNN without strategy (left) and with strategy (right) [14]

Table 4 presents the AAPL profits of CNN, SVC and MLP with and without our strategy for each one of the three datasets in US dollars. The best method for the AAPL ticker was the CNN on numerical data, with a profit of 151.32$.

Table 4 AAPL profits for CNN, SVC and MLP (in US $)

Table 5 presents the accuracy of each method for the three datasets. LSTM on numerical data provided us with the most profit and its accuracy was 59%.

Table 5 AAPL accuracy of CNN, SVC and MLP

Figure 2 is a graphical representation of AAPL's profits for CNN, SVC and MLP for the 3 datasets with or without our strategy.

Fig. 2

AAPL profits for CNN, SVC and MLP without strategy (left) and with strategy (right)

Table 6 presents the AAPL profits of LSTM and CNN for the extended dataset with and without our strategy for each one of the three extended datasets in US dollars. Best method for the AAPL ticker was the LSTM on Sentimental data.

Table 6 AAPL profits with the extended dataset for LSTM and CNN (in US $)

Table 7 presents the accuracy of each method for the three datasets. CNN on combined data had an accuracy of 61%, which was the largest one.

Table 7 AAPL accuracy with the extended dataset of LSTM and CNN

Figure 3 is a graphical representation of AAPL's profits for LSTM and CNN for the 3 extended datasets with or without our strategy.

Fig. 3

AAPL profits with the extended dataset for LSTM and CNN without strategy (left) and with strategy (right)

GOOG The profit of the Buy and Hold strategy for the GOOG ticker was 1480.85$. Table 8 [14] shows that the best-case scenario for the GOOG ticker was the LSTM on numerical data, with a very small difference from the KNN on combined data. It is impressive that every scenario is profitable.

Table 8 GOOG Profits for LSTM, DT and KNN (in US $) [14]

Table 9 [14] shows that the accuracy of LSTM on numerical data was 53% while the accuracy of Decision Trees on sentimental data was slightly better (54%).

Table 9 GOOG accuracy of LSTM, DT and KNN [14]

Figure 4 [14] is a graphical representation of the GOOG’s profits for LSTM, Decision Trees and KNN for the 3 datasets with or without our strategy.

Fig. 4

GOOG profits for LSTM, DT and KNN without strategy (left) and with strategy (right) [14]

Table 10 presents the GOOG profits of CNN, SVC and MLP with and without our strategy for each one of the three datasets in US dollars. Best method for the GOOG ticker was the CNN on Numerical data.

Table 10 GOOG profits for CNN, SVC and MLP (in US $)

Table 11 presents the accuracy of each method for the three datasets. SVC on combined and numerical data had an accuracy of 58%.

Table 11 GOOG accuracy of CNN, SVC and MLP

Figure 5 is a graphical representation of the GOOG’s profits for CNN, SVC and MLP for the 3 datasets with or without our strategy.

Fig. 5

GOOG profits for CNN, SVC and MLP without strategy (left) and with strategy (right)

Table 12 presents the GOOG profits of LSTM and CNN with and without our strategy for each one of the three extended datasets in US dollars. The best method for the GOOG ticker was the CNN on numerical data, which was far superior to any other method.

Table 12 GOOG profits with the extended dataset for LSTM and CNN (in US $)

Table 13 presents the accuracy of each method for the three datasets. CNN on numerical data provided us with the most profit and its accuracy was 57%.

Table 13 GOOG accuracy with the extended dataset of LSTM and CNN

Figure 6 is a graphical representation of GOOG's profits for LSTM and CNN for the 3 extended datasets with or without our strategy.

Fig. 6

GOOG profits with the extended dataset for LSTM and CNN without strategy (left) and with strategy (right)

NVDA The profit of the Buy and Hold strategy for the NVDA ticker was 135.50$. Table 14 [14] shows that the best-case scenario for the NVDA ticker was the LSTM on numerical data.

Table 14 NVDA profits for LSTM, DT and KNN (in US $) [14]

Table 15 [14] shows that for the NVDA the most accurate method is LSTM on numerical data while the accuracy of LSTM on sentimental data is slightly worse.

Table 15 NVDA Accuracy of LSTM, DT and KNN [14]

Figure 7 [14] is a graphical representation of the NVDA’s profits for LSTM, Decision Trees and KNN for the 3 datasets with or without our strategy.

Fig. 7

NVDA Profits for LSTM, DT and KNN without strategy (left) and with strategy (right) [14]

Table 16 presents the NVDA profits of CNN, SVC and MLP with and without our strategy for each one of the three datasets in US dollars. Best method for the NVDA ticker was the CNN on numerical data.

Table 16 NVDA profits for CNN, SVC and MLP (in US $)

Table 17 presents the accuracy of each method for the three datasets. CNN on numerical data provided us with the most profit and its accuracy was 58%.

Table 17 NVDA accuracy of CNN, SVC and MLP

Figure 8 is a graphical representation of the NVDA’s profits for CNN, SVC and MLP for the 3 datasets with or without our strategy.

Fig. 8

NVDA Profits for CNN, SVC and MLP without strategy (left) and with strategy (right)

Table 18 presents the NVDA profits of LSTM and CNN for the extended dataset with and without our strategy for each one of the three extended datasets in US dollars. Best method for the NVDA ticker was the CNN on Numerical data.

Table 18 NVDA profits with the extended dataset for LSTM and CNN (in US $)

Table 19 presents the accuracy of each method for the three extended datasets. CNN on numerical data provided us with the most profit and its accuracy was 55%. Interestingly, even accuracies below 50% can yield profitable strategies.

Table 19 NVDA accuracy with the extended dataset of LSTM and CNN

Figure 9 is a graphical representation of the NVDA’s profits for LSTM and CNN for the 3 Extended datasets with or without our strategy.

Fig. 9

NVDA Profits with the extended dataset for LSTM and CNN without a strategy (left) and with strategy (right)

S&P Information Technology The profit of the Buy and Hold strategy for the SPIS index was 1521.35$. According to Table 20 [14], the best-case scenario for the SPIS ticker was the LSTM with a strategy on numerical data.

Table 20 SPIS Profits for LSTM, DT and KNN (in US $) [14]

Table 21 [14] shows that for the SPIS ticker, the accuracy of LSTM on all types of data is 60%. The Decision Trees method on numerical data is 60% accurate.

Table 21 SPIS Accuracy of LSTM, DT and KNN [14]

Figure 10 [14] is a graphical representation of the SPIS’s profits for LSTM, Decision Trees and KNN for the 3 datasets with or without our strategy.

Fig. 10

SPIS profits for LSTM, DT and KNN without strategy (left) and with strategy (right) [14]

Table 22 presents the SPIS profits of CNN, SVC and MLP with and without our strategy for each one of the three datasets in US dollars. Best method for the SPIS ticker was the MLP on numerical data with strategy.

Table 22 SPIS Profits for CNN, SVC and MLP (in US $)

Table 23 presents the accuracy of each method for the three datasets. MLP on numerical data had an accuracy of 56%.

Table 23 SPIS Accuracy of CNN, SVC and MLP

Figure 11 is a graphical representation of the SPIS’s profits for CNN, SVC and MLP for the 3 datasets with or without our strategy.

Fig. 11

SPIS Profits for CNN, SVC and MLP without strategy (left) and with strategy (right)

Table 24 presents the SPIS profits of LSTM and CNN for the extended dataset with and without our strategy for each one of the three extended datasets in US dollars. Best method for the SPIS ticker was the LSTM on Sentimental data.

Table 24 SPIS profits with the extended dataset for LSTM and CNN (in US $)

Table 25 presents the accuracy of each method for the three datasets. LSTM on Sentimental data provided us with the most profit and its accuracy was 62%.

Table 25 SPIS accuracy with the extended dataset of LSTM and CNN

Figure 12 is a graphical representation of SPIS's profits for LSTM and CNN for the 3 extended datasets with or without our strategy.

Fig. 12

SPIS profits with the extended dataset for LSTM and CNN without strategy (left) and with strategy (right)

Comparing Passive Investor’s and LSTM Method’s Returns with the Original Dataset

The results of our previous work [14] show that the LSTM method applied to numerical data performs better than the LSTM on combined or sentimental data, as well as better than the KNN and DT methods on any type of data. The following table shows the profits and the returns of the LSTM method on the numerical data of the original dataset and of the Buy and Hold strategy.

From Table 26 [14] we calculate the average returns of each method:

Table 26 Returns of B&H strategy and LSTM on numerical data [14]

The average return of the B&H strategy was 33.52%, while the average return of the LSTM on numerical data was 80.42%.

So, the LSTM method on numerical data offers about 2.5 times more profit on average than the passive investor’s strategy.

Improvements in Profit with the Extended datasets

Table 27 shows that the extended dataset is far better than the original one used in [14], with the only exception of the numerical data with the LSTM method, where we would have a 33.72% loss in profit. In every other case, the profits were, on average, substantially increased. The best improvement was with the CNN on the sentimental dataset, reaching a 140% average increase.

Table 27 Average % improvement in profits when applying the extended dataset in comparison to the original one

LSTM vs CNN with the Extended Datasets

From Table 28, we conclude that, over the 2-year testing period, the returns of all deep learning methods with the extended datasets outperformed the B&H strategy, and the best method (returning the most profits) was the CNN on numerical data with a 61.55% Return on Investment (ROI). Furthermore, the improvement over our previous results [14] on sentimental data is significant.

Table 28 Return on investment with the extended datasets for LSTM and CNN and comparison to B&H strategy

Conclusions

Having developed in our previous research [14] a system with the potential to capture stock market trends, we extended it further in this paper, making it more accurate, trustworthy and approachable from different perspectives. In our first attempts, the numerical data in combination with the LSTM model proved to be the most profitable option. Knowing this, we tried to make the different types of data work better together. Our extended research boosted the accuracy and the profits of the textual and the combined data while maintaining the high precision of the numerical data, thus giving more options for accurately predicting stock market movements. Providing more alternative methods for making such predictions, whether using only textual data derived from a Machine Learning method under an LSTM model, only numerical data under a CNN model, and so on, can only increase the forecasting capabilities available.

Regarding the contribution of this paper, we utilized a combination of numerical data and textual/sentimental data to predict stock market trends, whereas previous work has mainly focused on one or the other. We used a variety of machine learning and deep learning models to process both types of data, including LSTM, CNN, KNN, decision trees, SVC, and MLP. Furthermore, we evaluated the performance of our approach on multiple datasets, and we used not only accuracy but also profitability as a metric; there are cases where high accuracy does not imply high profitability. Finally, we provided insights into how sentiment analysis can be used to strengthen technical analysis in predicting stock market trends and demonstrated the significant improvement achieved with the extended dataset.

Taking all of the above into consideration, the question that arises is: “If there are such alternatives for forecasting stock market trends, is there always a way to predict a company's stock trends?” If so, an even bigger question arises against the efficient market hypothesis. While many researchers, ourselves included, have shown that certain patterns are recognizable and exploitable, to what extent remains debatable. More research should be done with data on companies with different media coverage and of different sizes before finalizing such a statement; that is, more samples of different scales are needed. Furthermore, while we have implemented certain methods, there are still many different approaches available that need to be tested and developed (e.g., the use of ontologies [28]).

In the future, we hope to evolve our technology into an autonomous system that can forecast a stock ticker's trend every day. To make this work in the long run, it is necessary to train the system online over time to keep it up to date. We will also experiment with different strategies for utilizing different types of data in order to increase forecast accuracy.