1 Introduction

In the present era, the study of financial markets have extensively emerged as a cornerstone for economic growth and prediction. The dynamics of financial markets are uncertain and stochastic in nature due to insufficient information flow among the market participants. The advancements in data acquisition equipment and computational techniques facilitate us to understand the intrinsic dynamics of stock price which help for a better prediction and accurate pricing. AraúJo and Ferreira (2013) proposed a morphological-rank-linear evolutionary technique for stock market prediction which does not follow the random walk principle. Efendi et al. (2018) have presented a new approach for stock market forecasting with the help of random auto-regression time series model. Several authors (Nayak et al. 2020; Verousis et al. 2018; Eholzer and Roth 2017) have applied different techniques to characterize the well-known stylized facts associated with the high-frequency stock price data sets. It has been confirmed from the high-frequency trading (Hasbrouck 2018; Seddon and Currie 2017), that the empirical data of stock return exhibits the power-law (Riyal et al. 2016; Singh and Karmeshu 2014) which creates an interest among various researchers to define a precise model to capture the dynamics of the real data. In this perspective, Gerig et al. (2009) proposed a mixture model for the return distribution to observe the tail behavior through Student’s t-distribution which is associated positive degrees of freedom (df). However, the discrete values of ‘df’ cannot describe the tail fluctuation of high-frequency data. Thus, it becomes necessary to apply the non-extensive concept (Singh and Karmeshu 2014; Singh et al. 2015; Tavayef et al. 2018) to observe the tail behavior of the stock return. As a result, Borland (2002) and Borland and Bouchaud (2004) have used the non-extensive Tsallis’ framework in their corresponding stochastic differential equation (SDE) to capture the long range memory effets. Stanley (2003) and Buchanan (2012, (2013) have noticed that all types of stock market exhibit power-law in the tail region. Pan and Sinha (2008) examined the same for Indian market, whereas, multifractal detrended fluctuation analysis (MF-DFA) has been performed by Kumar and Deo (2009) to analyze Indian market. Jaynes (1957) and Tsallis (2004) have proposed two different entropy concepts where the maximization of Shannon entropy unable to describe tail fluctuation of stock returns. However, maximization of entropy with parameter q, successfully describes the the tail behavior of underlying stock returns. It is significant to observe the convergence of Tsallis entropy to Shannon entropy as q approaches to 1 (Mukherjee et al. 2019; Bebortta et al. 2020). Thus, Senapati and Karmeshu (2016) have proposed the q-Gaussian distribution subject to entropy optimization for intra-day stock returns (IDR) in high-frequency trading. They have also validated their proposed distribution with six intra-day high-frequency stock return data sets.

As the proposed return distribution captures the tail fluctuation, it is worthwhile to calculate the CCDF for the proposed return distribution. Hence in this paper, we present a rationale mathematical expression for the CCDF in terms of Hypergeometric2F1 function to characterize tail fluctuation of underlying returns through different values of the non-extensive parameter ‘q’. We have estimated the parameter‘q’ from empirical data by minimizing the generalized symmetric Jensen Shannon (JS) measure between the return distributions and empirical histograms of six different high-frequency data. For the sake of convenience, this article also presents a precise algorithm for the estimation of parameter parameter‘q’ . Since option price relies on the underlying stock price, we have derived the underlying stock distribution in terms of q-lognormal distribution. For ‘q’ equals to one, this stock distribution also converges to the well-known log-normal distribution. Consequently, we have successfully computed the corresponding non-Gaussian option price. It is interesting to observe, our option pricing model also coincides with the famous Black–Scholes option pricing model (Black and Scholes 1973) as q tends to 1.

The remaining of this paper is organized as follows: Sect. 2, describes the computation of non-Gaussian q distribution using the maximum Tsallis’ entropy framework. Section 3, provides a brief description for the computation of Kullback–Leibler (KL) and JS measure. The next section, deals with the procedure regarding the estimation of ‘q’ parameter for the Powershares QQQ of NASDAQ 100 index high-frequency data set. Section 5 examines the tail fluctuation in five different stocks viz., SPDR S&P 500 ETF, General Motors, Ever-source Energy, Coca-Cola and NQ Mobile Inc., listed in various stock indexes. We provide the stock price distribution and the option pricing model in Sect. 6. Eventually, we conclude this paper in Sect. 7.

2 Entropy Frame Work

2.1 Intra-day Stock Return Distribution

The intra-day stock return is defined as

$$\begin{aligned} R(T) = \log \left( {\frac{{{S_T}}}{{{S_0}}}} \right) . \end{aligned}$$
(1)

The non-extensive Tsallis (Tsallis 2004; Senapati and Karmeshu 2016) entropy for the above stock return is expressed as

$$\begin{aligned} {S_q}\left( {R,T} \right) = \frac{1}{{q - 1}} - \frac{{\int _{ - \infty }^\infty {{{\left\{ {{g_q}(R,T)} \right\} }^q}dR} }}{{q - 1}} \end{aligned}$$
(2)

The first and second moment constraints along with the normalization constraints are given below:

$$\begin{aligned}&\int _{-\infty }^{+\infty } g_{q}(R,T) dR = 1, \int _{ - \infty }^{ + \infty } {R{{{\tilde{g}}}_{es}}} (R,T)dR = \mu T, \,\,\hbox {and } \\&\quad \int _{ - \infty }^{ + \infty } {{{(R - \mu T)}^2}{{{\tilde{g}}}_{es}}} (R,T)dR = {\sigma ^2}T, \end{aligned}$$
(3)

where \({{{\tilde{g}}}_{es}}\left( {R,T} \right) = \frac{{{{\left\{ {{g_q}(R,T)} \right\} }^q}}}{{\int {{{\left\{ {{g_q}(R,T)} \right\} }^q}dR} }}\) defined as the escort probability distribution (Gradojevic and Genay 2011; Namakia et al. 2013; Abe and Bagci 2005). Abe and Bagci (2005) described the difference between the uses of the general probability distribution and the escort distribution. This papers also derives the relation between ordinary expectation and q-expectation for randomly varying quantity. The existence of the power-law distribution under the non-extensive statistical phenomena is well supported by the q-expectation which is validated through their paper. The Lagrangian is defined as

$$\begin{aligned} L\left( {{g_q},R} \right)& = \frac{{1 - \int \limits _{ - \infty }^\infty {{{\left[ {{g_q}\left( {R,T} \right) } \right] }^q}dR} }}{{q - 1}} + {\lambda _1}\left( {1 - \int _{ - \infty }^{ + \infty } {{g_q}} (R,T)dR} \right) \\&\quad + {\lambda _2}\left( {\mu T - \int _{ - \infty }^{ + \infty } {R{{{\tilde{g}}}_{es}}} (R,T)dR} \right) + {\lambda _3}\left( {{\sigma ^2}T - \int _{ - \infty }^{ + \infty } {{{{\tilde{g}}}_{es}}} (R,T){{(R - \mu T)}^2}dR} \right) , \end{aligned}$$
(4)

with \(\lambda _1\), \(\lambda _2\) and \(\lambda _3\) being the Lagrangian parameters which are obtained from the above constraints. The corresponding Euler–Lagrange equation is given as

$$\begin{aligned} \frac{{\partial L}}{{\partial {g_{q}(R,T)}}} - \frac{d}{{dR}}\left( {\frac{{\partial L}}{{\partial g_{q}'(R,T)}}} \right) = 0, \end{aligned}$$
(5)

Thus solving the Eq. (5), the maximum entropy probability distribution for IDR can be obtained :

$$\begin{aligned} {g_q}(R,T) = \frac{1}{Z}{\left\{ {1 - \frac{{1 - q}}{{(3 - q){\sigma ^2}T}}{{(R - \mu T)}^2}} \right\} ^{\frac{1}{{1 - q}}}},\quad - \infty< R < \infty , \end{aligned}$$
(6)

where the Z represents normalization constant.

$$\begin{aligned} Z = \sigma \sqrt{T} \sqrt{\frac{(3 -q)\pi }{q -1}}\frac{\varGamma {(\frac{3 -q}{2q -2}})}{\varGamma {(\frac{1}{q -1})}},\quad 1< q < 3. \end{aligned}$$
(7)

It is quite significant to observe that, as \(q\rightarrow 1\) the \(g_q(R,T)\) coincide with Gaussian distribution. From the Eq. (6), we can also obtain the CCDF as,

$$\begin{aligned} G_q(|R|> \xi )& = \int _\xi ^{\infty } g_{q}(R,T)dR=\frac{{2\sigma \sqrt{T} }}{{\varGamma (\alpha )Z}}{\beta ^{ - \gamma }}{\gamma ^{\frac{1}{{1 - q}}}}\varGamma \left( { - \frac{\gamma }{2}} \right) \\&\quad \times \,2F1\left[ { - \frac{\gamma }{2},\,\frac{1}{{q - 1}},\,\,\alpha ,\, - \frac{\gamma }{{{\beta ^2}}}} \right] , \end{aligned}$$
(8)

For \(R> > 0\), the asymptotic nature of CCDF is denoted as

$$\begin{aligned} {G_q}(\left| R \right| > \xi )& = \frac{{2\sigma \sqrt{T} }}{{\varGamma (\alpha )Z}}{\beta ^{ - \gamma }}{\gamma ^{\frac{1}{{1 - q}}}}\varGamma \left( { - \frac{\gamma }{2}} \right) \\&\quad \times \, 2F1\left[ { - \frac{\gamma }{2},\,\frac{1}{{q - 1}},\,\,\alpha ,\, - \frac{\gamma }{{{\beta ^2}}}} \right] \sim \xi ^{-\frac{{3 - q}}{{q-1}}}, \quad {\mathrm{for}} \quad 1< q < 3, \end{aligned}$$
(9)

where \(\alpha = {\frac{{q + 1}}{{2\left( {q - 1} \right) }}}\),   \(\gamma = \frac{{3 - q}}{{q - 1}}\),  2F1 is the Hypergeometric2F1 function, and \(\varGamma (.)\) denotes gamma function.

It is also observed that Eq. (9), exhibits power-law behavior which are in excellent agreement with the empirical data sets (Stanley 2003; Pan and Sinha 2008). It has been interesting to notice that the t-distribution is also capable of describing the power-law behavior of the high-frequency intra-day returns. Therefore in the next subsection, we provide some basic properties of the t-distribution in order to examine the relation between the t-distribution and the q-Gaussian distribution.

2.2 Student’s t-Distribution

The probability distribution function of Student’s-t (Brach and Dunn 2004) is given by,

$$\begin{aligned} {\phi _a }(R) = \frac{1}{{\sqrt{\pi a } }}\frac{{\varGamma \left( {\frac{{1 + a }}{2}} \right) }}{{\varGamma \left( {\frac{a }{2}} \right) }}{\left( {1 + \frac{{{{R }^2}}}{a }} \right) ^{ - \frac{{1 + a }}{2}}},\quad -\infty< R < \infty , \end{aligned}$$
(10)

where ‘a’ represents the degrees of freedom which can takes only non-negative integral values. This Eq. (10), is also used to compute the corresponding CCDF and for the higher values of return, we obtain the asymptotic nature of CCDF as

$$\begin{aligned} {\varPhi _a }(|R|> \xi ) = \int \limits _\xi ^{ \infty } {{\phi _a }(R)dR} \sim {\xi ^{ - a }} , \quad a>0. \end{aligned}$$
(11)

It is quite obvious from Eqs. (9) and (11), to have a relation between the exponent of the q-Gaussian distribution with the degrees of freedom t-distribution. The relation is as follows

$$\begin{aligned} a = \frac{{3 - q}}{{q - 1}}. \end{aligned}$$
(12)

3 Computation of q Parameter

In virtue of q-Gaussian distribution, there is a growing interest among various researchers to examine different low-frequency financial time series such as monthly data, weekly data and the daily data (Gradojevic and Genay 2011; Namakia et al. 2013). For the calculation of parameter ‘q’, Gradojevic and Genay (2011) have first computed the sum of the squares of errors between observed returns distribution with the proposed q-Gaussian distribution and then minimized the error. As a result, the estimated values of the non-extensive parameter ‘q’ found to be 1.51, 1.54 and 1.60 of weekly, monthly and daily respectively for TSE, KS11, SSE, DIJA30, NASDAQ100 and S&P 500 respectively. Similarly, Namakia et al. (2013) have found the values of q corresponding to monthly, weekly and daily empirical data sets as 1.63, 1.64 and 1.62 respectively. However, we have provided the estimation procedure for ‘q’ for different high-frequency stock returns.

3.1 Evaluation of JS Measure

Nishii (1988), has described the estimation technique for parameter ‘q’ when the true distribution is unknown. Generally this type of model appears when data available to a family containing the exact distribution is not sufficient. Following this method, here we compare the proposed q-Gaussian distribution with the histogram of empirical intra-day data sets of stock returns. In this situation, Kullback–Leibler (KL) measure plays an important role where we calculate the distance between two distributions. This paper calculates KL measure between our proposed q-distribution and the observed empirical distribution. According to Naudts (2011), the non-extensive generalized KL measure between the empirical distribution f(RT) and the proposed distribution \(g_q(R,T)\) is defined as,

$$\begin{aligned} \begin{aligned} KL\left\{ {{g_q}(R,T)||f(R,T)} \right\}&= \int {\left[ {\frac{{{{\left\{ {{g_q}(R,T)} \right\} }^{q - 1}}}}{{q - 1}} - \frac{1}{{q - 1}}{{\left\{ {\frac{{f(R,T)}}{{{g_q}(R,T)}}} \right\} }^{q - 1}}} \right] } \,dR \, \\&\quad - \int {\left\{ {\frac{{{g_q}(R,T) - f(R,T)}}{{f(R,T)}}} \right\} } {\left\{ {f(R,T)} \right\} ^q}\,dR. \end{aligned} \end{aligned}$$
(13)

The intra-day histogram of return data is treated as the discretized form of the empirical data. Thus discretizing Eq. (13), we have

$$\begin{aligned} \begin{aligned} KL\left\{ {{g_q}(R,T)||f(R,T)} \right\}&= \sum \limits _R {\left[ {\frac{{{{\left\{ {{g_q}(R,T)} \right\} }^{q - 1}}}}{{q - 1}} - \frac{1}{{q - 1}}{{\left\{ {\frac{{f(R,T)}}{{{g_q}(R,T)}}} \right\} }^{q - 1}}} \right] } \,\varDelta R \, \\&\quad - \sum \limits _R {\left\{ {\frac{{{g_q}(R,T) - f(R,T)}}{{f(R,T)}}} \right\} } {\left\{ {f(R,T)} \right\} ^q}\,\varDelta R. \end{aligned} \end{aligned}$$
(14)

Thus to overcome the asymmetric nature of KL measure, we use symmetric JS measure (Fuglede and Topsøe 2004) which is expressed as:

$$\begin{aligned} JS\left\{ {{g_q}(R,T)||f(R,T)} \right\}& = \frac{1}{2}\left[ KL\left\{ {{g_q}(R,T)||{{\left( {{g_q}(R,T) + f(R,T)} \right) } / 2}} \right\} \right. \\&\quad \left. + \,KL\left\{ {f(R,T)||{{\left( {{g_q}(R,T) + f(R,T)} \right) } / 2}} \right\} \right] . \end{aligned}$$
(15)

We compute the value of parameter ‘q’ by minimizing the JS divergence. The estimated value of non-extensive parameter q provides a close agreement along with the empirical returns. The following formula is used to get the precise value of the parameter ‘q’ from the empirical stock-return distribution.

$$\begin{aligned} \mathop D\nolimits _{QQQ}^{(q)} \left( {{g_q}||f} \right) = \mathop {\arg \min }\limits _{q \in (1,3)} \left\{ {JS\left( {{g_q}(R,T)||f(R,T)} \right) } \right\} . \end{aligned}$$
(16)

4 Computation of q for NASDAQ Data

The above discussed method is used to determine the value of parameter q for minute wise IDR data of Powershares QQQ of NASDAQ 100 (http://www.futurestickdata.com). For the ease of computation, 30 min of each trading data has been removed from 1.12.2004 to 31.12.2008 (Gerig et al. 2009; Liu et al. 1999). Trend behavior of minute wise data of Powershares QQQ of NASDAQ has been shown in Fig. 1a. From the given data sets, we have calculated the normalized return and the corresponding returns is shown in Fig. 1b. However, to show a comparison between our proposed model and the empirical returns, we need to classify the range of observed data into individual class intervals. For this purpose, we use the famous Struges formula (Martinez and Martinez 2002) for determining the bin number as, \(k = \left\lfloor {{{\log }_2}N} \right\rfloor +1\), where ⌊ ⌋ represents the floor function. Using this formula, we get 25 class intervals for \(N = O({10^6})\) number of observations (Fig. 2). It is observed from Fig. 3, that the JS measure between our proposed distribution and the empirical data attains minimum value at point A. Further, to get an insight, we have presented the computed JS measure values corresponding to different values of q around 1.5 in Table 1. It is clearly observed from the table that the JS measure provides a least value at \(q = 1.513\) i.e \(JS_{QQQ}= 2.9604848\). Thus \(q_{QQQ} = 1.513\) is found to be the minimum value of Powershares QQQ of NASDAQ 100 data. We also provide the probability density functions for stock returns corresponding to ‘q’ viz., 1.0, 1.5 and 1.513. Algorithm 1, describes the successive steps towards the estimation of non-extensive parameter q. We have successfully computed all the values of q corresponding to six different stock data sets with the use of the following algorithm 1.

figure a
Fig. 1
figure 1

a Intra-day trend behavior for Powershares QQQ (NASDAQ 100) representing minute-wise data ranging from 01/12/2004 to 31/12/2008, b Plot of JS measure against parameter q for Powershares QQQ (NASDAQ 100)

Fig. 2
figure 2

Plot of JS measure against parameter q for Powershares QQQ (NASDAQ 100)

Fig. 3
figure 3

a Variation of symmetric JS measure of Fig. 2 around q = 1.5. b Variation of JS symmetric measure of Student’s t from empirical Powershares QQQ data

In addition to this, we also described the asymptotic nature of the underlying return distribution at \(q = 1.513\) using the CCDF provided in Eq. (9), for large values of ‘\(\xi\)’ (Bebortta et al. 2020) and we get, \(G_{QQQ}(R)\sim \xi ^{-2.96}\).

It is quite interesting to notice that the negative exponent of high-frequency return distribution describes the stylized phenomena (Stanley 2003; Buchanan 2012) of the cubic law for stock returns. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for Powershares QQQ is shown in Table 1. It can be observed, the minimum JS measure provides the minimum value of q. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for Powershares QQQ is shown in Table 1. It can be observed, the minimum JS measure provides the required value of q.

Table 1 Tabulated values of JS measure for Powershares QQQ through various values of q

The Fig. 4a, shows the probability density functions of the empirical Powershares QQQ along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513, Gaussian distribution and Student’s t-distribution.

Fig. 4
figure 4

a PDFs of q-Gaussian for q = 1.0, 1.5 and 1.513 together with Student’s t-distribution and Powershares QQQ of NASDAQ 100 normalized returns, b PDFs of q-Gaussian for \(q = 1.0\), 1.5 and 1.513 together with Student’s t-distribution and SPDR S&P 500 ETF (SPY) of NYSE normalized returns

5 Stylized fact analysis

SPDR S&P 500 ETF, General Motors, Ever-source Energy, Coca-Cola and NQ Mobile Inc., are also examined to observe the cubic law for high-frequency intra-day returns. As discussed earlier, we performed the JS measure between all the given stock intra-day returns data with our proposed distribution. Thereafter, we determine the parameter q for the respective stock returns. JS measures for all the above stock data against different values of ‘q’ lying in the interval (1, 3) are evaluated.

We have also calculated the JS measure for the Student’s-t distribution corresponding to all the underlying data sets. The divergence values for m = 3, are observed to be (1) 2.960543 (QQQ), (2) 5.067457 (SPY), (2) 5.067457 (SPY), (3) 5.176774 (GM), (4) 5.648495 (ES), (5) 4.904609 (KO) and (6) 4.858626 (NQ). Further, tabulated values for the JS divergence are also given to identify the value of parameter q with minimum JS value. For the sake of convenience, the parameter q with minimum JS value is represented through bold form in each table. From the Tables 12,  34, 5, and 6 the minimum values of the JS divergence are found as (1) 2.9604848 (QQQ), (2) 5.0674539 (SPY), (3) 5.1767703 (GM), (4) 5.6484839 (ES), (5) 4.9046019 (KO), and (6) 4.8586260 (NQ) obtained for \(q = 1.513\), \(q = 1.505\), \(q = 1.505\), \(q= 1.508\), \(q = 1.507\), and \(q =1.499\) respectively. So, using these values of ‘q’, we have obtained closed agreements between the empirical data sets and the proposed q-Gaussian distribution. The CCDF obtained in Eq. (9), captures the stylized fact from empirical returns with various negative exponents and can be represented as:

$$\begin{aligned}&G_{SPY}(\xi )\sim {\xi ^{ -3.01}}, \quad G_{GM}(\xi )\sim {\xi ^{ - 2.96}}, \quad G_{ES}(\xi )\sim {\xi ^{ -2.94}}, \\&G_{KO}(\xi )\sim {\xi ^{ -2.96}}, \quad G_{NQ}(\xi )\sim {\xi ^{ -2.94}}, \quad {\hbox {for large}} \quad \xi . \end{aligned}$$
(17)

The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for SPDR S&P 500 ETF is shown in Table 2. It can be observed, the minimum JS measure provides the required value of q. The Fig. 5b, shows the probability density functions of the empirical SPDR S&P 500 ETF along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513, Gaussian distribution and Student’s t-distribution. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for General Motors is shown in Table 4. It can be observed, the minimum JS measure provides the required value of q. The Fig. 5a, shows the probability density functions of the empirical General Motors along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513 and 1.513, Gaussian distribution and Student’s t-distribution. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for Ever Source Energy is shown in Table 3. It can be observed, the minimum JS measure provides the required value of q. The Fig. 5b, shows the probability density functions of the empirical Eversource Energy along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513, Gaussian distribution and Student’s t-distribution. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for Coca-Cola is shown in Table 5. It can be observed, the minimum JS measure provides the required value of q. The Fig. 6a, shows the probability density functions of the empirical Coca-Cola along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513, Gaussian distribution and Student’s t-distribution. The tabulated JS measure corresponding to the non-extensive parameter q around 1.5 for NQ Mobile Inc. is shown in Table 6. It can be observed, the minimum JS measure provides the required value of q. The Fig. 6b, shows the probability density functions of the empirical NQ Mobile Inc. along with the q-Gaussian distributions for \(q= 1.0\), 1.5 and 1.513, Gaussian distribution and Student’s t-distribution. It is customary to show the CCDF in log-log scale to observe the exponent of power-law from both empirical and theoretical model. The slope of the straight line from the CCDF figure corresponding to each underlying return data set provides respective exponent measure. The existence of power-law behavior for six different stock returns is shown in the Fig. 7 in terms of corresponding CCDFs.

Table 2 Tabulated values of JS measure for SPY through various values of q
Table 3 Tabulated values of JS measure for Eversource Energy through various values of q
Table 4 Tabulated values of JS measure for General Motors through various values of q
Table 5 Tabulated values of JS measure for Coca-Cola through various values of q
Table 6 Tabulated values of JS measure for NQ Mobile Inc. through various values of q
Fig. 5
figure 5

a PDFs of q-Gaussian for \(q = 1.0\), 1.5 and 1.513 together with Student’s t-distribution and General Motors (GM) of S&P 500 normalized returns, b PDFs of q-Gaussian for \(q = 1.0\), 1.5 and 1.513 together with Student’s t-distribution and Eversource Energy (ES) of S&P 500 normalized returns

Fig. 6
figure 6

a PDFs of q-Gaussian for \(q = 1.0\), 1.5 and 1.513 together with Student’s t-distribution and Coca-Cola (KO) of NYSE normalized returns, b PDFs of q-Gaussian for \(q = 1.0\), 1.5 and 1.513 together with Student’s t-distribution and NQ Mobile Inc. (NQ) of NASDAQ 100 normalized returns

Fig. 7
figure 7

CCDF plot for six stock returns with q-Gaussian distribution and Student’s t-distribution

6 Stock Price Distribution with Option Price

It has been observed that Glasserman (2013), stock price is governed by the following stochastic differential equation:

$$\begin{aligned} dS = \mu S dT + \sigma SdW, \end{aligned}$$
(18)

which is associated with a Wiener process W, and the parameters \(\mu\) and \(\sigma\) representing the drift and degree of fluctuation respectively. The solution of the above Eq. (18), gives rise to the Geometric Brownian Motion (GBM) (Albanese and Campolieti 2006) as,

$$\begin{aligned} {S_T} = {S_0}\exp \left\{ {\left( {\mu - \frac{{{\sigma ^2}}}{2}} \right) T + \sigma \sqrt{T} Z} \right\} ,\quad Z\sim N(0,1). \end{aligned}$$
(19)

The log-normal distribution of Eq. (19), is given by,

$$\begin{aligned} h\left( {\left. {{S_T}} \right| {S_0},T} \right) = \frac{1}{{{S_T}\sqrt{2\pi T} \sigma }}\exp \left[ { - \frac{1}{2}{{\left\{ {\frac{{\log \left( {{{{S_T}} / {{S_0}}}} \right) - \left( {\mu - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}} \right\} }^2}} \right] \end{aligned}$$
(20)

Hull (2006) have provided a famous option pricing model where stock return exhibits Gaussian distribution. However, from the empirical data, it has been shown that returns does not follow the Gaussian distribution which has been studied by Senapati and Karmeshu (2016) through six different stocks. Lahmiri (2016) proposed modeling and forecasting techniques for intra-day stock price which is highly fluctuating and non stationary in nature.

The q-lognormal distribution of stock price can be obtained using Eqs. (1) and (6), as,

$$\begin{aligned} {\psi _q}(\left. {{S_T}} \right| {S_0},T) = \frac{1}{{Z{S_T}}}{\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {{\left( {\frac{{\log \left( {{{{S_T}} / {{S_0}}}} \right) - \mu T}}{{\sigma \sqrt{T} }}} \right) }^2}} \right\} ^{\frac{1}{{1 - q}}}}, \end{aligned}$$
(21)

Following Černỳ (2009), we use the risk-neutral probability \(\mu = r - \frac{{{\sigma ^2}}}{2}\) in Eq. (21), and we have,

$$\begin{aligned} {\psi _q}(\left. {{S_T}} \right| {S_0},T) = \frac{1}{{Z{S_T}}}{\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {{\left( {\frac{{\log \left( {{{{S_T}} / {{S_0}}}} \right) - \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}} \right) }^2}} \right\} ^{\frac{1}{{1 - q}}}}. \end{aligned}$$
(22)

Fig. 8a, shows the plot of the well known lognormal distribution, along with the q-lognormal distribution Eq. (22) for \(q=1\).

Fig. 8
figure 8

a Plot of stock price distribution for \(q=1.0\) together with the log-normal distribution for \(S_{0}=10\), \(\sigma =0.3\), \(r=0.08\), and \(K=50\), b plot of stock price distribution for \(q=1.0\) together with the log-normal distribution

Figure 8, represents the plot of q-lognormal distribution Eq. (22), for three different values of parameter q. Now, we wish to determine the non-Gaussian option price corresponding to the given stock price distribution Eq. (22). The European call price (Albanese and Campolieti 2006; Hull 2006) under risk neutral measure Q is given by,

$$\begin{aligned} C\left( 0 \right) = {e^{ - rT}}{E^Q}\left[ {\left. {{{\left( {{S_T} - K} \right) }_ + }} \right| {\mathcal {F}}_{T}} \right] , \end{aligned}$$
(23)

where \({\mathcal {F}}_{t}\) represents filtration which depends on past information up to time T (Albanese and Campolieti 2006) and K is the strike price at time T. The non-Gaussian option price can be obtained by incorporating the proposed q-lognormal distribution Eq. (22) with Eq. (23), as,

$$\begin{aligned} C\left( 0 \right) = {e^{ - rT}}\int \limits _0^\infty {{{\left( {{S_T} - K} \right) }_ + }} \frac{1}{{Z{S_T}}}{\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {{\left( {\frac{{\log \left( {{{{S_T}} / {{S_0}}}} \right) - \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}} \right) }^2}} \right\} ^{\frac{1}{{1 - q}}}}d{S_T}, \end{aligned}$$
(24)

Substituting R = \(\log \left( {{{{S_T}} / {{S_0}}}} \right)\), we have,

$$\begin{aligned} C\left( 0 \right) = {e^{ - rT}}\int \limits _{ - \infty }^\infty {{{\left( {{S_T} - K} \right) }_ + }} \frac{1}{Z}{\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {{\left( {\frac{{R - \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}} \right) }^2}} \right\} ^{\frac{1}{{1 - q}}}}dR, \end{aligned}$$
(25)

Let \(y = \frac{{R - \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}\).

$$\begin{aligned} C\left( 0 \right) = \frac{{{e^{ - rT}}\sigma \sqrt{T} }}{Z}\int \limits _{ - \infty }^\infty {{{\left\{ {{S_0}{e^{\sigma \sqrt{T} y + \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}} - K} \right\} }_ + }} {\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {y^2}} \right\} ^{\frac{1}{{1 - q}}}}dy, \end{aligned}$$
(26)

The condition for non-zero option implies that the stock price should be greater than the strike price. Thus, we obtain,

$$\begin{aligned} {S_0}{e^{\sigma \sqrt{T} y + \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}} - K > 0. \end{aligned}$$
(27)

Using the above condition, Eq. (26), becomes,

$$\begin{aligned} C\left( 0 \right) = \frac{{{e^{ - rT}}\sigma \sqrt{T} }}{Z}\int \limits _{{y_1}}^\infty {\left( {{S_0}{e^{\sigma \sqrt{T} y + \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}} - K} \right) } {\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {y^2}} \right\} ^{\frac{1}{{1 - q}}}}dy, \end{aligned}$$
(28)

where \({y_1} = \frac{{\log \left( {{K / {{S_0}}}} \right) - \left( {r - \frac{{{\sigma ^2}}}{2}} \right) T}}{{\sigma \sqrt{T} }}\). The option price obtained in Eq. (28), can be represented in it’s closed form as,

$$\begin{aligned} C\left( 0 \right) = {S_0}{A_q}\left( {{y_1}} \right) - K{e^{ - rT}}{B_q}\left( {{y_1}} \right) . \end{aligned}$$
(29)

where \({A_q}\left( {{y_1}} \right) = \frac{{{e^{ - \frac{{{\sigma ^2}}}{2}T}}\sigma \sqrt{T} }}{Z}\int \limits _{{y_1}}^\infty {{e^{\sigma \sqrt{T} y}}} {\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {y^2}} \right\} ^{^{\frac{1}{{1 - q}}}}}dy\)   and \({B_q} \left( {{y_1}} \right) = \frac{{\sigma \sqrt{T} }}{Z}\int \limits _{{y_1}}^\infty {{{\left\{ {1 - \left( {\frac{{1 - q}}{{3 - q}}} \right) {y^2}} \right\} }^{^{^{\frac{1}{{1 - q}}}}}}}dy\)

Figure 9a, depicts the proposed q-Gaussian option price for the Eq. (29), along with Black–Scholes price for T = 1.

Fig. 9
figure 9

a Call price of Black–Scholes model with q-Gaussian price \((q\rightarrow 1.5)\) for \(T=1\), \(S_{0}=10\), \(\sigma =0.03\), \(r=0.08\), and \(K=50\), b call price of Black–Scholes model with q-Gaussian price \((q\rightarrow 1.5)\) for \(T=0.75\), \(S_{0}=10,\) \(\sigma =0.03\), \(r=0.08\), and \(K=50\)

Figure 9b, shows the proposed q-Gaussian option price Eq. (29), along with Black–Scholes price for T = 0.75. Figure 10a, illustrates the proposed q-Gaussian option price Eq. (29), along with Black–Scholes price for T = 0.5.

Fig. 10
figure 10

a Call price of Black–Scholes model with q-Gaussian price \((q\rightarrow 1.5)\) for \(T=0.5\), \(S_{0}=10\), \(\sigma =0.03\), \(r=0.08\), and \(K=50\), b call price of Black–Scholes model with q-Gaussian price \((q\rightarrow 1.5)\) for \(T=0.25\), \(S_{0}=10\), \(\sigma =0.03\), \(r=0.08\), and \(K=50\)

Figure 10b, depicts the proposed q-Gaussian option price Eq. (29), along with Black–Scholes price for T = 0.25. The convergence of the proposed q-Gaussian option price Eq. (29), for \(q=1\) with the famous Black–Scholes option price for different time periods are shown in Fig. 11.

Fig. 11
figure 11

The call price of Black–Scholes model with q-Gaussian price \((q\rightarrow 1)\) for T = [0.25:0.25:1], \(S_{0}=10\), \(\sigma =0.03\), \(r=0.08\), and \(K=50\)

7 Conclusion and Future Works

It has been an interesting and challenging task to study the dynamical behavior of intra-day high-frequency data. For this purpose, we exploited the significance of the powerful Tsallis’ framework to obtain q-Gaussian distribution for stock return by maximizing Tsallis entropy under first, second moment constraints along with the normalization constraint. The generalized JS divergence phenomena is employed to compute the parameter ‘q’ for various stocks. We have examined six high-frequency datasets listed in different indices for the validation of the proposed distribution through different ‘q’ values. For the sake of convenience, a precise algorithm has been proposed for the estimation of parameter ‘q’. The derived distribution has clearly explained the cubic power-law stylized fact of the above empirical stock returns for the corresponding non-extensive parameter q. For characterizing the tail fluctuation, we have successfully computed the CCDF for the underlying return distribution through Hypergeometric2F1 function. In addition to this, the corresponding stock distribution is derived in terms of q-lognormal distribution which is used to provide the non-Gaussian option price.