1 Introduction

With the development of 4G/5G communication technology, a large number of Social Network Services (SNSs) have emerged. Mobile operators and equipment providers pay more attention to the Quality of User Experience (QoE) of such services, and improve the user experience through more intelligent scheduling strategies [1]. Some scheduling strategies adopt Deep Packet Inspection (DPI) technology to identify the specific services to which packets belong, and then use different scheduling strategies according to different services. This reflects that the communications industry has recognized that different services impact the user experience in different ways.

Therefore, based on the service recognition and efficient traffic analysis method [2,3,4,5], an internet service provider can employ many new techniques to improve the user experience and save energy. For example, more complex scheduling strategies are employed to select better access networks [6, 7], and some resource allocation schemes are developed to save energy [8,9,10,11,12] and to more effectively route traffic [13] in complex networks. Some approaches are designed to deal with security problems [14] and measure network performance [15]. To the capacity of a wireless channel in a mobile network, the effective capacity concept is proposed [16] and is employed to measure the probability of a QoS outage [17,18,19]. Many emerging technologies are employed in SDN networks [20, 21], optical networks [22, 23], sensor networks [24, 25], and networks migration [26]. To evaluate these new techniques, we need to build a large number of test scenarios. However, the traditional evaluation system mainly measures the service quality of a large-scale business using indistinguishable general indicators. However, many test systems are still based on the data flow model [27, 28], and cannot effectively trigger intelligent scheduling strategies; therefore, it is impossible to evaluate the effectiveness of these intelligent scheduling strategies. Many communication enterprises have to use the manual dial-up method to verify a new scheduling strategy, but this method is unable to simulate large-scale scenarios.

A possible approach is to replay the real data packets that are captured from networks in a simulation system [15]. These packets might be generated by user actions, such as logins, sending messages, comments, and so on. A complex test scenario should consist of these actions according to user behavior. Therefore, the new problems are how the user behavior affects the QoE, and how to reconstruct user behavior using the captured data packets.

The remainder of paper is divided into five sections. We first introduce the related works about user behavior distribution analysis. Next, we give an approach to build the test scenarios based on the user behavior central distribution. Thirdly, we explore the SNS user behaviors to demonstrate the user behavior distribution such as the operation frequency, the size of the posted message, and the total number of operations. Then, we test how the user behavior affects the end user experience indicators. We conclude in the fifth section.

2 Related works

There are different user actions for different mobile SNSs. It is difficult to analyze the behavioral characteristics of such a large number of user actions and design test cases when building communication scenarios. However, in recent years, several mainstream services have formed a large proportion in the communication market, and thus the research focus can be narrowed.

As shown in Fig. 1, in the four mainstream mobile business markets, a few brands have occupied large market shares [29]. This trend of branding greatly reduces the business scope for user experience evaluations. In addition, the phenomenon of crowd gathering in similar businesses also brings opportunities for user behavior analysis.

Fig. 1
figure 1

Market penetration of the main SNSs in China

Through studying the communication records between operators’ terminal nodes, Y.Jin found that users have a strong degree of aggregation [30]. As shown in Fig. 2, the PEARSON correlation can be used to describe the correlation among users, and the correlation is scaled between [− 10, 10]. It is obvious that besides the low degree of user aggregation of real-time services such as FTP large file transfers and VoIP, SNSs show a very high degree of user aggregation. This phenomenon implies that users’ behavioral habits are likely to influence each other in the same type of user groups. This aggregation phenomenon will result in the centralized distribution of user behavior, thus forming some typical user behavior. Typical behaviors will have a greater impact on the characteristics of the scenario than other atypical behaviors, and this will make it possible to build complex scenarios using a few typical user behaviors.

Fig. 2
figure 2

Level of aggregation of user behaviors

Through the analysis of the user behavior characteristics of the main SNSs, some researchers found concentrated distributions of the common actions, action frequency and information length of users.

Facebook is a popular social platform across the world, and thus, it provides a wealth of data on user behaviors. F. Schneider studied the behavioral characteristics of Facebook users [31]. As shown in Fig. 3, there are only five frequent user actions. In addition, the action sequence also has certain rules to follow, as shown in Fig. 4.

Fig. 3
figure 3

Frequency actions on Facebook

Fig. 4
figure 4

Action sequences on SNS

Some studies suggest that the traffic model of a traditional communication network can be used as a reference to analyze the communication behaviors of computer network users [32]. This model will provide support for simulating the network communication scenarios with respect to user behaviors and testing the combined influence of user behaviors and scheduling strategies on user experience. In the model, the network user behaviors can be described by the Poisson distribution of parameter λ:

$$P_{u} \left( {i,t} \right) = \frac{{\left( {\lambda t} \right)^{i} }}{i!}e^{ - \lambda t}$$
(1)

where \(i\) is the number of services during \(t\). The amount and length of the data flow of each service follow the geometric distributions of \(E_{f}\) and \(E_{l}\), respectively:

$$P_{f} \left( n \right) = \frac{1}{{E_{f} }}\left( {1 - \frac{1}{{E_{f} }}} \right)^{n - 1}$$
(2)
$$P_{l} \left( n \right) = \frac{1}{{E_{l} }}\left( {1 - \frac{1}{{E_{l} }}} \right)^{k - 1}$$
(3)

where \(n\) and \(k\) are the number and length of the data flows, respectively. The network traffic \(n_{ij} \left( T \right)\) can thus be obtained based on the number of users \(\left( T \right)\), the number of data flows \(f_{i} \left( T \right)\) and the length of data flow \(n_{ij} \left( T \right)\) during \(\left( {0,T} \right)\):

$$N\left( T \right) = \mathop \sum \limits_{i = 1}^{u\left( T \right)} \mathop \sum \limits_{j = 1}^{{f_{i} \left( T \right)}} n_{ij} \left( T \right).$$
(4)

Then, the network traffic during \(\left( {t,t + \tau } \right)\) can be written as follows:

$$N_{t} \left( \tau \right) = N\left( {t + \tau } \right) - N\left( t \right).$$
(5)

In this traffic model, the parameter \(\lambda\) depends on the user density and usage habits of the users in the scene, \(E_{f}\) depends on the operation sequences and action frequencies of the users, and \(E_{l}\) is determined by the distribution of user actions. Due to the additivity of the Poisson flow, if there is a centralized distribution of the user behaviors, a few typical action sequences can be used to simulate the entire communication scenario and form impact scenarios that are similar to the real scenarios. In addition, since user habits are somewhat fixed, typical user behaviors have a higher degree of reusability. The model in Fig. 5 can be adopted to collect real user communication data, and form a simple script describing the user behaviors to build a complex communication scenario. Through setting the ratio among the typical users, we build the scenario and illustrate the process in Fig. 6. Then, we can evaluate the QoE in this simulation scenario.

Fig. 5
figure 5

Building a complex scenario

Fig. 6
figure 6

Testing process

The key point of using the above model to evaluate user experience is whether the user action frequency and the amount of data that is triggered by the action obey a centralized distribution, and whether the user behavior impacts the QoE indicators.

3 User behavior analysis of an SNS

Using the above model, it is possible to reconstruct a real scene with a small number of typical user behaviors if the data length and frequency of user behaviors obey a central distribution. Related studies have found centralized distributions of user behaviors in SNSs. In this paper, we analyze the online times and operation frequencies of SNS users. To observe the impacts of user behaviors on QoE, we also test the appreciable indicators of the end users in different scenarios with different user densities and behaviors.

3.1 Central distribution of the online frequency

To analyze the distribution of the online frequency for an SNS, we survey 35 college students on their use counts per day. As shown in Fig. 7, the results show that this special user group has similar usage habits.

Fig. 7
figure 7

Online frequency of Weibo

Figure 8 shows the distribution of users’ usage times for Weibo on campus. This implies a centralized distribution of usage habits in this special communication scenario.

Fig. 8
figure 8

Online time of Weibo

3.2 Central distribution of user data

We collect the Weibo information of 60,000 users. We compute the average number of posts that these users published per day. The number of tweets per person per day was calculated. As shown in Fig. 9, for most users, they make less than 10 daily posts, and the distribution has a long tail.

Fig. 9
figure 9

Number of daily posts

In addition, the length of the post is also restricted by the usage habits. In this paper, the length of the 2622 Weibo posts without links is analyzed in Fig. 10, where the vertical coordinate is the number of blog posts and the horizontal coordinate is the length of the posts.

Fig. 10
figure 10

Distribution of post length

As shown in Fig. 10, the lengths of most posts are between 10 and 50 bytes. The oscillation in Fig. 10 is caused by the double-byte representation of Chinese characters in the computer. When Fig. 11 retains only the double-byte data, the oscillation disappears. This result shows a centralized, heavy-tailed distribution.

Fig. 11
figure 11

distribution of post lengths (only even bytes)

4 Impact of the scenario on QoE indicators

We survey the degree to which the communication scenario affects the user experience. In this paper, the appreciable indicators of the SNS are evaluated in an urban area. The cellular network access is provided by the same communication operators. Therefore, the infrastructure conditions of the tests are similar, and the performance fluctuations should be caused by the behavioral features of the different user groups, including the user density, operation frequency, and other operation habits.

To quickly test and analyze various user perception indicators, we define the user perception indicators of SNSs, such as QQ, MSN and Weibo. The algorithm for calculating these indicators is shown in Fig. 12. Firstly, the key attributes are extracted from the network packets. Then, the key messages are identified according to the combination of key attributes. Finally, important attributes such as the time stamps and serial numbers of key messages are used to calculate user perception indicators (e.g., post delay, login success rate, etc.).

Fig. 12
figure 12

Framework of the automatic evaluation method

By using the above algorithm, the user perception indicator is measured in real scenes, and the significant variances in user groups among scenarios are helpful for observing the impacts of user behavior on the QoE indicators.

Figures 1316 show the delay test results of QQ messages that are sent in different scenarios. The test results show that the distribution ranges of the delays are approximately the same. However, the number of delays with a significant deviation from the main distribution range is obviously different. In particular, Fig. 16 shows that the delay difference between the different time periods in the same place is also obvious. The test results indicate that user behavior is key factor of communication scenarios, which has a certain impact on the QoE indicators.

Fig. 13
figure 13

QQ message delays in a shopping mall

Fig. 14
figure 14

QQ message delays in an exhibition center

Fig. 15
figure 15

QQ message delays in the International Finance Centre (indoors)

Fig. 16
figure 16

QQ message delays in the International Finance Centre (outdoors)

In the same area, the communication capacities of the communication operators are more similar. In different scenarios, the group behaviors of users are different, and the usage habits are thus different. Therefore, the degree to which the user behavior affects QoE indicators can be observed. In this paper, we chose to test the delay of posting Weibo comments through a cellular network under different scenarios in a large research and development base. The scenarios include an office area, experiment area, restaurant, lounge, and so on. Except that the restaurant is significantly more densely populated than other areas, all other areas have similar user densities. In addition, the communication operators are known to identify data packets and adopt special scheduling strategies for SNSs.

The test results in Fig. 17 also indicate that in the case of the same user density and a similar communication capacity, user behavior causes delayed fluctuations. Because a delay jitter bigger than 0.1 s is appreciable, the delay fluctuation in Fig. 17 is obvious for users.

Fig. 17
figure 17

Average post delays at 22 test points (5 tests for each point)

5 Conclusions

In this paper, we mainly analyze the distribution of user behavior for an SNS, and the degree of impact of user behavior on QoE indicators. The investigation results show the centralized distribution of user behavior. The test results in real scenarios imply that the user behavior actually affects the user experience. Therefore, we propose that a QoE test scenario can be built using the typical user behaviors. The relationship between user behavior and the network load should be further studied to guide the test case design.