Social Media Types: introducing a data driven taxonomy

Koukaras, Paraskevas; Tjortjis, Christos; Rousidis, Dimitrios

doi:10.1007/s00607-019-00739-y

Published: 03 July 2019

Volume 102, pages 295–340, (2020)

Computing


Paraskevas Koukaras¹,
Christos Tjortjis ORCID: orcid.org/0000-0001-8263-9024¹ &
Dimitrios Rousidis¹


Abstract

Social Media (SM) have been established as multifunctional networking tools that tend to offer an increasingly wider variety of services, making it difficult to determine their core purpose and mission and, therefore, their type. This paper assesses this evolution of Social Media Types (SMTs), and presents and evaluates a novel hypothesis-based, data-driven methodology for analyzing Social Media Platforms (SMPs) and categorizing SMTs. We review and update literature regarding the categorization of SMPs, based on their services. We develop a methodology to propose and evaluate a new taxonomy, comprising: (i) the hypothesis that the number of SMTs is smaller than what current literature suggests, (ii) observations on data regarding SM usage and (iii) experimentation using association rules and clustering algorithms. As a result, we propose three (3) SMTs, namely Social, Entertainment and Profiling networks, typically capturing emerging SMP services. Our results show that our hypothesis is validated by implementing our methodology and we discuss threats to validity.


1 Introduction

People around the world use Social Media (SM) to communicate, connect and interact with other users, sharing and propagating information at a great rate [1]. SM facilitate sharing information, ideas, interests and other forms of expression through virtual communities and networks [2]. There is a great variety of services offered having many common features [3]. SM are considered interactive Internet-based applications [4]. SM are full of user-generated data, such as posts, photos, videos and so on. They offer user accounts (profiles) on websites and mobile apps, facilitating the generation of web based social networks, connecting users or groups [5].

A Social Network (SN) is a social structure consisting of several actors/entities/groups of entities, that describe a variety of interactions among them. Studies like the one reported in [6] present taxonomies for SN, which describe the spectrum of attributes that relate to these systems. They provide a reference point for different system compositions, aiming at capturing their building blocks, whilst examining the architectural designs and business models they might pose.

SN offer different techniques for analyzing the structure of social atoms (entities), as well as a set of theories for understanding and recognizing patterns hidden in them [7]. Such patterns can be local or global, which can be further analyzed in order to mine special entities that might influence others or examine characteristics of parts or the whole network [8].

During the early years of SM networking, Social Media Platforms (SMPs) had a clear vision statement. Nowadays, most SM provide services and functionalities using different names. SM users take advantage of services such as connecting, sharing, entertaining, monetizing etc., seeking to detect brand awareness indicators, usage for sales, feedback, opinions and more, before approaching specific target groups. Figure 1 shows the number of SM users worldwide since 2010, along with estimated numbers for up to 2021. Categorizing SMPs helps address appropriate groups and improves our understanding of SM, whilst getting better results from each platform/site. New opportunities arise for research and improvements based on new data at our disposal. Although SM networking is considered a new field of study, more and more researchers work on it, due to its wide user adoption [9].

SM data types are highly dependent on typical user activities. Various characteristics and implications of SM often lead to confusion regarding data handling [10]. Therefore, our work aims to elaborate on Social Media Types (SMTs), updating current literature, as well as to introduce new perspectives on SMPs' multiple feature offerings.

While we refer to SMTs and networks, we survey and categorize the most common such types and we research an update to their current standardization. To achieve that, we extract from SMTs the features and services that we refer to as “Utilities”, and develop a methodology based on our initial hypothesis H₀ (“standard SMTs can be narrowed down to a smaller number n”), which is later backed up by further elaboration on our SM feature dataset.

We report on SM evolution and how we can use a data-driven approach in order to generate a new SMT taxonomy. This is significant because SM offer an increasingly wider variety of services, making it difficult to determine their core purpose and mission and, therefore, their type. This paper assesses SMT evolution, and presents and evaluates a novel hypothesis-based, data-driven methodology for analyzing SMPs and categorizing SMTs based on their services.

As a result of our first experiment (Experiment #1, detailed in Sect. 4.2) we propose five (5) SMTs, which we argue are better aligned with the current state of play in SM than categorizations proposing nine (9) [11] or seven (7) [2] SMTs. Yet, when comparing these early results with work proposing three (3) SMTs [4], we conclude that a tighter categorization scheme is needed.

Thus, we conduct further research, striving for better results. With Experiment #2 we came up with four (4) clusters which can be interpreted as four (4) SMTs. Finally, we present an insight into the merged version of the two (2) experiments, which proposes a new categorization that consists of three (3) SMTs, namely: Social networks, Entertainment networks, and Profiling networks, typically capturing emerging SMP services.

The remainder of this paper is structured as follows: Literature review (Sect. 2) presents the state of the art on SMTs. Methodology (Sect. 3) defines our problem, methods, dataset, observations and research process. Experiments (Sect. 4) presents experimental results, while Research summary (Sect. 5.1) discusses key findings, relating them to H₀, and presents important extracts from our research. The rest of the Conclusions (Sect. 5.2 and Sect. 5.3) discusses results, assesses the importance of our work along with biases and threats to validity, and presents directions for future work.

2 Literature review

There are various approaches when dealing with a new taxonomy proposal. For example, Engelbrecht et al. categorize data-driven business models based on three points: the data source, the target audience and the technological effort [12]. Then, they propose eight (8) categories of business models. Our work aims to research categories of SM (SMTs), a rather untapped topic regarding SM.

Social theories describe the Social Atom, an individual, interacting with the Social Molecule, the community; together they give rise to seven (7) probable building blocks of SM (Identity, Conversations, Sharing, Presence, Relationships, Reputation, Groups) [2]. A categorization of SM sites (and by extension SMTs) such as blogs, social media sites, and virtual game worlds can be found in [4]. The classification is based on purpose and functionality. Nine (9) types of Social Media are identified [11]:

1. Online Social Networking: Web-based services that allow individuals and communities to connect with real world friends and acquaintances online. Users interact with each other through status updates, comments, media sharing and messages. Examples: Facebook, Myspace, LinkedIn.
2. Blogging: Journal-like websites for users to contribute textual and multimedia content, arranged in a reverse chronological order. Blogs are generally maintained by an individual or by a community. Examples: Huffington Post, Business Insider, Engadget, WordPress.com, Medium.
3. Micro-blogging: Same as blogs, but with limited content. Examples: Twitter, Tumblr, Plurk.
4. Wikis: Collaborative editing environment that allows multiple users to develop Web pages. Examples: Wikipedia, Wikitravel, Wikihow.
5. Social news: Sharing and selection of news stories and articles by communities of users. Examples: Digg, Slashdot, Reddit, Quora.
6. Social book-marking: Allows users to bookmark Web content for storage, organization, and sharing. Examples: Delicious, StumbleUpon.
7. Media sharing: Sharing of media on the Web including video, audio, and photos. Examples: YouTube, Flickr, UstreamTV.
8. Opinion, reviews and rating: The primary function of such sites is to collect and publish user submitted content in the form of subjective commentary on existing products, services, entertainment, businesses and places. Examples: Epinions, Yelp, Cnet, Zomato, TripAdvisor.
9. Answers: Platforms for users seeking advice, guidance or knowledge to ask questions. Other community users can answer these questions based on previous experiences, personal opinions or relevant research. Answers are generally judged using ratings and comments. Examples: Yahoo! answers, WikiAnswers.

3 Methodology

In this section we analyze our methodology, including the problem definition, our methods, the data set, some key research observations and the corresponding process.

3.1 Problem definition

The current standardization of SMT categories (like the ones presented in [2, 4, 11]) is considered to be decaying, since SMTs develop rapidly on platforms that offer various services and multiple features that we label as Utilities. Our aim is to introduce a new taxonomy that narrows down the current SMT standardization, since most modern SMPs tend to offer multiple Utilities within a single platform/product. Therefore, we investigate this issue, expecting to offer another option regarding SMTs. Our methodology takes into consideration our observations (Sect. 3.4) on a dataset that contains different SM alongside their official features. We perform two (2) experiments (reported in Sect. 4) involving association rule mining and clustering in order to unfold a data-driven methodology that validates our summarized research question: “Can the current state of the art on SMTs (Sect. 2) be updated by reducing the number of SMT standards; thus, better reflecting the current state of play?”

3.2 Methods

It should be noted that there are numerous data mining functions to choose from; two prominent ones are association rules and clustering, implemented by a variety of algorithms [13, 14]. We used RapidMiner (see Footnote 1) [17] for experimentation, because it contains all the algorithms we want to utilize for our experiments. The following subsections contain a short introduction to unsupervised learning (like clustering) and association rule mining with brief descriptions of key algorithms, as well as details about the methods we employed for our experiments.

3.2.1 Association rule mining

Association rule mining [18] is a machine learning method for discovering relations between variables in large databases [19]. The intention here is to identify strong rules in databases using some measures of interest, like confidence and support [20]. There are exhaustive and heuristic association rule algorithms, like Apriori [21], a prominent algorithm for mining frequent itemsets for Boolean association rules, and FP-Growth [22], which is detailed in this subsection. There is also ARMICA [14], a novel ARM method based on the heuristic Imperialism Competitive Algorithm (ICA), which finds frequent itemsets and extracts rules from datasets whilst setting support automatically. In this paper we use two (2) measures in order to find interesting rules from the dataset: minimum support and confidence.

Let I = {i₁, i₂,…, i_n} be a set of n binary attributes called items. Let D = {t₁, t₂,…, t_m} be a set of transactions called the database. Each transaction in D has a unique transaction ID and contains a subset of the items in I. A rule is defined as an implication of the form X ⇒ Y where X, Y ⊆ I and X ∩ Y = ∅. The sets of items (itemsets) X and Y are called antecedent (left-hand-side or LHS) and consequent (right-hand-side or RHS) of the rule [23]. In order to select interesting rules from the set of all possible rules, constraints on various measures of significance and interest can be used. The best-known constraints are minimum thresholds on support and confidence.

Definition of Support [24]

The support supp(X) of an itemset X is defined as the proportion of transactions in the dataset which contain the itemset.

Definition of Confidence [24]

Confidence can be interpreted as an estimate of P(Y | X), i.e. the probability of finding the RHS of the rule in transactions under the condition that these transactions also satisfy the LHS, or the measure that indicates how often the rule is true. The confidence of a rule is defined as:

$$ \text{conf}(X \Rightarrow Y) = \frac{\text{supp}(X \cup Y)}{\text{supp}(X)}. \tag{1} $$
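As a hedged illustration of the two definitions, the following Python sketch computes support and confidence on a toy transaction database. The seven transactions are a hypothetical sample of platform utility sets chosen for illustration, and the function names `support` and `confidence` are assumptions of this sketch, not part of RapidMiner or of the tooling used in the paper.

```python
from fractions import Fraction

# Toy database D: each transaction is the set of utilities one platform offers.
# Hypothetical sample, chosen only to illustrate the definitions.
transactions = [
    {"Connecting", "Multimedia"},
    {"Connecting", "Multimedia"},
    {"Connecting", "Multimedia"},
    {"Multimedia", "Sharing"},
    {"Multimedia", "Sharing"},
    {"Connecting", "Multimedia", "Sharing", "News"},
    {"Connecting", "Schedule"},
]


def support(itemset, database):
    """Proportion of transactions that contain every item of `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in database if itemset <= t)
    return Fraction(hits, len(database))


def confidence(lhs, rhs, database):
    """conf(X => Y) = supp(X u Y) / supp(X); undefined when supp(X) is 0."""
    return support(set(lhs) | set(rhs), database) / support(lhs, database)


print(support({"Connecting"}, transactions))                     # 5/7
print(confidence({"Connecting"}, {"Multimedia"}, transactions))  # 4/5
```

Exact fractions are used instead of floats so that the ratio in Eq. (1) can be read off directly; a rule would then be kept only if both values meet the chosen minimum thresholds.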

FP-Growth [22] was used in Experiment#1 (Sect. 4.2). The algorithm first counts the occurrences of items in the dataset and stores them in a header table. It then builds the FP-tree structure (“a compact structure that stores quantitative information about frequent patterns in a database”) [25] by inserting instances. Items in each instance are sorted in descending order of their frequency in the dataset for faster tree processing. A coverage threshold is applied and all items that do not meet it are removed. Recursive processing of this compressed version of the dataset grows large itemsets directly, instead of generating candidate items and testing them against the entire database. After a few more steps [22] the recursive process is finalized, the largest sets of items with minimum coverage have been found, and association rule creation begins [26].
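The FP-Growth procedure described above can be sketched compactly in Python. This is a minimal, unoptimized illustration under stated assumptions: it is not RapidMiner's implementation, the names `FPNode`, `build_tree` and `fp_growth` are hypothetical, the threshold is given as an absolute count (`min_count`) rather than a proportion, and the toy transactions are invented for the example.

```python
from collections import defaultdict


class FPNode:
    """One node of the FP-tree: an item, its count, and links to parent/children."""
    def __init__(self, item, parent):
        self.item, self.parent = item, parent
        self.count = 0
        self.children = {}


def build_tree(weighted_transactions, min_count):
    """Count items, drop infrequent ones, and insert the sorted transactions."""
    freq = defaultdict(int)
    for items, weight in weighted_transactions:
        for item in set(items):
            freq[item] += weight
    freq = {item: c for item, c in freq.items() if c >= min_count}

    root = FPNode(None, None)
    header = defaultdict(list)  # header table: item -> nodes holding that item
    for items, weight in weighted_transactions:
        # Sort by descending frequency (ties broken by name) so prefixes are shared.
        ordered = sorted((i for i in set(items) if i in freq),
                         key=lambda i: (-freq[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = FPNode(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += weight
            node = child
    return header, freq


def fp_growth(weighted_transactions, min_count, suffix=frozenset()):
    """Return {frozenset(itemset): support count} for all frequent itemsets."""
    header, freq = build_tree(weighted_transactions, min_count)
    result = {}
    for item, nodes in header.items():
        itemset = suffix | {item}
        result[itemset] = freq[item]
        # Conditional pattern base: the prefix path of every node holding `item`.
        conditional = []
        for node in nodes:
            path, parent = [], node.parent
            while parent is not None and parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                conditional.append((path, node.count))
        # Recurse on the conditional database instead of generating candidates.
        result.update(fp_growth(conditional, min_count, itemset))
    return result


# Hypothetical toy data: utility sets of seven platforms.
transactions = [
    {"Connecting", "Multimedia"}, {"Connecting", "Multimedia"},
    {"Connecting", "Multimedia"}, {"Multimedia", "Sharing"},
    {"Multimedia", "Sharing"},
    {"Connecting", "Multimedia", "Sharing", "News"},
    {"Connecting", "Schedule"},
]
mined = fp_growth([(t, 1) for t in transactions], min_count=3)
for itemset, count in sorted(mined.items(), key=lambda kv: -kv[1]):
    print(sorted(itemset), count)
```

Rules can then be derived from each frequent itemset by splitting it into LHS and RHS and keeping the splits whose confidence meets the chosen minimum.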

3.2.2 Clustering

Clustering is an unsupervised learning method, which creates groups from datasets that consist of objects or entities that are characterized by similar or identical attribute values, but are adequately different from entities that belong to other clusters [13]. For running a clustering algorithm, we need to specify the distance measure (e.g. Euclidean, Manhattan, Jaccard, Cosine distances) [27]. After that, clustering methods often continue with the process of object selection and a method for evaluating the results [28]. For evaluation we can use quality measures like cohesiveness (measure for object-to-object distance), separateness (measure for cluster-to-cluster distance) and silhouette index (mix of cohesiveness and separateness) [29].
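For readers unfamiliar with the distance measures named above, a minimal Python sketch of their textbook definitions follows. The function names are illustrative assumptions; the sketch does not reproduce RapidMiner's measure types (such as Mixed Euclidean Distance), whose exact definitions are not given in this text.

```python
import math


def euclidean(a, b):
    """Straight-line distance between two numeric vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def jaccard_distance(a, b):
    """1 - |A n B| / |A u B| for two sets (0.0 when both sets are empty)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return 1 - len(a & b) / len(a | b)


def cosine_distance(a, b):
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1 - dot / (norm_a * norm_b)


print(euclidean((0, 0), (3, 4)))  # 5.0
print(manhattan((0, 0), (3, 4)))  # 7
print(jaccard_distance({"Connecting", "Multimedia"}, {"Multimedia", "Sharing"}))
print(cosine_distance((1, 0), (0, 1)))  # 1.0
```

Set-based measures such as Jaccard suit presence/absence data (which utilities a platform offers), whereas the vector measures suit per-utility counts.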

Clustering algorithms that we use in our experiments (specifically, Experiment#2, Sect. 4.3) are:

Density-based spatial clustering of applications with noise (DBSCAN) [30]

It is density-based, meaning that given a set of points in some space, it tries to group together points that are packed together, labeling outlying points that are alone in low-density regions. It functions on three (3) abstract steps [31]:

1. Find the points in the ε (eps) neighborhood of every point and identify the core points with number of neighbors more than minPts.
2. Find the components that are connected with core points on the neighboring graph, without taking into consideration non-core points.
3. Assign every non-core point to a nearby cluster if the cluster is an ε (eps) neighbor, else assign it to noise.

For the RapidMiner [17] implementation of this algorithm, we used epsilon = 1 (range: real, 0.0 to +∞; default: 1), which specifies the size of the neighborhood, and min points = 5 (range: integer, 1 to +∞; default: 5), which specifies the minimum number of points forming a cluster. As for measure types, there are four (4) options: Mixed Measures, Nominal Measures, Numerical Measures and Bregman Divergences. The last two (2) cannot be used since our dataset does not contain numerical attributes. So, out of the remaining two (2) groups of measure types we chose Mixed Measures, and specifically the Mixed Euclidean Distance, for two (2) reasons: (a) Nominal Measures comprise Nominal Distance, Dice Similarity, Jaccard Similarity, Kulczynski Similarity, RogersTanimoto Similarity, RussellRao Similarity and Simple Matching Similarity, all of which form two (2) clusters with no reasonable results, except for Nominal Distance, which produces exactly the same results as Mixed Euclidean Distance; and (b) according to RapidMiner user statistics, 79% of users utilize the Mixed Euclidean Distance measure, which in our case outperforms the rest of the measures.
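The three abstract DBSCAN steps can be sketched in plain Python as follows. This is an illustrative sketch, not RapidMiner's operator: it assumes numeric points with ordinary Euclidean distance (not the Mixed Euclidean Distance used in the experiments), it treats a point as core when its ε-neighborhood, counting the point itself, holds at least `min_pts` points (conventions differ on this detail), and the toy points and the smaller `min_pts` value are hypothetical.

```python
import math


def dbscan(points, eps, min_pts):
    """Return one label per point: 0, 1, ... for clusters and -1 for noise."""
    n = len(points)
    # Step 1: eps-neighborhoods (each point is its own neighbor) and core points.
    neighbors = [
        [j for j in range(n) if math.dist(points[i], points[j]) <= eps]
        for i in range(n)
    ]
    core = [len(nb) >= min_pts for nb in neighbors]

    # Step 2: connected components of core points on the neighborhood graph.
    labels = [None] * n
    cluster = 0
    for i in range(n):
        if not core[i] or labels[i] is not None:
            continue
        labels[i] = cluster
        stack = [i]
        while stack:
            p = stack.pop()
            for q in neighbors[p]:
                if core[q] and labels[q] is None:
                    labels[q] = cluster
                    stack.append(q)
        cluster += 1

    # Step 3: attach each non-core point to a neighboring core point's cluster,
    # or mark it as noise when no core point lies within eps.
    for i in range(n):
        if labels[i] is None:
            core_neighbors = [q for q in neighbors[i] if core[q]]
            labels[i] = labels[core_neighbors[0]] if core_neighbors else -1
    return labels


# Hypothetical toy data: two dense groups and one isolated point.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
          (10, 10), (10, 11), (11, 10),
          (50, 50)]
print(dbscan(points, eps=1, min_pts=3))  # [0, 0, 0, 0, 0, 1, 1, 1, -1]
```

In the second group only (10, 10) is a core point; the other two are border points that join its cluster in step 3, while (50, 50) has no core neighbor and is labeled noise.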

k-Medoids

k-Medoids is a clustering algorithm related to k-means and the medoidshift algorithm [32]. Both k-means and k-Medoids partition the dataset, and attempt to minimize the distance between points labeled to belong to a cluster and a point designated as the center of the cluster. Running this algorithm in RapidMiner we used the following default parameter values: max runs = 10, max optimization step = 100. We also tried other values, but they produced the same or poorer results. Regarding the measure type, we used Mixed Euclidean Distance, as we did with DBSCAN.
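The partitioning idea behind k-Medoids can be illustrated with a short Python sketch that alternates between assigning points to their nearest medoid and re-selecting each cluster's medoid. Several simplifications are assumptions of this sketch rather than RapidMiner behavior: the first k points serve as deterministic initial medoids (instead of repeated random runs), plain Euclidean distance replaces Mixed Euclidean Distance, all points are assumed distinct, and the names `assign` and `k_medoids` are hypothetical.

```python
import math


def assign(points, medoids):
    """Map each medoid index to the indices of the points closest to it."""
    clusters = {m: [] for m in medoids}
    for i, p in enumerate(points):
        nearest = min(medoids, key=lambda m: math.dist(p, points[m]))
        clusters[nearest].append(i)
    return clusters


def k_medoids(points, k, max_steps=100):
    """Alternate assignment and medoid update until the medoids stop changing."""
    medoids = list(range(k))  # naive initialization: the first k points
    for _ in range(max_steps):
        clusters = assign(points, medoids)
        # New medoid of a cluster: the member with the smallest total distance
        # to all other members (an empty cluster keeps its old medoid).
        new_medoids = [
            min(members,
                key=lambda c: sum(math.dist(points[c], points[j]) for j in members))
            if members else m
            for m, members in clusters.items()
        ]
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, assign(points, medoids)


# Hypothetical toy data: two well-separated groups of three points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
medoids, clusters = k_medoids(points, k=2)
print(medoids)   # [0, 3]
print(clusters)  # {0: [0, 1, 2], 3: [3, 4, 5]}
```

Unlike k-means, each cluster center is an actual data point, which is why the method also works with distance measures that do not allow averaging.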

Random-Clustering [33]

It generates simple and uniform random partitions. It has a single parameter controlling the partition of a random permutation into its cycles. The limit distribution of the size index of the generated partition is the join of the independent Poisson distributions with means determined by the size and the parameter. As for RapidMiner’s parameters, in this algorithm the only one required is the number of clusters to be formed (more in Sect. 4.3).

3.3 Dataset

The dataset used for our methodology contains various SMPs; the choice is based on ranking regarding active monthly users, using the expanded and merged version of Table 2 and “Appendix A”. We consider a platform’s user penetration, as well as the variety of its official features, as the most important attributes when selecting a candidate platform for our methodology. The dataset is built and populated with data retrieved from the official sites of each of the 112 SMPs we review. Some platforms with smaller user penetration implement fewer features. Clearly the list is not exhaustive, given the volatile nature of SM popularity and feature base. We use data pre-processing techniques such as removing duplicates and missing values, or data transformation and reduction as needed to normalize our research dataset (further explained in Observation#1 below).

Having presented the most common SMTs in Sect. 2, Table 1 summarizes the top fifteen (15) ranked SM information networks with regards to active users [34].

Table 1 SM ranking by active users

Appendix A: The complete set of 112 SM sites

SM sites
Facebook	Gab	Cross.tv	Plurk
YouTube	Telegram	Flixster	LiveJournal
Instagram	Tagged	Gaia Online	Weibo
Twitter	Myspace	BlackPlanet	Qzone
Reddit	Badoo	MyMFB	QQ
Vine	Stumbleupon	Care2	Baidu
Pinterest	Foursquare	CaringBridge	Line
Ask.fm	MeetMe	GoFundMe	YY
Tumblr	Skyrock A192	Tinder	Sprybirds
Flickr	Pinboard	Crokes	Xing
Google+	Kiwibox	Goodreads	VampireFreaks
LinkedIn	Twoo	Internations	CafeMom
VK	Yelp	PlentyofFish	Ravelry
ClassMates	Snapfish	Minds	ASmallWorld
Meetup	Photobucket	Nexopia	ReverbNation
WhatsApp	Shutterfly	Glocals	SoundCloud
Messenger	500px	Academia.edu	Solaborate
Snapchat	DeviantArt	Busuu	eToro
Quora	Dronestagram	English, baby!	Xanga
GirlsAskGuys	Fotki	Italki.com	Ryze
Nextdoor	Fotolog	Untappd	Zynga
ProductHunt	Imgur	Doximity	Habbo
AngelList	Pixabay	Wayn	FunnyOrDie
Kickstarter	WeHeartIt	CouchSurfing	Tout
WeChat	43Things	TravBuddy	Classmates
Skype	Path	Tournac	MyHeritage
Viber	Uplike	Cellufun	MocoSpace
Viadeo	Last.fm	23andMe	Ancestry.com

Appendix B: Mapping official features to utilities

Utility	Official features
Connecting (Count = 52)	Fans, Groups, Live Chat, Pokes, Gifts, Messaging, Explore, Instagram Direct, Direct Messaging, Discussion Website, Exploring, Profiles, Messaging to Blogs, Accounts, User Profiles, Circles, Communities, Collections, Emails, User Profile Network, Influencers, Synchronization with Other Social Networks, SMS Service, Members, Neighbors, Chatting, Drafts, Secret chats, Voice Calls, Bands, Dating, Mothers, Weaving, Christian, Talent, Muslims, Activists, Political, Authors, Expats, Follow, Teenagers, Celebrities, Relatives, User Groups, Messages, Group and Voice Chat, Video conferences, Conversations, Chat features
Multimedia (Count = 29)	Photos, Videos, Text, Upload and download options for Photos, Playback Upload Quality and formats, Live Streaming, 3D Videos, 360o Videos, Images, Live Videos, Photographic Filters, Record Short Video Clips, Ability To “Revine” Videos on A Personal Stream, Stream, Photography, Voice, Image Filters, Short videos, Gab, Cloud-Based Messages, Audio, Files, Musicians, Crocheting, Photoblog, VideoBlog, AudioBlog, Pictures
Professional (Count = 36)	Monetization, Licensing, Job Listings, Online Recruiting, For-Pay Research, Snapcash, Products, Startups, Investors, Funding, Channels, Enterprises, Purchases, Home Services, Drones, Knitting, Environmental, Treatments, Medical, Illness, Funding, Rewards, Academics, Papers, Teaching, Language, Health, Business, Promoting, Companies, Technology, Trading, Stock offering, Virtual Currency, Video Streaming for money, Video tutorials for money
Sharing (Count = 23)	Post Text, Instagram Stories, Tweet, Retweet, Links, Hashtags, Sharing Content, Protected Posts, Pins, Boards, Send Questions, Queue, Tags, Questions, What’s Hot, Post to And Read Community Boards, Post, Content Discovery, Location, Inspiration, Spinning, Sharing, Posting, Quoting
Entertainment (Count = 17)	Games, Shopping, Gaming, Art, Music, Culture, Travel, Luxury, Movies, Animes, Books, Comedy, Online Social Gaming, Gamers, Concerts, Fashion, Sports
Opinions (Count = 15)	Polls, Answers, Suggest Edits, Feeds, Recommendations, Reviews, Advice, Recommendation, Discussions, Forums, Opinions, Reviews, Discussion forums
Profile (Count = 13)	Wall, Calendar, Embedded in Profile, Skills, Memories, Bookmarking, Goals, Career, Records, Professional Profiles, Profile, Journals, Diaries
Publishing (Count = 11)	Dashboard (Blog Posts), Google+ Page, Locations, Google Local, Publishing Platform, Blog, Blogging, Weblog, Pulse, Blogs, Microblogging
Applications (Count = 15)	Apps, Stand-alone Apps, Third-party Services, HTML editing, Interaction and compatibility, Filtering, Additional features, Deprecated Features, Applications, External, Third Party Applications, Mobile, SMS, Bots, third party development
Schedule (Count = 8)	Organization, View Information About Upcoming Reunions, Organize Meetups, Events, Activities, Planning, Event, Event coordination
Privacy (Count = 6)	Classified section, Access control, Identity Service, Privacy, Security and Technology, Enhanced Privacy
Voting (Count = 7)	Likes, Web Content Rating, Voting, +1 Button, Like Buttons, Upvote/Downvote, Stickers
News (Count = 7)	News Feed, Status, Follow People and Trending Topics, Social News Aggregation, Following, News, Tech News
Promoting (Count = 4)	Fan Pages, Links, Advertising, Ad-Free

Appendix C: Utility occurrences on the SM dataset

No.	SM sites	Connecting	Multimedia	Professional	Sharing	Entertainment	Opinions	Profile	Publishing	Applications	Schedule	Privacy	Voting	News	Promoting
1	Facebook	7	4	–	–	1	1	1	–	1	–	1	1	2	2
2	YouTube	–	6	–	1	–	–	–	–	–	–	–	–	–	–
3	Instagram	2	3	1	1	–	–	–	–	2	–	–	–	–	–
4	Twitter	1	2	–	4	–	–	–	–	–	–	–	–	1	–
5	Reddit	1	1	–	3	–	–	–	–	–	–	–	2	1	–
6	Vine	–	2	–	1	–	–	–	–	–	–	–	–	–	–
7	Pinterest	1	2	–	2	–	–	–	–	–	–	–	–	1	–
8	Ask.fm	1	–	–	1	–	–	–	–	–	–	–	–	–	1
9	Tumblr	1	–	–	4	–	–	–	2	1	–	–	–	–	–
10	Flickr	1	2	1	–	–	–	–	–	2	1	1	–	–	–
11	Google+	5	2	–	1	–	–	1	3	2	–	2	1	–	–
12	LinkedIn	3	–	3	–	–	–	2	1	4	–	1	–	–	1
13	VK	4	–	–	–	–	–	–	–	–	–	1	1	1	–
14	ClassMates	2	–	–	1	–	–	–	–	–	1	1	–	–	–
15	Meetup	2	–	–	–	–	–	–	–	–	1	–	–	–	–
16	WhatsApp	1	3	–	–	–	–	–	–	–	–	–	–	–	–
17	Messenger	1	3	–	–	–	–	–	–	–	–	–	–	–	–
18	Snapchat	–	3	1	–	–	–	1	–	–	–	–	–	–	–
19	Quora	–	–	–	–	–	3	1	–	–	–	–	1	–	–
20	GirlsAskGuys	–	1	–	2	–	3	–	–	–	–	–	–	–	–
21	Nextdoor	1	–	–	–	–	–	–	–	–	2	–	–	–	–
22	ProductHunt	–	–	1	–	–	–	–	–	–	–	–	1	–	–
23	AngelList	–	–	2	–	–	–	–	–	–	–	–	–	–	–
24	Kickstarter	–	–	2	–	–	–	–	–	–	–	–	–	–	–
25	WeChat	2	–	–	–	1	–	–	–	–	–	–	–	–	–
26	Skype	1	3	–	–	–	–	–	–	–	–	–	–	–	–
27	Viber	1	3	–	–	–	–	–	–	–	–	–	–	–	–
28	Viadeo	–	–	3	–	–	–	–	–	–	–	–	–	–	–
29	Gab	1	1	–	–	–	–	–	–	–	–	–	–	–	1
30	Telegram	4	1	1	–	–	–	1	–	2	–	1	1	–	–
31	Tagged	1	–	–	–	–	–	–	–	–	–	–	–	–	–
32	Myspace	1	1	–	–	–	–	–	–	–	–	–	–	–	–
33	Badoo	1	–	–	–	–	–	–	–	–	–	–	–	–	–
34	Stumbleupon	–	–	–	1	–	–	–	–	–	–	–	–	–	–
35	Foursquare	–	–	2	1	–	1	–	–	–	–	–	–	–	–
36	MeetMe	1	–	–	–	–	–	–	–	–	–	–	–	–	–
37	Skyrock A192	–	–	–	–	–	–	–	1	–	–	–	–	–	–
38	Pinboard	–	–	–	–	–	–	1	–	–	–	–	–	–	1
39	Kiwibox	–	1	–	–	1	–	–	1	–	–	–	–	–	–
40	Twoo	1	1	–	–	–	–	–	–	–	–	–	–	–	–

Appendix D: SMPs’ primary, secondary, trivia utilities

SM sites	Primary	Secondary	Trivia
Facebook	Connecting (7)	Multimedia (4)	Entertainment (1), Opinions (1), Profile (1), Applications (1), Privacy (1), Voting (1), News (2), Promoting (2)
YouTube	Multimedia (6)	Sharing (1)	–
Instagram	Multimedia (3)	Connecting (2)	Professional (1), Sharing (1), Applications (2)
Twitter	Sharing (4)	Multimedia (2)	Connecting (1), News (1)
Reddit	Sharing (3)	Voting (2)	Connecting (1), Multimedia (1), News (1)
Vine	Multimedia (2)	Sharing (1)	–
Pinterest	Multimedia (2), Sharing (2)	Connecting (1), News (1)	–
Ask.fm	Sharing (1), Connecting (1), Promoting (1)	–	–
Tumblr	Sharing (4)	Publishing (2)	Connecting (1), Applications (1)
Flickr	Multimedia (2), Applications (2)	Connecting (1), Professional (1), Schedule (1), Privacy (1)	–
Google+	Connecting (5)	Publishing (3)	Multimedia (2), Sharing (1), Profile (1), Applications (2), Privacy (2), Voting (1)
LinkedIn	Applications (4)	Connecting (3), Professional (3)	Profile (2), Publishing (1), Privacy (1)
VK	Connecting (4)	Privacy (1), Voting (1), News (1)	–
ClassMates	Connecting (2)	Sharing (1), Schedule (1), Privacy (1)	–
Meetup	Connecting (2)	Schedule (1)	–
WhatsApp	Multimedia (3)	Connecting (1)	–
Messenger	Multimedia (3)	Connecting (1)	–
Snapchat	Multimedia (3)	Professional (1), Profile (1)	–
Quora	Opinions (3)	Profile (1), Voting (1)	–
GirlsAskGuys	Opinions (3)	Sharing (2)	Multimedia (1)
Nextdoor	Schedule (2)	Connecting (1)	–
ProductHunt	Professional (1), Voting (1)	–	–
AngelList	Professional (2)	–	–
Kickstarter	Professional (2)	–	–
WeChat	Connecting (2)	Entertainment (1)	–
Skype	Multimedia (3)	Connecting (1)	–
Viber	Multimedia (3)	Connecting (1)	–
Viadeo	Professional (3)	–	–
Gab	Connecting (1), Multimedia (1), Promoting (1)	–	–
Telegram	Connecting (4)	Applications (2)	Multimedia (1), Professional (1), Profile (1), Privacy (1), Voting (1)
Tagged	Connecting (1)	–	–
Myspace	Connecting (1), Multimedia (1)	–	–
Badoo	Connecting (1)	–	–
Stumbleupon	Sharing (1)	–	–
Foursquare	Professional (2)	Sharing (1), Opinions (1)	–
MeetMe	Connecting (1)	–	–
Skyrock A192	Publishing (1)	–	–
Pinboard	Profile (1), Promoting (1)	–	–
Kiwibox	Multimedia (1), Entertainment (1), Publishing (1)	–	–
Twoo	Connecting (1), Multimedia (1)	–	–
