Social Media Research

Dasgupta, Nabarun; Winokur, Carly; Pierce, Carrie

doi:10.1007/978-981-15-3013-5_11

Nabarun Dasgupta²,
Carly Winokur³ &
Carrie Pierce³

415 Accesses
1 Citations

Abstract

The use of the social media by people around the globe is widespread. This chapter discusses the contribution which social media research can offer to pharmacovigilance and medicinal product risk communication research. While the use of the social media itself and the development of social media strategies are important topics for research, this chapter focusses on the methods of social media listening and crowdsourcing of information, and provides examples of their utility. It highlights opportunities, limitations, challenges as well as ethical and legal aspects that need to be addressed for future research.

Access provided by Autonomous University of Puebla. Download chapter PDF

Recommendations for the Use of Social Media in Pharmacovigilance: Lessons from IMI WEB-RADR

Article Open access 24 August 2019

Regulatory Definitions and Good Pharmacovigilance Practices in Social Media: Challenges and Recommendations

Article 01 November 2015

Social Media Listening for Routine Post-Marketing Safety Surveillance

Article 21 January 2016

1 The Discipline of Social Media Research: Scope, Theories and Principles

1.1 An Introduction to Social Media Research

The internet has had widespread uptake around the globe and offers opportunities and challenges for risk communication in safety of medicines. In 2015, there were 3.2 billion (International Telecommunication Union (ICU) 2015) internet users worldwide; 63% were from low- and middle-income countries (International Telecommunication Union (ICU) 2015). Even in least-developed countries, a significant number of people access the internet regularly, especially from handheld devices operating over cellular data networks. The internet is used for disseminating and accessing information via websites, electronic mail, purchasing goods, and engaging via social media. In 2015, nearly two-thirds of adults (65%) in the United States (US) used a social networking site. While the majority of these individuals were aged between 18 and 29, 35% of adults aged 65 and older were using social media (Perrin 2015).

1.1.1 Social Media Listening

Billions of people interacting with the internet, or being “online”, on a daily basis generate traces of important information that can be aggregated and analysed for research purposes. The process of using social media to understand how consumers discuss specific topics in online spaces is known as social media listening (Powell et al. 2015). Typically, social media listening is a passive process for the social media users and has been used for commercial purposes, like marketing and retail. However, a large amount of daily discussions in social media pertains to health information and diseases, as well as biomedical and medical products that address these conditions (medicines, devices, vitamins, supplements, etc.) (Powell et al. 2015). Many of these health-related discussions are generated by patients, comprising a large corpus of free-text narratives that can be leveraged for health-specific research (Powell et al. 2015).

1.1.2 Crowdsourcing of Information

Crowdsourcing of information, on the other hand, is generally an active process whereby online participants are solicited for specific information. It may be defined as the systematic effort to collect information from a wide audience, particularly through online tools that can provide mutual benefits to participants and activity sponsors (Bahk et al. 2015).

In practice, both active and passive processes may be used within a single research project. For example, social media listening may be used to form hypotheses, which are then tested using crowdsourced data. Conversely, social media listening can be used post hoc to contextualise and make sense of unexpected crowdsourced information, such as jargon and acronyms.

Information from patients is traditionally captured through qualitative research or surveys, usually with a series of standardised questions (see Chap. 8). However, listening to the patient voice in this structured way may limit the scope of patients’ responses and their willingness to discuss sensitive topics. In contrast, unstructured discussions acquired from online forums—particularly those dedicated to discussions regarding a specific therapeutic area or treatment—could provide a wealth of patient information that typically is not captured in traditional studies due to hearing directly from the patient. Metadata derived from posts and user account profiles can provide a more complete picture for research-related applications than relying solely on a single post, and has the possible benefit of painting a more comprehensive view of a patient’s life than just based on a cross-sectional survey response.

The body of literature on digital health is expanding rapidly (Rothman et al. 2015). While the use of the social media and the development of social media strategies are important topics for research, in this chapter we narrow the focus. Our aim is to describe how to apply emerging tools of social media research—a new discipline under formation—to the post-authorisation safety surveillance of medicinal products and pharmacovigilance overall. This additionally includes the application to medicinal product risk communication research in particular, including for the purpose of planning and evaluation of communication interventions.

1.2 Pharmacovigilance, Risk Communication and the Social Media

Pharmacovigilance monitors a medicinal product to identify and assess adverse events that may occur in patients. Adverse events causally associated with a medicine (i.e. adverse reactions) pose a patient and public health problem. However, both rare and late reactions are difficult to uncover through clinical trials during the development process of a medicine because trials typically include a couple of thousand patients at maximum and are relatively short in duration compared to long-term medicines use in real life. For this reason, safety surveillance after product approval by the regulatory body and during use in healthcare is of critical importance to safeguarding the availability and development of pharmaceutical medicines. Legal obligations for pharmaceutical manufacturers and established practices during this post-authorisation phase refer to characterising, preventing, and minimising risks related to medicinal products. Fundamental to these pharmacovigilance processes are continuous exchange and (re)assessment of risk information. Many organisations currently use a combination of automated and manual processes to perform necessary pharmacovigilance duties, including with traditional individual case safety reports, i.e. reports of an adverse reaction suspected in a patient, that are submitted as the so-called spontaneous reports through national reporting systems. Harms related to medication errors or product quality concerns may also be reported depending on national definitions and requirements. Reports can be submitted via telephone, paper, email, fax, online forms, and mobile apps. Nowadays evidence from observational studies, in addition to spontaneous reports, is very important for further investigating safety concerns or proactively monitoring a medicine at the population level.

More recently, regulatory authorities and other stakeholders have recognised the importance of capturing the patient voice and data contribution for pharmacovigilance. As such, regulatory authorities in many countries recommend to patients to report adverse events they suspect with their medicines, and recommend testing of risk communication for patient comprehension, even asking for patient input on proposed risk minimisation/communication plans and strategies (Snipes 2015) . In general, the patient voice has been established as an important addition to a variety of medical research initiatives (Smith and Benattia 2016). Patient-reported outcomes are now accepted in clinical trials, and there is a renewed focus on patient-reported outcomes derived from unstructured data in other types of research, such as comparative effectiveness (Peacock 2014).

Starting in 2011, questions about the future of social media and pharmacovigilance were raised by senior figures in the field (Edwards and Lindquist 2011). With the rise of social media usage, there is potential for social media to be incorporated into effective pharmacovigilance (PatientsLikeMe 2019), including risk communication, by manufacturers, regulators, and others involved. Social media can be perceived as a new data source to inform pharmacovigilance and risk communication. Nevertheless, the volume and concerns about tenuous causality give rise to legitimate concerns about muddling data from social media with vetted data from carefully honed pharmacovigilance information systems. Yet, the processes in place globally for pharmacovigilance information processing offer a potential framework for dealing with social media data. This will require a careful balance of human and machine tasks, tempered by vastly different concepts of privacy and collaboration.

This chapter provides an overview of how social media research may be used to augment current medicines safety surveillance and risk communication practices through case studies, discussion of its potential opportunities for benefits and limitations, ethical and legal concerns, as well as practical lessons learnt and future outlook. This includes a synopsis of the current public debate on the usefulness of social media research in pharmacovigilance, underpinned by examples. Many high-quality reviews of existing applications have been published recently (Rees et al. 2018; Convertino et al. 2018; Tricco et al. 2018; Wong et al. 2018; Demner-Fushman and Elhadad 2016; Golder et al. 2015; Lardon et al. 2015; Sloane et al. 2015; Sarker et al. 2015) and should be consulted for more in-depth discussion of topics like the merits of particular data sources and computational methods.

2 Research Approaches and Methods

2.1 Selection of Social Media Sites

For clinical trials and epidemiological studies, site selection is central to investigating causal inference from observed associations. Similarly, a wide variety of social media platforms currently exist, and each may be used primarily by a different population; therefore, one social media site may be more appropriate for a specific research purpose than another. Permissions associated with a specific site might only allow for use of certain information. Additionally, each site’s users may have a unique demographic profile that could change over time. For research projects that are interested in specific, well-defined topics or events, Twitter might be useful due to the hashtag (#) feature, which groups posts into a folder system; hashtags are a means of organising content in social media, akin to folders in traditional computer operating systems or electronic mail (Grajaless et al. 2014), but limited by length of content. Researchers specifically have been able to utilise Twitter to connect with patients or potential patients about a variety of health topics. However, for privacy reasons, healthcare professionals and patients should be cautious about what content they publicly share (Grajaless et al. 2014). Closed social media platforms such as a site for patients of a clinical practice allows patients to be actively involved in their care coordination, track their clinical progress, and have greater access to their physicians (Grajaless et al. 2014). While this is beneficial to the patient, this information is often unavailable for research projects. Alternatively, online patient communities offer a theoretically more secure healthcare forum for patients to communicate with one another. These sites are more likely to partner with stakeholders who are interested in using online patient narratives in research that will directly benefit the patients who originally generated the data; however, a site’s terms of use may require organisations to pay or to follow certain guidelines to access the raw data, with varying standard of informing or obtaining consent from patients.

2.2 Study Designs

Studies using social media data often default to cross-sectional epidemiologic designs because they are straightforward to conduct. Metadata about the user account (such as patient gender and location) that accompanies an individual message posted to a site may be used to define prospective cohorts, bringing such research more in line with other epidemiological study designs. For example, if a medicine safety communication intervention is targeted to a high risk subset of patients (say, women of reproductive age actively seeking to become pregnant that should avoid a suspected teratogenic medicine), then individuals with the underlying disease condition who meet the high risk criteria could be identified in social media from post histories and metadata. This subset of patients could be enrolled in a prospective cohort to evaluate message penetration (say, by seeing if these individuals repost warning materials generated from the information campaign).

2.3 Social Media Listening

Early initiators (Knezevic et al. 2011; Bian et al. 2012; Wu et al. 2013; Chary et al. 2013; Abou Taam et al. 2014) presented technical modalities when social media surfaced as an untapped data source for pharmacovigilance. The general approach to social media listening remains the same, even as new tools are developed:

First, data are generated by users of a social media site, usually a general-purpose social network or a disease-specific patient forum.
Second, with permission from site administrators, unformatted text and metadata on user characteristics are transferred to servers held by the analyst.
Third, text is standardised and formatted for machine processing, including removal of verbatim multiplicate copies (e.g. reposts or forwards) (Sharpe 2014), perhaps with steps to preserve anonymity of social media users.
Then, an automated or semi-automated process is conducted to isolate the name of the medicine and the description of the suspected adverse reaction or another medicine-related problem, often with the use of purpose-built or existing publicly available medical semantic language tools. Machine learning tools are usually required to separate the indication for using the medicinal product from the suspected adverse reaction, as well as the removal of spam, advertisements, etc.
A further step of manual review is often executed, with vastly different amounts of human effort involved. The most intensive individual case reviews are conducted by pharmacovigilance experts, and more commonly cursory review is completed by entry-level analysts.
Finally, quantitative descriptive statistics are generated through summarisation, including comparisons to traditional sources of pharmacovigilance data, leading either to a publication for disseminating the evidence or to support internal decision-making, such as for risk management at a pharmaceutical company.

Social media listening to patient and other relevant various communities can be performed manually or through automated tools that filter and/or classify information acquired from social media. It is most commonly performed through a mixed method process of automatic tools coupled with manual review or curation (Tufts Center for the Study of Drug Development 2014). Automated data processes typically employ normalisation (i.e. organising data so that there is no redundancy, and ensuring related items are stored together), text-matching, and natural language processing techniques to collect and filter data, enabling researchers to amass a larger, more complete database (Sharpe 2014). Best analytical practices will likely require a hybrid approach leveraging automated and manual processes to contextualise the data. Manual work may be needed to develop taxonomies for translating colloquial phrases from social media into standardised medicine and medical condition concepts. Human curation is crucial for validating and improving outputs from machine learning tools for data classification. In essence, machine learning tools are excellent at replicating tasks that humans perform well through applying consistency. On the other hand, machine learning stumbles on tasks where discretion is involved, such as when humans disagree on classifications, highlighting the importance of human curation.

There are specific challenges with using data from social media listening in pharmacovigilance that have been well addressed in the scientific literature: determining which posts deserve manual review (Comfort et al. 2018; Alvaro et al. 2015), vernacular patient language (i.e. the language commonly spoken in the respective region as mother tongue) (Sharpe 2014; Jiang et al. 2018a; Emadzadeh et al. 2018; Cocos et al. 2017; Carbonell et al. 2015), 3326 misspellings of medicine and disease names (Bian et al. 2012; Carbonell et al. 2015), drawbacks to manual annotation of training a corpus (Jiang et al. 2018b; Gupta et al. 2018; Liu et al. 2018; Nikfarjam et al. 2015), and separating side effects from indications or benefits within a post (Liu and Wang 2018; Abdellaoui et al. 2017; Eshleman and Singh 2016; Liu et al. 2016; Sarker et al. 2016; Segura-Bedmar et al. 2015). Other issues being addressed by creative computer science include: dealing with constantly evolving internet slang and visual elements of text (e.g. emoticons, emoji) , geolocation of social media posts, maintenance costs of complex dynamic visualisation displays of real-time data, the burn-out from demands of human curation, purposefully misleading information disseminated by malicious actors using automated methods (e.g. bots), the ability to perform retrospective analyses on historical data, and the ability to remove personally identifiable information (PII) (Tufts Center for the Study of Drug Development 2014).

Social media listening can be used for a number of research purposes, including understanding aspects of medicines use and risks, or simply understanding what kind of information patients are asking for. It can also be used to understand audiences of risk communication, their characteristics, communication needs, and preferences more comprehensively for communication planning. Following a communication intervention, social media listening can be used to evaluate its impact.

2.4 Understanding Aspects of Medicines Use and Risks

Social media listening, or monitoring, involves two-way communication, where organisations engage in disseminating messages and also in listening to populations. For pharmacovigilance, insights may be obtained to serve risk assessment and provide for the contextualisation of risk—for example, what it means to patients—in communication materials.

More specifically, healthcare professionals generally underutilise voluntary spontaneous reporting systems of adverse reactions of medicines, due to bandwidth constraints precluding them from having time to submit reports. Patients and informal caregivers may be unaware of the importance or mechanisms by which to report adverse reactions. Additionally, some national authorities may be wary of becoming inundated with reports of minor side effects, as it could distract them from paying attention to more serious problems. Further limitations of spontaneous reporting—regardless of whether it is voluntary or mandatory—include significant underreporting of events, incomplete data quality for clinical evaluation, a lack of geographic diversity (most reports are from the US and Europe), persistent reporting of known adverse reactions, duplicate reports, and unspecified causal links (Sarker et al. 2015). Spontaneous reporting has been described as efficient for rare and very serious events. However, the sizeable limitations leave information gaps among regulatory agencies, healthcare professionals, stakeholders, and patients. While social media cannot fill all gaps and overcome all problems, there may be certain areas in which social media content can complement what is collected via traditional systems.

Two case studies (see Figs. 11.1 and 11.2) provide a methodological introduction and exemplify how social media listening can support understanding aspects of medicines use and risks. These examples demonstrate that social media data can provide the context of real world use of medicines, help identify safety concerns and risk factors, and offer additional information not typically captured by existing reporting systems, such as benefits or lack of efficacy. These two case studies provide interesting parallels and contrasts. Case study 1 (see Fig. 11.1) was conducted using Facebook and Twitter data by a large pharmaceutical company with considerable reliance on manual review and an annotated training corpus. Case study 2 (see Fig. 11.2) comes from an academic group that used consumer-generated product reviews from Amazon online marketplace in a highly automated manner. Both approaches revealed new insights into the safety of the substances and patient perceptions of them. A third case study 3 (see Fig. 11.3) describes how online news and social media could be used to understand infectious disease outbreaks and support safety surveillance of anti-infectives as well patients in making healthy choices.

2.5 Understanding Audiences of Risk Communication and Their Information Needs

Since various social media platforms are used by large proportions of the general population, they can provide stakeholders with access to more diverse and comprehensive patient cohorts than those used in traditional studies (Rothman et al. 2015). Integration of traditional data sources with alternatives such as social media, partnered with rapid buy-in from key stakeholders may allow regulators, pharmaceutical industry, academia, and healthcare professionals to better understand the patient communities they serve. This in turn enables patients’ first-hand experiences to improve the care they receive (Smart Patients, Inc 2015). To leverage this effectively, methods are needed to filter out noise and distil insights from patients (Larkin 2014). A 2015 analysis of vaccine sentiments in Twitter users in the US performed illustrates the application of social media listening to better understand audiences to develop strategies and communication intervention to address their concerns. The analysis showed which themes and terms were more prevalent in positive, neutral, and negative sentiment networks. This approach could guide which messages and words to use for reaching and improving vaccine confidence in the respective populations. Methodologically, the study was performed through coding, creation of semantic networks, and their analysis (Kang et al. 2017).

2.6 Crowdsourcing of Information

Social media cannot fill all gaps and overcome all problems seen with traditional data sources used for pharmacovigilance. Nonetheless, there may be certain areas in which social media content can complement what traditional systems collect, such as data directly from patients. Traditional systems for spontaneous reporting of suspected adverse reactions are burdensome and time-consuming for healthcare professionals and patients, for whom reporting is mostly voluntary. Patients completing reports through traditional channels can take up to an hour. As a result, only 2% of reports received by the US Food and Drug Administration (US FDA) are reported by patients directly, i.e. not by or via a healthcare professional. Online and mobile tools have been developed to address barriers to reports, streamline the reporting process, and make them more user-friendly. Additional tools have been developed to perform digital disease detection in the form of online surveillance and social media listening, allowing for a more complete, accurate picture of medicinal product—adverse event pairs (Bahk et al. 2015). These tools’ hallmark is the ability to support a concept known as crowdsourcing.

Crowdsourcing tools enable stakeholders to directly engage with a patient community. Patient community outreach can be successful if conducted through social media platforms where community groups may pre-exist. These communities may look different depending on the networking site. For example, Facebook hosts pages or member groups that can be set up by any member to provide a space for dedicated discussion according to a patient population, interest group, or disease area (Bahk et al. 2015). A Twitter-based community would be organised by hashtags that identify different patient populations or concepts that are aggregated by a folder system to be easily identified through a simple query (Grajaless et al. 2014). For example, Twitter users may use the hashtag #teamnosleep to self-identify themselves as insomniacs. Social media patient communities typically openly discuss experiences with their disease(s) and/or treatment(s) that include conversations about adverse events and benefits of medicines, news in scientific journals, and official communications, such regulatory guidelines, label changes, and product recalls. Organisations can access these group members by contacting the group administrator(s) for permission to engage with members and discuss the benefits of utilising an online crowdsourcing tool (Bahk et al. 2015). Administrators may encourage the group to participate in the crowdsourcing. This could include utilising social media to share information about potential adverse reactions of a medicine among a specific patient group (Bahk et al. 2015). This method of patient engagement is illustrated in the motivation-incentive-activation-behaviour (MIAB) concept. In the MIAB concept, motivation is the reason for patient interest, and incentive is what leads the patient to act. Activation is the set of factors that lead to the patient’s actual participation, and behaviour is the activity of interest and outcome—in this case, submitting a suspected adverse reaction report (Bahk et al. 2015). It has been proven that patients are more likely to engage in activities that reduce their own burden or that provide some benefit in exchange for some equal level of effort (Bahk et al. 2015). A proven history of patient buy-in to social listening and to other digital tools for pharmacovigilance may encourage patients to participate in crowdsourcing activities. This can be seen as a more active form of two-way communication, which has implications for traditional communication efforts as well as offering opportunities.

3 Utility of Applied Methods for Researching Medicinal Product Risk Communication

3.1 Opportunities of Social Media Research

“Fast”, “cost-effective”, “large-scale”, “transparent”, “patient-generated”, “real-time” and “general usefulness” are all phrases commonly used to describe the strengths of social media listening and crowdsourcing.

Social media listening is often available prospectively and in real time, allowing stakeholders to quickly grasp disease prevalence and other epidemiological insights, the impact of a medical intervention, (like a medicine), health topics, and questions of interest to medicine users. Pharmaceutical companies often use such listening alongside launches of new medicines or post-authorisation studies to gather information on how the patient population is responding to treatments. It has also been used to determine where to host a study or launch a new product or intervention due to previously unknown medical need and patient demand (Larkin 2014). Just as importantly, medicinal risk communication may benefit from social media mining, in monitoring and evaluations of communication interventions, or even in the planning phase of communication. Reliance on online health forums for medical advice could be risky to patients; they could be misinformed by each other, improperly self-diagnose, or inappropriately use a medication. Hence, it could be beneficial to capture complex topics and confusing messages. These insights can be used to inform healthcare professional communications to patients, for example. Social media listening enables capturing a large amount of unsolicited, patient-generated data that are available publicly or with permission. End users are provided with the resulting data either in verbatim form or in aggregate, via datasets, summary reports, or visualisations. Since the population of social media users is pre-existing, this method is thought to be cost-effective for the potential amount of data and information gathered from these sources (O’Connor et al. 2014). To collect, clean, analyse, and visualise the same volume of data from other sources would take years, and the timely actionable insight provided would be limited due to the time required to disseminate results (Donahue 2012).

As patients become more knowledgeable about their medical conditions, their articulation of first-hand experiences and perspectives contribute to a valuable data source that can improve the care they receive (Smart Patients, Inc 2015). The widespread use of social media platforms provides communication researchers and practitioners with the ability to understand and design communication interventions for populations that would otherwise be hard-to-reach audiences. The use of new technology and the rapid uptake of social media will provide for better responses to the patient communities they serve.

Many patients report a lack of trust in healthcare professionals, preferring to share information with fellow patients and caregivers (Peacock 2015). Since some diseases, specifically rare diseases or those with social stigma, are associated with an isolating experience that can span several geographical areas, many individuals look to social media to communicate with their peers (Peacock 2015). These patient forums offer anonymity and privacy that may result in patients providing unfiltered data that are more readily available than data from traditional sources. This content can be incredibly beneficial to organisations leveraging social media listening as a research tool: these conversations are unsolicited, and often unfiltered and unabashed. Online discussions among patients about medicines often extend to wider aspects of use, such as off-label use (i.e. use with a medical purpose not in accordance with the terms of the marketing authorisation), as well as issues with product quality, formulation, handling and disposal, sensitive or stigmatised topics, and reluctance to adhere to treatment due to troublesome adverse reactions.

Crowdsourcing offers the opportunity to specifically solicit information on medicines’ use behaviours, risk knowledge and perceptions, communication needs, and preferences as well as feedback on communication events.

Finally, information from patient populations may reflect preconceived notions of shared beliefs due to community mentality, which should be considered in research projects. A carefully planned social listening campaign that accounts for nuances of social media data and potential biases gleans insights from a diverse range of global patient populations.

3.2 Limitations of Social Media Research

While social media data may be readily available in unprecedented volumes, these data represent unsolicited responses, often making it challenging to understanding its quantity or quality. Once personal identifiers are removed from social media data, it is impossible—and ethically challenging—to verify a reported adverse event by following up with a social media user. Additionally, it is difficult to validate the information until data from traditional sources are available for a comparison analysis. Despite the exuberance generated by the potential of social media mining, in practice there has been a vigorous and necessary debate about the practical application of social media mining for pharmacovigilance. In fact, multiple recent, sophisticated, large-scale efforts and systematic reviews have concluded that routine use of social media for pharmacovigilance underperforms pharmacovigilance data collection systems, including industry-dominated traditional reports of suspected adverse reactions submitted to national authorities (Rees et al. 2018; Convertino et al. 2018; Caster et al. 2018; Kheloufi et al. 2017; Pierce et al. 2017). Others have acknowledged these limitations and noted that social media may fill niche knowledge gaps in medicine safety or may require the use of more sophisticated computing tools (Lardon et al. 2018; Bousquet et al. 2018; Anderson et al. 2017). In most cases of serious adverse reactions identified by regulatory authorities, vigilant physician reporters were the most consistent and earliest source of information on new safety signals, compared to social media.

The authors of the largest evaluation to date (Caster et al. 2018) identified key limitations. In their evaluation, they analysed more than two million Twitter, Facebook, and patient forum posts, using an automated Bayesian classifier and purpose-built patient vernacular dictionary to assign risk scores to posts. Two reference datasets of known positive and negative controls were used for comparison. In addition, a major global database of adverse reactions (i.e. VigiBase) was used in head-to-head comparisons with social media. The analysis calculated traditional pharmacovigilance reporting disproportionality ratios for each medicine in social media and compared them against controls. The results were extensive and decisive: “This study investigated the potential usefulness of social media as a broad-based stand-alone data source for statistical signal detection in pharmacovigilance. Our results provide very little evidence in favour of social media in this respect: in neither of the two complementary reference sets, containing validated safety signals and label changes, respectively, did standard disproportionality analysis yield any predictive ability in a large dataset of combined Facebook and Twitter posts… [M]anual assessment of Facebook and Twitter posts underlying 25 early signals of disproportionality showed that only 40% of posts contained the correct drug and the correct event as an adverse experience, and for only three of those 25 signals did the posts strengthen the belief in a causal association” (Caster et al. 2018). The authors offered some possible explanations. First, some medications may have very little discussion in social media channels. Second, identifying rare events in social media may be difficult if the specific colloquial terms are not detected, and the underlying algorithm to detect adverse reactions may have limited detection ability for the types of very rare events of interest to safety reviewers. Third, there is possible bias when comparing social media results to established reference or validation datasets of known signals. Relatively few reference datasets are in public scientific literature, and the nature of the comparison can vary greatly. Fourth, using statistical aberration detection methods originally optimised for traditional pharmacovigilance systems may not be appropriate for social media-based applications (Caster et al. 2018).

In relation to medicinal product risk communication research, like many other data sources, social media data have inherent biases that must be considered when interpreting results. Biases specific to social media data result from each social media network having its own user demographic profile, making it difficult to generalise findings to a larger population of patients who may not fit this profile. This could, for example, influence the provision of useful data pertaining to medicines most commonly used by specific populations, like older or paediatric patients. In addition, certain brands or types of medicinal products may be represented differentially in the social media; thus, an organisation ought to consider determining how often products are discussed online prior to launching a social media research project. Another bias dimension of using unstructured text is literacy bias. Individuals with limited written language skills will only be represented in the data if someone else posts about their experiences for them. The use of emoji and voice-to-text tools may be able to mitigate some of this bias.

For some products, such as medicines against the human immunodeficiency virus (HIV) and acquired immune deficiency syndrome (AIDS), or hepatitis B, individuals may not be willing to communicate publicly about their treatment experiences due to stigmas associated with their diseases and concerns about being identifiable. This could result in bias due to large self-selection or incomplete information sharing. Honest conversations are more likely to be found on specific patient forums as opposed to on public social media sites. Moreover, if patients suspect that they are being monitored, they may go elsewhere to post comments about their disease or treatment regimen, posing a risk to social listening projects.

There is another issue to consider. The need to improve health outcomes, increase safety and safe use of medicines and manage risk are major drivers behind collecting patient data. Communication practitioners and researchers should note that as more data have been collected, concerns about privacy have grown beyond patient privacy. Notably, one of the biggest lessons learnt from using social media for pharmacovigilance is that patients will talk. While this may seem to many like a treasure trove of information, there is major concern that patients will become unblinded when social media is used alongside clinical trials (Lipset 2014). This occurred during a 2009 clinical trial when a patient discovered that she had been placed on the study product (as opposed to placebo or comparator) (Lipset 2014). This realisation led to more individuals seeking online patient communities to share symptoms and compare notes about pill formulations and taste to try to determine which treatment they were receiving. Many patients do not understand the consequences of these interactions, which could end a clinical trial early, and delay or even prevent a new treatment from becoming available to other afflicted patients. This underscores the importance of clinical trial subjects understanding that their social media discussions may compromise randomisation and be an inherent threat to validity in clinical trials. Such discussions among clinical trial participants should be discouraged or sequestered while the clinical trial is underway. Social media monitoring for such discussions could therefore be useful to proactively understand this threat to clinical trial validity.

3.3 Ethical and Legal Aspects

Regulatory guidelines and best practices are slowly emerging regarding when and which organisations have the legal responsibility for mining patient narratives through social media listening (Lengsavath et al. 2017). The regulatory dimensions are addressed as part of the WEB-RADR project (web-radr.eu) (Ghosh and Lewis 2015) and by a few authors (Sloane et al. 2015; Lengsavath et al. 2017; Naik et al. 2015). Despite the ambiguity and evolving regulatory environment, major pharmaceutical companies have executed social media listening projects in recent years (Powell et al. 2015; Comfort et al. 2018; Caster et al. 2018). Currently, the most evident disadvantage to using social media for research relating to medicinal product safety and communication is the lack of regulatory guidance and best practices regarding the use of social media data.

Social media listening also poses ethical and privacy concerns, especially within private online communities (Stergiopoulos 2014). To meet moral obligation, many organisations will only listen in and/or engage with patients on public social media platforms once they have announced their affiliation and presence to the patient(s) (Stergiopoulos 2014). In addition, ethical and privacy regulations are distinct across different geographic regions. Hence, organisations that wish to engage in social media listening must be cognisant of these differences to avoid or address privacy breaches in a timely manner (Stergiopoulos 2014). Due to the speed at which information travels on social media, a researcher may benefit from considering issues that may arise from inappropriately using social media (Stergiopoulos 2014).

Pharmaceutical manufacturers must also consider, as part of their protocol, how to conduct social media listening activities in a way that addresses liability and compliance, meeting regulatory requirements. Legally required reporting of suspected adverse reactions necessitates patient information. This poses a challenge in social media listening, as there is limited ability to confirm that individuals are using their true identity when posting on social media sites, or to approach them if they are obviously using an alias name. When monitoring social media alongside clinical trials, this challenge becomes more complicated, as there is often no way of confirming a patient’s participation in a specific clinical trial (Thompson 2014). Furthermore, even if a person can be confirmed as a trial participant, there would be no way of confirming in which arm of the trial a participant is participating, which treatment(s) that participant is receiving, or if any adverse event reported in social media has already been recorded and dealt with appropriately (Barry 2014). It is therefore highly recommended that legal and compliance departments review the use of any social media for recruitment or use alongside a clinical trial, prior to the start of social media listening activities (Dizon et al. 2012). This practice could also be subject to institutional review board (IRB) approval and require compliance with national privacy laws (Dizon et al. 2012). Alternatively, the rules and requirements for surveillance campaigns and observational studies are often less scrutinising. Therefore, it is important to determine the feasibility of using social media for a specific project prior to committing resources.

When considering the use of a third-party vendor to acquire social media data, an organisation should ensure that the vendor meets all compatibility and accountability standards required for the research project as well as provide all needed software services. The regulatory and societal expectations of privacy with social media data are rapidly changing and should be considered in earnest to maintain the credibility and viability of the research effort.

More specifically, Appendix 11.1 provides an introduction to the data protection regulation applicable in the European Union (EU) and derives some globally applicable principles.

4 Outlook: Relevance, Improvements and Future Potential

As a field, we are at a crossroads in pharmacovigilance. The potential of social media is hard to deny, but the execution in relation to the collection of adverse reactions has born little fruit (Rees et al. 2018; Convertino et al. 2018; Caster et al. 2018). Yet, many researchers regularly derive new insights from monitoring social media content (Lardon et al. 2018; Kurzinger et al. 2018; Patel et al. 2018; Keller et al. 2018; Chen et al. 2018). One research article’s title summarises this succinctly: “Descriptions of adverse drug reactions are less informative in forums than in the French pharmacovigilance database but provide more unexpected reactions.” (Karapetiantz et al. 2018). This may very well be the key insight from the past decade of efforts to understand the role of social media for collecting adverse reaction data; given that any surveillance system is inherently designed to identify what is expected, as broadly defined among the scope of outcomes. The challenge for the future will be to narrow the scope of inquiry and to focus on social media mining applications that are most likely to generate new knowledge; our focus to date has been on information more generally. When considering an assessment of a new safety concern with a medicine, evidence from animal studies, laboratory findings, clinical trials, pharmacoepidemiological studies, and treatment experience all come into play. Machines do not appear to be on the cusp of replacing this complex human assessment in the immediate future; perhaps, harvesting new knowledge from the exuberant promise of social media will require the development of automated multi-factorial safety reviewing.

A further objective of social media research for pharmacovigilance purposes is to capture information about patients and medicinal products through a patient-centric lens. This is achieved by turning to social media to amplify the patient voice to understand patients’ knowledge, attitudes, and behaviours—to understand them as audiences of our communication—and to collect data which help evidence-based planning and evaluating of communication interventions that support informed therapeutic choice and safe use of medicines. Social media is a communication channel, which is an important research topic in itself. Such research may determine who uses social media and how, with a view to inform communication strategies for incorporating the social media not only for listening but also messaging. Beyond pharmacovigilance per se, social media data present the tantalising possibility of providing insight into how physicians communicate with each other (Albarqouni et al. 2019; Graff et al. 2018; Falzon et al. 2016), topics that patients want to know more about (Charlie et al. 2018), and how the public reacts to health news in real time (Adams and Schiffers 2017). These broader dimensions of medicines safety and communication have not yet been evaluated in social media adequately.

In conclusion, social media listening and crowdsourcing of information provide a timely and insightful complement to traditional methods for medicinal product risk communication research, and is applicable globally. Given people’s increasing use of the internet and social media, and patients’ views on the prospects of its utility for data gathering in support of patient-centred care (see Chap. 16), the emerging discipline of social media research is becoming an essential part of a multidisciplinary and multilayered approach to medicinal product risk communication research (see Chap. 1). As a source for data on real-time patient discussions, social media can be used to understand aspects of use of medicines in healthcare, information needs and adverse reactions as characterised by patients, as well as to monitor and improve risk communication efforts. Online discussions among patients about medicines often extend to wider aspects of use, such as off-label use, and issues with product quality, formulation and handling and disposal, and even reluctance to adhere to treatment regimens due to adverse reactions experienced by the patient. Social media can also be used to identify specific patient groups for soliciting perspectives on certain safety concerns and risk communication needs. Lastly, as social media listening and crowdsourcing information gains traction as a viable source for insights, it will become necessary to acknowledge its myriad challenges—in particular inherent noise, incomplete data when follow-up is impossible, privacy and patient protection, and lack of regulatory guidance. More coordinated research among academics, regulators, pharmaceutical industry, and subject matter experts is needed to develop best practice guidance. Practical solutions that adequately address these social media research challenges without impacting the usefulness of the data for pharmacovigilance, including improving communication about risks and safe use of medicines, will be of utmost importance.

Conclusions

Social media research can provide a timely and insightful complement to traditional data sources for pharmacovigilance as well as medicinal product risk communication research, in particular for planning and evaluating of communication interventions.
As a source for real-time patient discussions, social media listening can facilitate understanding aspects of use of medicines in healthcare, adverse reactions as characterised by patients, audiences and their information needs as well as help monitor and improve risk communication efforts. Online discussions among patients about medicines often extend to wider aspects of use, such as off-label use, as well as issues with product quality, formulation, handling and disposal, sensitive or stigmatised topics, and reluctance to adhere to treatment due to adverse reactions.
Social media can also be used to identify specific patient groups for soliciting perspectives on certain safety concerns and risk communication needs, an approach called crowdsourcing for information.
Social media is an evolving global communication channel. Understanding who uses these media and how is important for informing communication strategies, for both listening and tailoring messaging.
Social media research needs to consider specific potential for bias as well as ethical and legal concerns. Therefore, more collaboration is needed among researchers, regulators, the pharmaceutical industry, and subject matter experts. This collaboration is critical to develop best practice guidance and practical solutions that adequately address these challenges without impacting the usefulness of the data for pharmacovigilance and communication about risks and safe use of medicines.

References

Abdellaoui R, Schück S, Texier N, Burgun A (2017) Filtering entities to optimize identification of adverse drug reaction from social media: how can the number of words between entities in the messages help? JMIR Public Health Surveill 3(2):e36. Accessible at: https://publichealth.jmir.org/2017/2/e36/
Article Google Scholar
Abou Taam M, Rossard C, Cantaloube L, Bouscaren N, Roche G, Pochard L, Montastruc F, Herxheimer A, Montastruc JL, Bagheri H (2014) Analysis of patients’ narratives posted on social media websites on benfluorex’s (Mediator®) withdrawal in France. J Clin Pharm Ther. 39:53–55. Accessible at: https://onlinelibrary.wiley.com/doi/abs/10.1111/jcpt.12103
Adams S, Schiffers P (2017) Co-constructed health narratives during a ‘media event’: the case of the first Dutch Twitter heart operation. Digit Health 3:2055207617712046. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6001202/.
Albarqouni L, Hoffmann T, McLean K, Price K, Glasziou P (2019). Role of professional networks on social media in addressing clinical questions at general practice: a cross-sectional study of general practitioners in Australia and New Zealand. BMC Fam Pract 20(1):43. Accessible at: https://bmcfampract.biomedcentral.com/articles/10.1186/s12875-019-0931-x
Alvaro N, Conway M, Doan S, Lofi C, Overington J, Collier N (2015) Crowdsourcing Twitter annotations to identify first-hand experiences of prescription drug use. J Biomed Inform 58:280–287. Accessible at: https://www.sciencedirect.com/science/article/pii/S1532046415002415?via%3Dihub
Anderson LS, Bell HG, Gilbert M, Davidson JE, Winter C, Barratt MJ, Win B, Painter JL, Menone C, Sayegh J, Dasgupta N (2017) Using social listening data to monitor misuse and nonmedical use of bupropion: a content analysis. JMIR Public Health Surveill 3:e6
Article Google Scholar
Bahk C, Goshgarian M, Donahue K, Freifeld CC, Menone CM et al (2015) Increasing patient engagement in pharmacovigilance through online community outreach and mobile reporting applications: an analysis of adverse event reporting for the Essure device in the US. Pharm Med 29:331–341
Article Google Scholar
Barry F (2014) Pfizer: how Facebook can ‘unblind’ a clinical trial. Outsourcing-pharma.com. June 9. Accessible at: https://www.outsourcing-pharma.com/Article/2014/06/09/Pfizer-How-Facebook-can-unblind-a-clinical-trial#
Bian J, Topaloglu U, Yu F (2012) Towards large-scale Twitter mining for drug-related adverse events. SHB12. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5619871/
Bousquet C, Audeh B, Bellet F, Lillo-LeLouet A (2018) Comment on “Assessment of the utility of social media for broad-ranging statistical signal detection in pharmacovigilance: results from the WEB-RADR Project”. Drug Saf 41:1371–1373. Accessible at: https://springerlink.bibliotecabuap.elogim.com/article/10.1007%2Fs40264-018-0747-y
Carbonell P, Mayer MA, Bravo A (2015) Exploring brand-name drug mentions on Twitter for pharmacovigilance. Stud Health Technol Inform, 210. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=25991101
Caster O, Dietrich J, Kurzinger ML, Lerch M, Maskell S, Noren GN, Tcherny-Lessenot S, Vroman B, Wisniewski A, van Stekelenborg J (2018) Assessment of the utility of social media for broad-ranging statistical signal detection in pharmacovigilance: results from the WEB-RADR project. Drug Saf 41:1355–1369
Article CAS Google Scholar
Charlie AM, Gao Y, Heller SL (2018) What do patients want to know? Questions and concerns regarding mammography expressed through social media. J Am Coll Radiol 15(10):1478–1486. Accessible at: https://www.jacr.org/article/S1546-1440(17)31170-5/fulltext
Article Google Scholar
Chary M, Genes N, McKenzie A, Manini AF (2013) Leveraging social networks for toxicovigilance. J Med Toxicol 9:184–191. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3657021/
Chen X, Faviez C, Schuck S, Lillo-Le-Louët A, Texier N et al (2018) Mining patients’ narratives in social media for pharmacovigilance: adverse effects and misuse of methylphenidate. Front Pharmacol 9:541
Article Google Scholar
Cocos A, Fiks AG, Masino AJ (2017) Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts. J Am Med Inform Assoc 24:813–821. Accessible at: https://academic.oup.com/jamia/article-abstract/24/4/813/3041102?redirectedFrom=fulltext
Comfort S, Perena S, Hudson Z, Dorrell D, Meireis S, Nagarajan M, Ramakrishnan C, Fine J (2018) Sorting through the safety data haystack: using machine learning to identify individual case safety reports in social-digital media. Drug Saf 41:579–590. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5966485/
Convertino I, Ferraro S, Blandizzi C, Tuccori M (2018) The usefulness of listening social media for pharmacovigilance purposes: a systematic review. Expert Opin Drug Saf 17:1081–1093. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=30285501
Demner-Fushman D, Elhadad N (2016) Aspiring to unintended consequences of natural language processing: a review of recent developments in clinical and consumer-generated text processing. Yearb Med Inform 10:224–233. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=27830255
Dizon D, Graham D, Thompson M, Johnson L, Johnston C et al (2012) Practical guidance: the use of social media in oncology practice. Bus Oncol 8(5):e113–e124
Google Scholar
Donahue M (2012) Patient recruitment via social media: lessons learned. Pharm Exec. February 13. Accessible at: http://www.pharmexec.com/patient-recruitment-social-media-lessons-learned
Edwards IR, Lindquist M (2011) Social media and networks in pharmacovigilance: boon or bane? Drug Saf 34:267–271. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=21417499
Emadzadeh E, Sarker A, Nikfarjam A, Gonzalez G (2018) Hybrid semantic analysis for mapping adverse drug reaction mentions in tweets to medical terminology. AMIA Annu Symp Proc 16:679–688. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5977584/
Eshleman R, Singh R (2016) Leveraging graph topology and semantic context for pharmacovigilance through twitter-streams. BMC Bioinformatics 17(Suppl 13):335. Accessible at: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1220-5
Falzon D et al (2016) Digital health for the End TB Strategy: developing priority products and making them work. Eur Respir J 26(48):29–45
Article Google Scholar
Freifeld CC, Brownstein JS, Menone CM, Bao W, Filice R, Kass-Hout T, Dasgupta N (2014) Digital drug safety surveillance: monitoring pharmaceutical products in twitter. Drug Saf 37:343–350. Accessible at: https://springerlink.bibliotecabuap.elogim.com/article/10.1007%2Fs40264-014-0155-x
Ghosh R, Lewis D (2015) Aims and approaches of Web-RADR: a consortium ensuring reliable ADR reporting via mobile devices and new insights from social media. Exp Opin Drug Saf 14:1845–1853. Accessible at: https://www.tandfonline.com/doi/abs/10.1517/14740338.2015.1096342?journalCode=ieds20
Golder S, Norman G, Loke YK (2015) Systematic review on the prevalence, frequency and comparative value of adverse events data in social media. Br J Clin Pharmacol 80(4):878–888. Accessible at: https://bpspubs.onlinelibrary.wiley.com/doi/full/10.1111/bcp.12746
Graff SL, Close J, Cole S, Matt-Amaral L, Beg R, Markham MJ (2018) Impact of closed Facebook group participation on female hematology/oncology physicians. J Oncol Pract 4(12):e758–e769
Article Google Scholar
Grajaless F III, Sheps S, Ho K, Novak-Lauscher H, Eysenbach G (2014) Social media: a review and tutorial of applications in medicine and health care. J Med Internet Res 16(2):e13
Article Google Scholar
Gupta S, Pawar S, Ramrakhiyani N, Palshikar GK, Varma V (2018) Semi-supervised recurrent neural network for adverse drug reaction mention extraction. BMC Bioinformatics 19(Suppl 8):212. Accessible at: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2192-4
International Telecommunication Union (ICU). ICT Facts and Figures The World in 2015. Accessible at: http://www.itu.int/en/ITU-D/Statistics/Documents/facts/ICTFactsFigures2015.pdf
Jiang K, Chen T, Calix RA, Bernard GR (2018a) Identifying consumer health terms of side effects in Twitter posts. Stud Health Technol Inform 251:273–276. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6041256/
Jiang K, Feng S, Song Q, Calix RA, Gupta M, Bernard GR (2018b) Identifying tweets of personal health experience through word embedding and LSTM neural network. BMC Bioinformatics 19(Suppl 8):210. Accessible at: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2198-y
Kang GJ, Ewing-Nelson SR, Mackey L, Schlitt JT, Marathe A, et al (2017) Semantic network analysis of vaccine sentiment in online social media. Vaccine 35:3621–3638. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5548132/
Karapetiantz P, Bellet F, Audeh B, Lardon J, Leprovost D et al (2018) Descriptions of adverse drug reactions are less informative in forums than in the French pharmacovigilance database but provide more unexpected reactions. Front Pharmacol 9:439
Article Google Scholar
Keller MS, Mosadeghi S, Cohen ER, Kwan J, Spiegel BMR (2018) Reproductive health and medication concerns for patients with inflammatory bowel disease: thematic and quantitative analysis using social listening. J Med Internet Res 20(6):e206. Accessible at: https://www.jmir.org/2018/6/e206/
Kheloufi F, Default A, Blin O, Micallef J (2017) Investigating patient narratives posted on Internet and their informativeness level for pharmacovigilance purpose: The example of comments about statins. Therapie 72:483–490. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=28065444
Knezevic MZ, Bivolarevic IC, Peric TS, Jankovic SM (2011) Using Facebook to increase spontaneous reporting of adverse drug reactions. Drug Saf 34:351–352
Article Google Scholar
Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P (2010) A side effect resource to capture phenotopic effects of drugs. Mol Syst Biol 6:343. Accessible at: https://www.embopress.org/doi/full/10.1038/msb.2009.98
Kuhn M, Letunic I, Jensen LJ, Bork P (2016) The SIDER database of drugs and side effects. Nucleic Acids Res 44(D1):D1075–D1079. Accessible at: https://academic.oup.com/nar/article/44/D1/D1075/2502602
Kurzinger ML, Schuck S, Texier N, Abdellaoui R, Faviez C, Pouget J, Zhang L, Tcherny-Lessenot S, Lin S, Juhaeri J (2018) Web-based signal detection using medical forums data in France: comparative analysis. J Med Internet Res 20:e10466. Accessible at: https://www.jmir.org/2018/11/e10466/
Lardon J, Abdellaoui R, Bellet F, Asfari H (2015) Adverse drug reaction identification and extraction in social media: a scoping review. J Med Internet Res 17:e171. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=26163365
Lardon J, Bellet F, Aboukhamis R, Asfari H, Souvignet J, Jaulent MC, Beyens MN, Lillo-LeLouet A, Bousquet C (2018) Evaluating Twitter as a complementary data source for pharmacovigilance. Exp Opin Drug Saf 17:763–774. Accessible at: https://www.tandfonline.com/doi/abs/10.1080/14740338.2018.1499724?journalCode=ieds20
Larkin M (2014) Social media for pharma: an expert’s view. Elsevier. December 2. Accessible at: http://www.elsevier.com/connect/social-media-for-pharma-an-expertsview
Lengsavath M, Dal Pra A, de Ferran AM, Brosch S, Härmark L, Newbould V, Goncalves S (2017) Social media monitoring and adverse drug reaction reporting in pharmacovigilance: an overview of the regulatory landscape. Ther Innov Regul Sci 51(1):125–131
Article Google Scholar
Lipset C (2014) Engage with research participants about social media. Nat Med 20:231
Article CAS Google Scholar
Liu J, Wang G (2018) Pharmacovigilance from social media: an improved random subspace method for identifying adverse drug events. Int J Med Inform 117:33–43. Accessible at: https://www.sciencedirect.com/science/article/abs/pii/S1386505618304416?via%3Dihub
Liu J, Zhao S, Zhang X (2016) An ensemble method for extracting adverse drug events from social media. Artif Intell Med 70:62–76. Accessible at: https://linkinghub.elsevier.com/retrieve/pii/S0933-3657(15)30037-3
Liu J, Zhao S, Wang G (2018) SSEL-ADE: a semi-supervised ensemble learning framework for extracting adverse drug events from social media. Artif Intell Med 84:34–49. Accessible at: https://www.sciencedirect.com/science/article/pii/S0933365717301847?via%3Dihub
Medical Dictionary for Regulatory Activities (MedDRA). Accessible at: http://www.meddra.org
Naik P, Umrath T, van Stekelenborg J, Ruben R, Abdul-Karim N, et al (2015) Regulatory definitions and good pharmacovigilance practices in social media: challenges and recommendations. Ther Innov Regul Sci 49:840–851. Accessible at: https://journals.sagepub.com/doi/abs/10.1177/2168479015587362?rfr_dat=cr_pub%3Dpubmed&url_ver=Z39.88-2003&rfr_id=ori%3Arid%3Acrossref.org&journalCode=dijc
Nikfarjam A, Sarker A, O’Connor K, Ginn R, Gonzalez G (2015) Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features. J Med Am Inform Assoc 22:671–681
Google Scholar
O’Connor A, Jackson L, Goldsmith L, Skirton H (2014) Can I get a retweet please?: health research recruitment and the Twittersphere. J Adv Nurs 70:599–609
Article Google Scholar
Park HA, Jung H, On J, Park SK, Kang H (2018) Digital epidemiology: use of digital data collected for non-epidemiological purposes in epidemiological studies. Healthc Inform Res 24:253–262. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=30443413
Patel R, Belousov M, Jani M, Dasgupta N, Winokur C, Nenandic G, Dixon WG (2018) Frequent discussion of insomnia and weight gain with glucocorticoid therapy: an analysis of Twitter posts. NPJ Digit Med, 1. Accessible at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6364798/
PatientsLikeMe. About us. patientslikeme.com. 2019. And: Okun S, Goodwin K (2017) Building a learning health community: By the people, for the people. Learn Health Sys 1:e10028. Both accessible at: https://www.patientslikeme.com/about
Peacock E (2014) Global forum special section: transforming recruitment for clinical trials via patient social networks. DIA. October 1. Accessible at: http://www.diaglobal.org/en/resources/news#article=65afd337-6fc3-4abf-8d43-797172fc1314
Peacock E (2015) Engaging patient social networks in clinical trials and burden of disease studies. Drug Information Association (DIA). October 1
Google Scholar
Perrin A (2015) Social media usage: 2005–2015. Pew Research Center. Accessible at: http://www.pewinternet.org/2015/10/08/social-networking-usage-2005-2015/
Pierce CE, Bouri K, Pamer C, Proestel S, Rodriguez HW, Ven Le H, Freifeld CC, Brownstein JS, Walderhaug M, Edwards IR, Dasgupta N (2017) Evaluation of Facebook and Twitter monitoring to detect safety signals for medical products: an analysis of recent FDA safety alerts. Drug Saf 40:317–331. Accessible at: https://springerlink.bibliotecabuap.elogim.com/article/10.1007%2Fs40264-016-0491-0
Powell GE, Seifert HA, Reblin T, Burstein PJ, Blowers J et al (2015) Social media listening for routine post-marketing safety surveillance. Drug Saf 39:443–454
Article Google Scholar
Rees S, Mian S, Grabowski N (2018) Using social media in safety signal management: is it reliable? Ther Adv Drug Saf 9:591–599. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=30283627
Rothman M, Gnanaskathy A, Wicks P, Papadopoulos E (2015) Can we use social media to support content validity of patient-reported outcome instruments in medial product development? Value Health 18:1–4
Article Google Scholar
Sarker A, Ginn R, Nikfarjam A, O’Connor K, Smith K et al (2015) Utilizing social media data for pharmacovigilance: a review. J Biomed Inform 54:202–212
Article Google Scholar
Sarker A, Nikfarjam A, Gonzalez G (2016) Social media mining shared task workshop. Pac Symp Biocomput 21:581–592. Accessible at: https://www.worldscientific.com/doi/abs/10.1142/9789814749411_0054
Segura-Bedmar I, Martinez P, Revert R, Moreno-Schneider J (2015) Exploring Spanish health social media for detecting drug effects. BMC Med Inform Decis Mak 15(Suppl 2):S6. Accessible at: https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/1472-6947-15-S2-S6
Sharpe T (2014) Global forum special section: patient perspective on social media. Drug Information Association (DIA). October 1. Accessible at: http://www.diaglobal.org/en/resources/news#article=7b3df92f-82b7-443a-a759-ad46b203f0b3
Sloane R, Osanlou O, Lewis D, Bollegala D, Maskell S, Pirmohamed M (2015) Social media and pharmacovigilance: a review of the opportunities and challenges. Br J Clin Pharmacol 80:910–920. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=26147850
Smart Patients, Inc. (2015) Accessible at: https://www.smartpatients.com/about
Smith M, Benattia I (2016) The patient’s voice in pharmacovigilance: pragmatic approaches to building a patient-centric drug safety organization. Drug Saf 39:779–785
Article Google Scholar
Snipes K (2015) Using social media and digital media to increase patient recruitment and retention. Clinical Leader. June 15
Google Scholar
Stergiopoulos S (2014) Global forum special section: social listening to enhance clinical research. Drug Information Association (DIA). October 1. Accessible at: http://www.diaglobal.org/en/resources/news#article=c0736ec3-2280-4261-b621-f756d3a4bf6e
Sullivan R, Sarker A, O’Connor K, Goodin A, Karlsrud M, Gonzalez G (2016) Finding potentially unsafe nutritional supplements from user reviews with topic modeling. Pac Symp Biocomput 21:528–539
PubMed PubMed Central Google Scholar
Thompson M (2014) Social media in clinical trials. ASCO p E101. Accessible at: https://www.researchgate.net/publication/262608279_Social_Media_in_Clinical_Trials
Tricco AC, Zarin W, Lillie E, Jeblee S, Warren R, Khan PA, Robson R, Hirst G, Straus SE (2018) Utility of social media and crowd-intelligence data for pharmacovigilance: a scoping review. BMC Med Inform Decis Mak 18:38. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=29898743
Tufts Center for the Study of Drug Development (2014) Industry usage of social and digital media communities in clinical research. White Paper, Boston. Accessible at: http://csdd.tufts.edu/files/uploads/TCSDD_Social_Media_Final.pdf
Wong A, Plasek JM, Montecalvo SP, Zhou L (2018) Natural language processing and its implications for the future of medication safety: a narrative review of recent advances and challenges. Pharmacotherapy 38:822–841. Accessible at: https://www.ncbi.nlm.nih.gov/pubmed/?term=29884988
Wu H, Fang H, Stanhope SJ (2013) Exploiting online discussions to discover unrecognized drug side effects. Methods Inf Med 52:152–159
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank Lorna M Woods at the School of Law, University of Essex, United Kingdom for the review of this appendix.

Author information

Authors and Affiliations

Eshelman School of Pharmacy and Injury Prevention Research Center, University of North Carolina, Chapel Hill, NC, USA
Nabarun Dasgupta
Booz Allen Hamilton, McLean, VA, USA
Carly Winokur & Carrie Pierce

Authors

Nabarun Dasgupta
View author publications
You can also search for this author in PubMed Google Scholar
Carly Winokur
View author publications
You can also search for this author in PubMed Google Scholar
Carrie Pierce
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nabarun Dasgupta .

Editor information

Editors and Affiliations

European Medicines Agency (EMA), Amsterdam, Noord-Holland, The Netherlands
Priya Bahri

Appendices

Appendix 11.1: Legal Aspects Relevant to Internet-Based and Social Media Research

Researchers making use of data from the internet and the social media need to consider legal aspects. The applicable law will vary depending on where relevant actors, e.g. internet and social media users and researchers, are located. The relevant rules are usually those of the country where researchers are based. The European Union (EU) data protection rules, found in General Data Protection Regulation (GDPR) (Regulation (EU) 2016/679), can have “extra-territorial effect”—that is they bind researchers outside the EU when they target those within the EU.

1.1 Types of Law to Consider

In addition to adhereing to legislation on personal data protection, confidentiality and privacy, other legal aspect may be of relevance to the research project. Other legal concerns include contracts (e.g. with data vendors), intellectual property (e.g. onwership of digital content, reproduction and transfer rights, ownership of algortihms developed), sector-specific regulation (e.g. medical product marketers), as well as civil and criminal law (e.g. stalking, bullying, etc.).

1.2 Personal Data Protection Law

Personal data protection law—discussed here in more detail as the most relevant law to consider for internet-based and social media research related to health matters—does in general not prohibit the processing of data, but it lays down conditions for when, on what basis and how the processing of personal data should take place, and it gives enforceable rights to persons who are data subjects. Reference is made here to the EU GDPR, which is recognised by many—consumer organisations notably too—as a global standard. There, personal data are defined as any information relating to an identified or identifiable natural person. Researchers will often work with data that have been pseudo-anonymised by the data provider. That means that the data subject is not identified but there can still be a risk of possibly identifying the person through combining data or using additional information. This is particularly a risk when a patient has a rare disease. Where however information is truly anonymous, i.e. where the information does not relate to an identified or identifiable natural person or to personal data rendered anonymous in such a manner that the data subject is not or no longer identifiable, data protection legislation is not necessary to be applied. Statistics on the number and the length of visits of people on a website, stratified by country, age and sex, are examples of data likely to be anonymous data.

1.3 Grounds for Processing of Personal Data Relevant to Health Research

The EU GDPR specifies the grounds on which personal data may be processed—consent, performance of a contract, performance of a legal obligation, protecting the vital interests of the data subject, necessary for the performance of a task in the public interest, and the legitimate interests of the processor (subject to fundamental interests of the data subject). The EU GDPR also specifies strict rules as to what consent means. The EU GDPR prohibits the processing for special categories of personal data, including ethnic data, genetic and biometric data for the purpose of uniquely identifying a natural person, as well as data concerning health, sex life and sexual orientation. Such data are however allowed to be processed on defined exempting grounds, which include:

explicit consent by the data subject has been given; or
the personal data have manifestly been made public by the data subject; or
the data processing is necessary for the purposes of preventive or occupational medicine and provision of care, whether for an individual or populations; or
processing is necessary for reasons of public health, including ensuring high quality and safety of healthcare, medicinal products or medical devices.

These exemptions can be given for medicinal product risk communication research making use of data from the internet and the social media for understanding, planning, evaluating or improving communication. For example, patients may have identified themselves in comments on websites or publically accessible social media posts, or patients of a closed social media group may have given consent for their data to be used for the purpose of such research, to, e.g. identify their risk perceptions or questions for the safe use of medicines. Where patients publish their information under a pseudonym, researchers should not make attempts to identify that person through combining data, but may attempt to contact them if needed for a specific research project.

1.4 Principles for the Processing of Personal Data

Where the processing of personal data is allowed, the EU GDPR requires the data processing (i.e. collection, recording, organisation, structuring, storage, adaptation or alteration, retrieval, consultation, use, disclosure by transmission, dissemination or otherwise making available, alignment or combination, restriction, erasure or destruction) to be:

lawful, fair and transparent in relation to the data subject (principle of lawfulness, fairness and transparency);
for the specified purpose only (principle of purpose limitation);
adequate, relevant and limited to what is needed (principle of data minimisation);
based on accurate data (principle of accuracy);
performed in a way that permits identification of data subjects for no longer than is necessary (principle of storage limitation);
secure, which includes that the data should be protected against unauthorised or unlawful processing and accidental loss, destruction or damage (principle of integrity and confidentiality).

1.5 Rights of Data Subjects

As mentioned before, data protection law gives enforceable rights to persons who are data subjects towards the data controller. When planning research, the protocol needs to guarantee the following rights of data subjects, either because locally applicable legislation requires this or because it can be considered ethical good research practice:

right of access to the data subject’s data and information on the conditions of data processing;
right to rectification in order to correct or complete data;
right to erasure of data, i.e. the right to be forgotten;
right to restriction of processing;
right to data portability, i.e. to obtain the data in a readable format and to transfer them to another data controller; and
right to object to data processing at any time.

The rights of data subjects—here the users of social media—may be limited in respect of processing for archiving purposes in the public interest, scientific or historical research purposes or statistical purposes. This will be specified by each EU member state (which could mean that the position will be different across member states) and must be subject to safeguards—again these will be specified by each member state.

1.6 Concluding Remarks

Researchers making use of data from the internet and the social media need to consider various types of law applicable in the given jurisdictions of all actors involved. Researchers need to in particular adhere to personal data protection, confidentiality and privacy legislation and are accountable in this respect towards data subjects. In jurisdictions where such legislation does not exist, the principles presented here can be considered good research practice. Research protocols and data processing need to be designed accordingly (Woods 2017). Regularly updated guidance on the EU GPRD is provided by the European Data Protection Board (EDPB) (European Data Protection Board (EDPB) 2018).

Appendix 11.1

European Data Protection Board (EDPB) (2018) [guidance documents published on website]. Brussels: EDPB; Accessible at: https://edpb.europa.eu/edpb_en.
Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation). Official Journal of the European Union; 4 May 2016: L 119/1-88.
Woods LM (2017) Legal considerations relevant to social media and health [lecture]. Pharmacovigilance and social media [training course on 15 October 2017]. Liverpool: 17th Annual Meeting of the International Society of Pharmacovigilance; 15-18 October 2017.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Dasgupta, N., Winokur, C., Pierce, C. (2020). Social Media Research. In: Bahri, P. (eds) Communicating about Risks and Safe Use of Medicines. Adis, Singapore. https://doi.org/10.1007/978-981-15-3013-5_11

Download citation

DOI: https://doi.org/10.1007/978-981-15-3013-5_11
Published: 18 June 2020
Publisher Name: Adis, Singapore
Print ISBN: 978-981-15-3012-8
Online ISBN: 978-981-15-3013-5
eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics