Abstract
Personal data has been described as the “the new oil of the Internet.” The global data monetization market is projected to increase to USD 6.1bn by 2025, and the success of giants like Facebook or Google speaks for itself. Almost all companies create, store, share and/or use personal data i.e. information from or about individuals. While the current assumption is that data subjects voluntarily share their data in exchange for a “free” service, the awareness of the value of personal data and data sovereignty is growing amongst consumers, businesses, and regulators alike. However, there is currently no consensus on which factors influence the value of personal data and how personal data should be priced regarding self-determination and data sovereignty. With this narrative review, we answer the following research question: Which factors influence the pricing of personal data? We show that research on the subject is diverse and that there is no consensus on the optimal pricing mechanism. We identify individual privacy and risk preferences, informational self-determination, sensitivity of data and data volume and inferability as most prevalent influence factors. We underline the need to establish ways for data owners to exercise data sovereignty and informed consent about data usage.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Personal data has been described as the “the new oil of the Internet and the new currency for the digital world” [1] and is created, stored, shared, and used by almost all companies. Market studies expect the overall global data monetization market to increase from USD 2.3bn in 2020 to USD 6.1bn in 2025 [2]. A sharp increase of data creation and low cost of storage are main drivers. Companies such as Facebook and Google have created massive data environments and developed their business models around the voluntary sharing of personal data by individuals in exchange for using their services [3, 4].
The OECD defines personal data as “any information relating to an identified or identifiable individual (data subject)” [5]. It shows traits of a public good rather than a commercial good as it is difficult to exclude other parties from using it in an effective way and usage by one party does not prevent other parties from using it as well [3, 5, 6]. Data owners on the other hand have a need for privacy [7]. With the collection of personal data happening continuously and through a multitude of interconnected devices and modalities, there is an abundance of data and it is challenging to introduce effective control mechanisms on who transfers and uses the data [3]. Additionally, at least in Europe, privacy and the right for information to remain private is of high value and are protected by complex laws and regulations such as the General Data Protection Regulation [8] or new regulations aiming to increase data sovereignty [9, 10]. The current business assumption is that users are willing to share their data in exchange for the (free) service they receive. However, studies such as Schwartz [11], have shown that users are often not fully aware that they are paying for the service with their data and that their data may be collected and sold to data brokers who in turn sell data bundles to different data users along a data value chain. Other studies such as Sindermann et al. [12] investigate whether this influences customers willingness to pay for Social Media and show that only a minority supports a monetary payment model. As consumers learn more about how their data is used, this duality will eventually have consequences for companies’ business models [3, 12]. Thus, it will be necessary for companies to attach a monetary value to the personal data shared with them and allow data producers/owners to actively consent to the use of their personal data in return for monetary or non-monetary compensation and develop a data market [6]. This can be facilitated through a data broker who transfers data from data owner to data user and monetary or non-monetary compensation from data user to data owner. The price of data depends highly on a diverse range of factors, some inherent in the data and some based on data context and the data subject. Combined with the need to account for the value attributed to privacy by data subjects, this adds an extra layer of complexity to pricing models as the value of the privacy of the data owner may exceed the value companies are willing to pay. The question of how to price personal data remains open despite multiple calls for research concerning this topic [6, 13]. Literature reviews in this area focus on pricing models [14, 15] or on the value of privacy [7] and have been published some 5 years ago. A more current review was undertaken by Wdowin and Deepeeven [16] as part of a research project and describes some factors related to the value of data. However, there is no detailed description of the search process and which criteria were chosen to define papers included in the review. There is currently no consensus in the literature on which factors influence the value of personal data. Given the omnipresence of personal data across divergent research fields and the regulatory efforts focusing on data sovereignty, a narrative synthesis of current findings is needed to provide a sound basis for research, model development and decision makers to answer the research question: Which factors influence the pricing of personal data?
We contribute to the discussion on pricing personal data in the following respects: (1) we identify relevant influence factors for pricing personal data and (2) provide a structure and categories for further qualitative and quantitative analysis of the subject.
2 Research Approach and Sample
2.1 Research Approach
We perform a narrative review [17] to conceptually integrate different fields of research on influence factors of the price of personal data. We use the following keywords: pricing of data, data markets, value of data, data valuation, data monetization, economics of personal data, pricing personal data and worth of privacy. We search EBSCO Business Source Complete as well as eLib, a specific directory which encompasses Web of Science, Scopus, Tema, Springer Link, Science Direct and other open access directories. We search for the keywords in TITLE/ABSTRACT of publications between 2001 and 2021. We find 1,535 papers in total. We screen the title and abstract based on (a) the topic of pricing privacy, pricing personal data or personal data markets and (b) peer review. We exclude papers that are out of scope, e.g. focusing on company data, pricing goods or services using personal data or using personal data for price discrimination. We exclude journal editorials or short summaries of conference papers. Overall, we exclude 1383 papers. We initially include 152 papers in our sample. We perform forward/backward referencing and include 8 additional in-scope papers respectively grey literature. Following the PRISMA [18] recommendations, we then perform a full text screening and exclude an additional 107 papers (1 duplicate, 106 papers out of scope based on the above mentioned criteria) from our final sample. Our overall sample consists of 54 papers. We extract relevant influence factors using an inductive content analysis [19] to identify the major themes in the literature. Two researchers perform the initial open coding independently and then perform multiple rounds of clustering and categorization to determine the main influence factors on the price of personal data. We derive a description for each factor and analyze the frequency of occurrence within our sample. We count each factor once per paper to determine the frequency.
2.2 Sample Description
The papers in the sample are published between 2005 and 2021 (see Fig. 1). The topic has become more frequently analyzed in the literature since 2014, highlighting its importance for current political and public discussion. There is a large spread of publications amongst different scientific outlets. About 60% of the analyzed papers were published in different scientific journals, while 30% were published in conference proceedings and the remaining 9% were published as grey literature reports. The topic of pricing personal data lies at an interface between business, economics, and information technology research. This is reflected in the publication outlets. The journals differed widely within the sample, with Electronic Markets being the most used outlet (3 publications), followed by Computer Law & Security Review (2 publications) and the IEEE Internet of Things Journal (2 publications). The remaining journals published one paper on the subject.
3 Results
3.1 General Results
Overall, the analyzed papers portrait a diverse stream of research. We classified them into eight categories: case study; commentary; data market model; data market model, technicalFootnote 1; data pricing model; experiment; literature review and report. The results are provided in Fig. 2. The appendix provides an overview of the core topics covered.
The two most prevalent categories are theoretical data market models (14 papers +5 papers on algorithm-based data markets) and experiments (14 papers). Data market models focus on the development of different theoretical scenarios for establishing a personal data market. Most of the papers are theoretical in nature and deal with specific game-theoretical [20, 21] or auction-based [22] approaches for market creation. Some underline their findings with simulations based on real life data sets [23, 24]. Algorithm-based models show different ways to technologically facilitate data market settings and highlight the difficulties incorporating real-life influence factors such as anonymization and noise as well as profit-maximization calculations into an algorithm [25,26,27]. The experiments focus mainly on (a) eliciting willingness to pay for e.g. keeping personal data such as social media data [3, 28, 29] or preferences private [30] and (b) willingness to accept money for e.g. social media data [31] or location data [32, 33] from the participantsFootnote 2. A notable finding is a gap between WPT and WTA-values with the average WTA-value being significantly higher than the WTP to protects ones privacy [35,36,37]. We further find several papers on what we call data pricing models, which focus on different pricing methods and ways to elicit prices for personal data. Literature reviews in this area focus on pricing models [14, 15] or on the value of privacy [7] and have been published some 5 years ago. A more current review was undertaken by Wdowin and Deepeeven [16] as part of a research project and describes some factors related to the value of data. However, there is no detailed description of the search process and which criteria were chosen to define papers included in the review. The few case studies found in our research focus on the complexity of choice and value estimation for personal data.
3.2 Influence Factors on Pricing Personal Data
We find a multitude of influence factors on the price of personal data. We categorize them into four overarching categories: (1) data properties, (2) data context, (3) perceptions of data owner and (4) perceptions of data user. A detailed overview of the subsumed influence factors, their description and frequency of occurrence across the sample can be found in the respective Tables 1, 2, 3 and 4. “Data properties” refer to the inherent properties of a personal data dataset. This category represents general properties of data that are not limited to personal data, but apply to all types of data. We included this category as it lists important factors for pricing data. Within this category, sensitivity of data was found most frequently in the literature, followed by the data content, volume and coverage of data, quality data and level of data aggregation. The remaining factors were mentioned less frequently.
“Data context” refers to the environment and background of the personal data dataset. Within this category, data volume and inferability were found most often in the literature, often referring to arbitrage situations. Further, cost and length of data storage and cost of data gathering were discussed in the literature, while the remaining factors occurred less often.
The category “perceptions of data owner” focuses on the preferences and views of the data owner and the related willingness to consider selling personal data in general. Individual privacy and risk preferences and informational self-determination were found most frequently in the literature and are the most frequent influence factors mentioned overall.
The category “perceptions of data user” focuses on the preferences and views of the data user. Within this category, the trust in the data market and the individual utility of the purchased data for the data user were most frequently mentioned.
Overall, individual preferences on risk and privacy, informational self-determination, sensitivity of data and data volume and inferability were identified as the most mentioned influence factors in the literature.
4 Synthesis of Findings
Our main finding is that while there already is a body of literature concerned with pricing personal data, the research base is heterogenous. Research is spread out amongst fields of science, journals, and research streams. It seems that there is not yet a consensus on key questions and a lack of overarching frameworks on the subject. This diverse research base, however, highlights the importance of the topic.
We have identified two main research aspects: (1) theoretical and what we call technical data market models and (2) experiments to elicit willingness to accept money/willingness to pay for privacy in exchange for data. Market models are focused on theoretical aspects of pricing personal data and are utilizing game theoretical and profit maximization for developing narrow scope data market models. Technical data market models provide algorithms for different pricing mechanisms such as query-based pricing. While most papers try to validate their models with real life data sets, no paper provides insights from a real-life application of their model, which leads us to assume that data access is limited and that companies already operating data markets are unwilling to disclose their models as they are presumably core to their respective business models. Experiments focus on very narrow experimental settings, often with students as their subjects and are mainly focused on social media data (likes, shares, and general personal information) and a few using location data. To our knowledge, there are no experiments focusing on pricing more sensitive personal data such as electronic health records and very few studies with demographically diverse participants. Particularly when considering more sensitive personal data it would be interesting to gather information from a broad demographic including diverse age groups, educational backgrounds and levels of digital aptitude, as experiments show (1) irrationality concerning ones data (e.g. [35]) and (2) for parts of the participants a plain refusal to partake in data pricing (e.g. [38]). Additionally, there are some studies on data pricing models and very few case studies and literature reviews on the subject. While each of the streams has created a significant insight into the topic, it would be most useful to combine them to merge the aspects of (irrational) decision making of data subjects with more economical and algorithm-based thinking into a more practical data market model/algorithm, as has been attempted by e.g. Biswas et al. [39].
Looking at the factors that influence the price of personal data, we develop four categories of influence factors: (1) data properties, (2) data context, (3) perceptions of data owner and (4) perceptions of data user. All categories include several factors which we also rank by occurrence in the papers of the literature review. The most prevalent factors are individual privacy and risk preferences and informational self-determination. This is not surprising, as those factors are what differentiates personal data from e.g. company data and should thus have a strong impact on the price of personal data. Factors relating to trust, while still important, occur much less frequently, despite seemingly being a factor in increasing market participation by data owners. Market models thus may exclude an important factor in their setups if they exclude trust-creating mechanisms, Sensitivity, data content, and data volume and inferability are further factors that frequently appear in the literature. Those factors are rather generic and certainly applicable not only to the price of personal, but also data in general. Pricing also seems to be dependent on cost and length of data storage and cost of data gathering, which is not surprising since the utility derived from the data needs to outweigh the cost and since most data consumers operate within a restricted budget. Other factors such as culture, ownership level or origin of data appear infrequently and seem to be of less importance. Relating to culture, this is interesting since there are significant differences in how cultures approach the topic of privacy [40]. The most frequently used pricing method is market pricing. Other pricing concepts, such as option, per query or auction models appear much less frequently. This may be due to their perceived complexity in setup and operation.
5 Conclusion and Limitations
We conduct a narrative literature review and show a diverse array of research streams and questions that are related to the topic of pricing personal data and emphasize its importance. Due to the broad frame an aggregation of the results can only be done in a limited way. We believe that the resulting influence factors of our work are a valuable contribution to the scientific discussion and model development in future research. Our qualitative analysis of influence factors based on the underlying literature shows that the influence factors of the price of personal data can be classified into four categories. We provide a first description and ranking of the factors based on our literature review. These factors are only a starting point to researching the influence factors for pricing personal data and need further verification through empirical analyses such as a quantitative study or structured equation modelling. We aim to validate these factors empirically in future projects and to develop an operational model to quantify a price for personal data.
Notes
- 1.
The category “Data Market Model, Technical” refers to papers focusing on algorithms and technical solutions for data markets.
- 2.
Willingness to pay (WTP) refers to the maximum amount of money an individual would be willing to pay to secure a specific change, while willingness to accept (WTA) refers to the minimum amount a person would be willing to accept to forego said change [34].
References
Kuneva, M.: Keynote speech. Roundtable on online data collection, targeting and profiling, 31 March 2009
MarketsandMarkets: Data Monetization Market (2020). https://www.marketsandmarkets.com/Market-Reports/data-monetization-market-127405959.html. Accessed 23 Sep 2020
Spiekermann, S., Korunovska, J.: Towards a value theory for personal data. J. Inf. Technol. (2017). https://doi.org/10.1057/jit.2016.4
Feijóo, C., Gómez-Barroso, J.L., Voigt, P.: Exploring the economic value of personal information from firms’ financial statements. Int. J. Inf. Manag. (2014). https://doi.org/10.1016/j.ijinfomgt.2013.12.005
OECD: Exploring the Economics of Personal Data: A Survey of Methodologies for Measuring Monetary Value (2013). https://doi.org/10.1787/5k486qtxldmq-en
Sidgman, J., Crompton, M.: Valuing personal data to foster privacy: a thought experiment and opportunities for research. J. Inf. Syst. (2016). https://doi.org/10.2308/isys-51429
Acquisti, A., Taylor, C., Wagman, L.: The economics of privacy. J. Econ. Lite. (2016). https://doi.org/10.1257/jel.54.2.442
European Commission: EU data protection rules (2020). https://ec.europa.eu/info/law/law-topic/data-protection/eu-data-protection-rules_en. Accessed 27 July 2020
Madiega, T.: Digital sovereignty for Europe. European Union (2020)
European Commission: A European Strategy for data (2021). https://digital-strategy.ec.europa.eu/en/policies/strategy-data. Accessed 13 Jan 2022
Schwartz, P.M.: Property, privacy, and personal data. Harv. Law Rev. (2004). https://doi.org/10.2307/4093335
Sindermann, C., Kuss, D.J., Throuvala, M.A., Griffiths, M.D., Montag, C.: Should we pay for our social media/messenger applications? Preliminary data on the acceptance of an alternative to the current prevailing data business model. Front. Psychol. (2020). https://doi.org/10.3389/fpsyg.2020.01415
Currie, C.S.M., Dokka, T., Harvey, J., Strauss, A.K.: Future research directions in demand management. J. Revenue Pricing Manag. (2018). https://doi.org/10.1057/s41272-018-0139-z
Malgieri, G., Custers, B.: Pricing privacy–the right to know the value of your personal data. Comput. Law Secur. Rev. (2018). https://doi.org/10.1016/j.clsr.2017.08.006
Fricker, S.A., Maksimov, Y.V.: Pricing of data products in data marketplaces. In: Ojala, A., Holmström Olsson, H., Werder, K. (eds.) ICSOB 2017. LNBIP, vol. 304, pp. 49–66. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69191-6_4
Wdowin, J., Diepeveen, S.: The value of data: literature review, Cambrige, UK (2020)
Cook, D.J., Mulrow, C.D., Haynes, R.B.: Systematic reviews: synthesis of best evidence for clinical decisions. Ann. Intern. Med. (1997). https://doi.org/10.7326/0003-4819-126-5-199703010-00006
Moher, D., Liberati, A., Tetzlaff, J., Altman, D.G.: Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ (Clin. Res. Ed.) (2009). https://doi.org/10.1136/bmj.b2535
Elo, S., Kyngäs, H.: The qualitative content analysis process. J. Adv. Nurs. 62, 107–115 (2008)
Niyato, D., Alsheikh, M.A., Wang, P., Kim, D.I., Han, Z.: Market model and optimal pricing scheme of big data and Internet of Things (IoT). In: 2016 IEEE International Conference on Communications (ICC). 2016 IEEE International Conference on Communications (ICC), pp. 1–6 (2016). https://doi.org/10.1109/ICC.2016.7510922
Wang, W., Ying, L., Zhang, J.: The value of privacy: strategic data subjects, incentive mechanisms and fundamental limits. In: Proceedings of the SIGMETRICS 2016: Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science (2016)
Jiao, Y., Wang, P., Niyato, D., Abu Alsheikh, M., Feng, S.: Profit maximization auction and data management in big data markets. In: 2017 IEEE Wireless Communications and Networking Conference (WCNC). 2017 IEEE Wireless Communications and Networking Conference (WCNC), pp. 1–6 (2017). https://doi.org/10.1109/WCNC.2017.7925760
Bataineh, A.S., Mizouni, R., Barachi, M.E., Bentahar, J.: Monetizing personal data: a two-sided market approach. Procedia Comput. Sci. (2016). https://doi.org/10.1016/j.procs.2016.04.211
Xu, L., Jiang, C., Qian, Y., Zhao, Y., Li, J., Ren, Y.: Dynamic privacy pricing: a multi-armed bandit approach with time-variant rewards. IEEE Trans. Inf. Forensics Secur. (2017). https://doi.org/10.1109/TIFS.2016.2611487
Nget, R., Cao, Y., Yoshikawa, M.: How to balance privacy and money through pricing mechanism in personal data market (2017)
Yang, J., Xing, C.: Personal data market optimization pricing model based on privacy level. Information 10, 123 (2019)
Di Vimercati, S.D.C., Foresti, S., Livraga, G., Samarati, P.: Toward owners’ control in digital data markets. IEEE Syst. J. 1–8 (2020). https://doi.org/10.1109/JSYST.2020.2970456
Mahmoodi, J., et al.: Internet users’ valuation of enhanced data protection on social media: which aspects of privacy are worth the most? Front. Psychol. (2018). https://doi.org/10.3389/fpsyg.2018.01516
Bauer, M., et al.: Using crash outcome data evaluation system (CODES) to examine injury in front vs. rear-seated infants and children involved in a motor vehicle crash in New York State. Inj. Epidemiol. 8, 1–10 (2021)
Frik, A., Gaudeul, A.: A measure of the implicit value of privacy under risk. J. Consum. Mark. (2020). https://doi.org/10.1108/JCM-06-2019-3286
Benndorf, V., Normann, H.-T.: The willingness to sell personal data. DICE discussion paper, vol. 143. DICE, Düsseldorf (2014)
Staiano, J., Oliver, N., Lepri, B., de Oliveira, R., Caraviello, M., Sebe, N.: Money walks: a human-centric study on the economics of personal mobile data. In: Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 13 September 2014, pp. 583–594 (2014). https://doi.org/10.1145/2632048.2632074
Danezis, G., Lewis, S., Anderson, R.J.: How much is location privacy worth? In: WEIS
Hanemann, W.M.: Willingness to pay and willingness to accept: how much can they differ? Am. Econ. Rev. 81, 635–647 (1991)
Acquisti, A., John, L.K., Loewenstein, G.: What is privacy worth? J. Leg. Stud. (2013). https://doi.org/10.1086/671754
Winegar, A.G., Sunstein, C.R.: How much is data privacy worth? A preliminary investigation. SSRN J. (2019). https://doi.org/10.2139/ssrn.3413277
Grossklags, J., Acquisti, A.: When 25 cents is too much: an experiment on willingness-to-sell and willingness-to- protect personal information. In: Proceedings of the Workshop on the Economics of Information Security Proceedings 2007 (2007)
Nielsen, A.: Measuring lay reactions to personal data markets. In: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES 2021) (2021). https://doi.org/10.1145/3461702.3462582
Biswas, S., Jung, K., Palamidessi, C.: An incentive mechanism for trading personal data in data markets. In: Cerone, A., Ölveczky, P.C. (eds.) ICTAC 2021. LNCS, vol. 12819, pp. 197–213. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85315-0_12
Cockcroft, S., Rekker, S.: The relationship between culture and information privacy policy. Electron. Mark. 26(1), 55–72 (2015). https://doi.org/10.1007/s12525-015-0195-9
Hacker, P., Petkova, B.: Reining in the big promise of big data: transparency, inequality, and new regulatory frontiers. Northwestern J. Technol. Intellect. Prop. 15(1), 6–42 (2017)
Holt, T.J., Smirnova, O., Chua, Y.T.: Exploring and estimating the revenues and profits of participants in stolen data markets. Deviant Behav. (2016). https://doi.org/10.1080/01639625.2015.1026766
Jentzsch, N.: Monetarisierung der privatsphäre: welchen preis haben persönliche daten? DIW Wochenbericht (2014)
Perera, C., Ranjan, R., Wang, L.: End-to-end privacy for open big data markets. IEEE Cloud Comput. 2(4), 44–53 (2015). https://doi.org/10.1109/MCC.2015.78
Raskar, R., Vepakomma, P., Swedish, T., Sharan, A.: Data markets to support AI for all: pricing, valuation and governance (2019)
Choi, J.P., Jeon, D.-S., Kim, B.-C.: Privacy and personal data collection with information externalities. J. Public Econ. (2019). https://doi.org/10.1016/j.jpubeco.2019.02.001
Dimakopoulos, P.D., Sudaric, S.: Privacy and platform competition. Int. J. Ind. Organ. (2018). https://doi.org/10.1016/j.ijindorg.2018.01.003
Gkatzelis, V., Aperjis, C., Huberman, B.A.: Pricing private data. Electron. Mark. 25(2), 109–123 (2015). https://doi.org/10.1007/s12525-015-0188-8
Li, X.-B., Raghunathan, S.: Pricing and disseminating customer data with privacy awareness. Decis. Support Syst. (2014). https://doi.org/10.1016/j.dss.2013.10.006
Oh, H., Park, S., Lee, G.M., Choi, J.K., Noh, S.: Competitive data trading model with privacy valuation for multiple stakeholders in IoT data markets. IEEE Internet Things J. 7(4), 3623–3639 (2020). https://doi.org/10.1109/JIOT.2020.2973662
Radhakrishnan, A., Das, S.: Data markets for smart grids: an introduction. In: 2018 IEEE Innovative Smart Grid Technologies - Asia (ISGT Asia). 2018 IEEE Innovative Smart Grid Technologies–Asia (ISGT Asia), pp. 1010–1015 (2018). https://doi.org/10.1109/ISGT-Asia.2018.8467818
Spiekermann, S., Acquisti, A., Böhme, R., Hui, K.-L.: The challenges of personal data markets and privacy. Electron. Mark. 25(2), 161–167 (2015). https://doi.org/10.1007/s12525-015-0191-0
Spiekermann, S., Novotny, A.: A vision for global privacy bridges: technical and legal measures for international data markets Comput. Law Secur. Rev. (2015). https://doi.org/10.1016/j.clsr.2015.01.009
Tian, L., Li, J., Li, W., Ramesh, B., Cai, Z.: Optimal contract-based mechanisms for online data trading markets. IEEE Internet Things J. 6(5), 7800–7810 (2019). https://doi.org/10.1109/JIOT.2019.2902528
Balazinska, M., Howe, B., Suciu, D.: Data markets in the cloud: an opportunity for the database community. Proc. VLDB Endow. 5(12), 1962–1965 (2011)
Li, C., Li, D.Y., Miklau, G., Suciu, D.: A theory of pricing private data. ACM Trans. Database Syst. (2014). https://doi.org/10.1145/2691190.2691191
Mehta, S., Dawande, M., Janakiraman, G., Mookerjee, V.: How to sell a dataset? Pricing policies for data monetization. SSRN J. (2019). https://doi.org/10.2139/ssrn.3333296
Niu, C., Zheng, Z., Wu, F., Tang, S., Gao, X., Chen, G.: Online pricing with reserve price constraint for personal data markets (2019)
Niu, C., Zheng, Z., Wu, F., Tang, S., Gao, X., Chen, G.: ERATO: trading noisy aggregate statistics over private correlated data. IEEE Trans. Knowl. Data Eng. (2019). https://doi.org/10.1109/TKDE.2019.2934100
Shen, Y., Guo, B., Duan, X., Dong, X., Zhang, H.: A pricing model for big personal data. Tsinghua Sci. Technol. 21(5), 482–490 (2016). https://doi.org/10.1109/TST.2016.7590317
Shen, Y., et al.: Pricing personal data based on data provenance. Appl. Sci. Basel 9(12) (2019). https://doi.org/10.3390/app9163388
Zhang, Z., Song, W., Shen, Y.: A reasonable data pricing mechanism for personal data transactions with privacy concern. In: U, L.H., Spaniol, M., Sakurai, Y., Chen, J. (eds.) APWeb-WAIM 2021. LNCS, vol. 12859, pp. 64–71. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85899-5_5
Hann, I.-H., Hui, K.-L., Lee, S.-Y.T., Png, I.P.L.: Overcoming online information privacy concerns: an information-processing theory approach. J. Manag. Inf. Syst. (2007). https://doi.org/10.2753/MIS0742-1222240202
Lim, S., Woo, J., Lee, J., Huh, S.-Y.: Consumer valuation of personal information in the age of big data. J. Assoc. Inf. Sci. Technol. (2018). https://doi.org/10.1002/asi.23915
Schomakers, E.-M., Lidynia, C., Ziefle, M.: All of me? Users’ preferences for privacy-preserving data markets and the importance of anonymity. Electron. Mark. 30(3), 649–665 (2020). https://doi.org/10.1007/s12525-020-00404-9
Coyle, D., Diepeveen, S., Wdowin, J., Kay, L., Tennison, J.: The value of data: policy implications, Cambrige, UK (Februaray 2020). https://www.nuffieldfoundation.org/project/valuing-data-foundations-for-data-policy
Rose, J., Rehse, O., Robe, B.: The value of our digital identity (2012). https://www.bcg.com/publications/2012/digital-economy-consumer-insight-value-of-our-digital-identity.aspx. Accessed 29 Apr 2020
Acknowledgements
This research was partially funded by the German Federal Ministry of Education and Research (BMBF) within the scope of the research project DaWID (Platform for value determination and self-determined data release; funding reference number: 16SV8383).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix - Overview of Included Papers
Appendix - Overview of Included Papers
# | Paper | Classification | Key issue explored |
---|---|---|---|
13 | Feijóo et al. [4] | Case Study | Case study on estimation of personal data |
18 | Hacker & Petkova [41] | Case Study | Case study on active choice of using data as currency |
20 | Holt et al. [42] | Case Study | Case study on value of data in stolen data markets |
21 | Jentzsch [43] | Commentary | Commentary on the difficulties of valuing personal data |
37 | Perera et al. [44] | Commentary | Commentary on the challenges of privacy protection in IoT |
39 | Raskar et al. [45] | Commentary | Commentary on challenges of data pricing and data markets |
44 | Sidgman & Crompton [6] | Commentary | Theoretical commentary on current challenges and research opportunities |
4 | Bataineh et al. [23] | Data Market Model | Two-sided data market model with experimental comparison based on real life data set |
8 | Choi et al. [46] | Data Market Model | Theoretical data market model with consumer consent for data collection |
12 | Dimakopoulos & Sudaric [47] | Data Market Model | Theoretical data market with platform competition |
16 | Gkatzelis et al. [48] | Data Market Model | Theoretical data market model for unbiased data samples |
22 | Jiao et al. [22] | Data Market Model | Data market model with Bayesian profit maximization auction |
23 | Lei Xu et al. [24] | Data Market Model | Theoretical data market model with privacy and learning policies in a multi-armed bandit model |
24 | Li & Raghunathan [49] | Data Market Model | Data market model when purpose of data use is unclear |
34 | Niyato et al. [20] | Data Market Model | Theoretical data market model for optimal big data pricing with simulation |
36 | Oh et al. [50] | Data Market Model | Theoretical data market model between broker and service provider under profit maximization and respect for privacy protection and valuation |
38 | Radhakrishnan & Das [51] | Data Market Model | Theoretical data market model for smart grid data |
45 | Spiekermann et al. [52] | Data Market Model | Theoretical data market model focusing on challenges of personal data markets |
46 | Spiekermann & Novotny [53] | Data Market Model | Theoretical data market model focusing on operating principles |
49 | Tian et al. [54] | Data Market Model | Theoretical data market model based on optimal contract-based mechanisms |
50 | Wang et al. [21] | Data Market Model | Theoretical data market model with data owners exhibiting informed consent in a Nash equilibrium with a non-trusted data collector |
3 | Balazinska et al. [55] | Data Market Model, Technical | Technical data market model with query-based pricing |
11 | De Capitani Di Vimercati et al. [27] | Data Market Model, Technical | Technical data market model focusing on including privacy issues in a cloud setting |
25 | Li et al. [56] | Data Market Model, Technical | Technical data market model with query-based pricing |
30 | Nget et al.[25] | Data Market Model, Technical | Technical market model and simulation of query-based pricing mechanism |
53 | Yang & Xing [26] | Data Market Model, Technical | Algorithm for personal data pricing with multi-level privacy division |
7 | Biswas et al. [39] | Data Pricing Model | Theoretical model to induce data provider to accurately report privacy price within differential privacy |
29 | Mehta et al. [57] | Data Pricing Model | Theoretical data pricing model with price-quantity schedule and approximation scheme for data seller |
32 | Niu et al.[58] | Data Pricing Model | Technical pricing model for trading aggregate statistics over private correlated data |
33 | Niu et al. [59] | Data Pricing Model | Algorithm for personal data pricing with reverse price constraint |
42 | Shen et al. [60] | Data Pricing Model | Data pricing model for Big Personal Data based on tuple granularity |
43 | Shen et al. [61] | Data Pricing Model | Data pricing model based on data provenance |
54 | Zhang et al. [62] | Data Pricing Model | Data pricing with privacy concern introducing privacy cost concept |
1 | Acquisti et al. [35] | Experiment | Experiment on WTP/WTA money for private data and privacy |
5 | Bauer et al. [29] | Experiment | Survey-based experiment on value of Facebook user information from user perspective |
6 | Benndorf & Normann [31] | Experiment | Experiment to extraxt WTA money with take-it-or-leave-it offers |
10 | Danezis et al. [33] | Experiment | Experiment on WTA money for location tracking |
15 | Frik & Gaudeul [30] | Experiment | Method and experimental validation for elicitating the implicit value of privacy under risk |
17 | Grossklags & Acquisti [37] | Experiment | Experiment on WTP/WTA money for private data and privacy |
19 | Hann et al. [63] | Experiment | Conjoint analysis to estimate individual’s utility of mitigate privacy concerns |
26 | Lim et al. [64] | Experiment | Discrete choice experiment to estimate value of types of personal information leakage |
27 | Mahmoodi et al. [28] | Experiment | Experiment quantifying WTP for different levels of privacy on social media platforms & analysis of psychological factors (ongoing) |
31 | Nielsen [38] | Experiment | Experiment showing lay peoples reaction to data markets is diverse and shows unwillingness to participate in data market |
41 | Schomakers et al. [65] | Experiment | Mixed-method study on data sharing and privacy preferences in data markets |
47 | Spiekermann & Korunovska [3] | Experiment | Experiment on WTP/WTA money for private data and privacy |
48 | Staiano et al. [32] | Experiment | Living lab experiment focusing on pricing and correlated behaviour patterns |
52 | Winegar & Sunstein [36] | Experiment | Survey-based experiment on the disparity of WTP and WTA money to give up privacy |
2 | Acquisti et al. [7] | Literature Review | Literature review on privacy |
14 | Fricker & Maksimov [15] | Literature Review | Literature review on pricing of data products |
28 | Malgieri & Custers [14] | Literature Review | Literature review on pricing of personal data |
51 | Wdowin & Diepeveen[16] | Literature Review | Literature review on value of personal data |
9 | Coyle et al. [66] | Report | Policy Recommendation for capturing data value |
35 | OECD [5] | Report | Overarching report on pricing personal data |
40 | Rose et al. [67] | Report | Report on value of digital identity based on EU survey |
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Busch-Casler, J., Radic, M. (2022). Personal Data Markets: A Narrative Review on Influence Factors of the Price of Personal Data. In: Guizzardi, R., Ralyté, J., Franch, X. (eds) Research Challenges in Information Science. RCIS 2022. Lecture Notes in Business Information Processing, vol 446. Springer, Cham. https://doi.org/10.1007/978-3-031-05760-1_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-05760-1_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05759-5
Online ISBN: 978-3-031-05760-1
eBook Packages: Computer ScienceComputer Science (R0)