Privacy Preserving Data Classification Using Inner Product Encryption

Ligier, Damien; Carpov, Sergiu; Fontaine, Caroline; Sirdey, Renaud

doi:10.1007/978-3-319-59608-2_44

Damien Ligier¹⁹,
Sergiu Carpov¹⁹,
Caroline Fontaine²⁰ &
…
Renaud Sirdey¹⁹

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 198))

Included in the following conference series:

International Conference on Security and Privacy in Communication Systems

1785 Accesses

Abstract

In the context of data outsourcing more and more concerns raise about the privacy of user’s data. One solution is to outsource the data in encrypted form. Meanwhile obtaining a service based on machine learning predictions on user data remains very important in real-life situations.

This paper presents ways to combine machine learning algorithms and IPE in order to perform classification on encrypted data. The proposed privacy preserving classification schemes allow to keep user’s data encrypted but at the same time revealing to a server classification results on this data. We study the performance of such classification schemes and their information leakage.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Improvement on a privacy-preserving outsourced classification protocol over encrypted data

Article 02 May 2020

Outsourced Privacy-Preserving SVM Classifier Model over Encrypted Data in IoT

Privacy-Preserving Machine Learning Using Cryptography

Keywords

1 Introduction

With the generalization of data outsourcing, more and more concerns raise about the privacy and the security of the outsourced data. In this context, machine learning methods have to be conceived and deployed by keeping in mind and assuring the user’s privacy.

In a privacy preserving data classification process, one has to be able to extract knowledge (e.g. in the case of a classifier, deduct the class label of an individual without compromising his private data) by assuring the protection of the sensitive data and, if possible, by hiding data access patterns from which useful properties could be inferred.

In this work we propose a privacy preserving classification algorithm based on functional encryption, in particular the inner product encryption. The performance of the classification algorithm is evaluated on the MNIST database [3].

An inner product encryption scheme is a functional encryption one that enables the evaluation of inner products. In those public encryption schemes vectors are encrypted and each secret key is associated with one vector. For example if \(c_v\) is an encryption of the vector v and \(sk_w\) is a secret key associated with the vector w, when one decrypts \(c_v\) with \(sk_w\) he gets \(\langle v,w\rangle \). Note that secret keys are generated with the master secret key by the authority.

In the use case we focus in the paper, there is an entity called server that has performed a training step of a linear classifier. Thus he has a set of linear classification coefficients and he wants to keep them secret. There are many users that have informations that they want to keep secret as well but they also want to release classification results to the server (for example for obtaining a service). We introduce a third party that both of the server and the users can trust and we call it authority. His goal is in a first time to check that the server’s coefficients are not dishonest and in a second time, to generate an instance of an inner product encryption to perform the classification over the encrypted inputs.

2 Privacy Preserving Classification

A linear classification algorithm makes a decision on the membership of an input data object, based on a linear combination of its features (characteristics). For example, in an image classification algorithm the input object is an image and the features can be image pixels. In a binary classification, the decision is made as a function of a threshold overrun by the dot product between object features and linear classifier coefficients.

In this work we propose a privacy preserving data classification method. Input data is encrypted using an inner product encryption scheme. In the context of ML algorithms, the inner product encryption can be seen as a linear binary classifier. In order to perform a multi-class linear classification, we need to compute several inner products on the same input data. Usually, linear classifiers provide worse results when compared to other more elaborate classification methods. At the same time, only a linear classifier is able to provide a prediction for data encrypted using the inner product encryption.

In order to fill this gap we propose a combined classification method, in which a linear classifier is applied to encrypted data and is followed by a more complex classification algorithm (for example an ensemble method in our case but not limited to). For each piece of input encrypted data, several inner products are computed. These products are then used as input features for a second, more elaborate, classifier. In this way, we are able to perform classification of encrypted data with increased performance in terms of an evaluation metric (e.g. error rate).

3 Performance

We use our implementation (in C++ using FLINT library [2]) of the functional encryption for inner product scheme of Agrawal et al. [1] which provides full security under the DDH assumption. We work in a group \(\mathbb {F}_p^*\) such that p is a safe prime of approximatively 2048-bits.

We try our construction with the MNIST database [3] which is a collection of handwritten digit images (\(28\times 28\) pixels with 256 levels of grey). So in this use case, the classifier has 10 output classes (digits from 0 to 9).

The experiments were performed on a regular laptop computer with an Intel Core i7-4650U CPU and 8 GB of RAM. A plaintext and a secret key have size about one kB and a ciphertext has a size about 200 kB.

The algorithm to generate a secret key associated with a vector and the encryption algorithm take less than a second to be computed. The decryption algorithm takes about 23 s to be computed.

We get \(14\%\) of error rate with a single linear classifier, and \(7\%\) using a second classifier after the first one which does not take significant time to perform.

4 Classification Security

We emphasize that the use case of a such construction can be unsecure even if the cryptographic scheme is secure. An attacker gets a system of diophantine equations. The easiness of solving it and the precision of the description of the inputs increase the ability to compute which vector has been encrypted.

5 Conclusion and Future Work

In this work we have used an instantiation of an inner product encryption in order to perform classification over encrypted data. The learning process is kept secret as only linear classifiers coefficients are public. In the use-case we study, we have a trusted authority, a server computing the classification and the users who encrypt their data. Obtained execution times are reasonably small (a prediction is made in approximatively 69 s without any parallelization) as well as the size of the ciphertexts. We have studied a method to ensure that we cannot find original image from the inner product values. In perspective, we consider to study more deeply the leakage of inner product encryption schemes and to propose methods to lower it.

References

Agrawal, S., Libert, B., Stehle, D.: Fully Secure Functional Encryption for Inner Products, from Standard Assumptions (2015)
Google Scholar
Hart, W., Johansson, F., Pancratz, S.: FLINT: Fast Library for Number Theory, Version 2.4.0 (2013). http://flintlib.org
LeCun, Y., Cortes, C., Burges, C.J.: The MNIST Database. http://yann.lecun.com/exdb/mnist/

Download references

Author information

Authors and Affiliations

CEA LIST, Point Courrier 172, 91191, Gif-sur-Yvette Cedex, France
Damien Ligier, Sergiu Carpov & Renaud Sirdey
CNRS/Lab-STICC and Telecom Bretagne and UEB, Technopôle Brest-Iroise, France
Caroline Fontaine

Authors

Damien Ligier
View author publications
You can also search for this author in PubMed Google Scholar
Sergiu Carpov
View author publications
You can also search for this author in PubMed Google Scholar
Caroline Fontaine
View author publications
You can also search for this author in PubMed Google Scholar
Renaud Sirdey
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Damien Ligier .

Editor information

Editors and Affiliations

Singapore Management University, Singapore, Singapore
Robert Deng
Jinan University, Guangzhou, Guangdong, China
Jian Weng
University at Buffalo, Buffalo, New York, USA
Kui Ren
SRI International, Menlo Park, California, USA
Vinod Yegneswaran

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ligier, D., Carpov, S., Fontaine, C., Sirdey, R. (2017). Privacy Preserving Data Classification Using Inner Product Encryption. In: Deng, R., Weng, J., Ren, K., Yegneswaran, V. (eds) Security and Privacy in Communication Networks. SecureComm 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 198. Springer, Cham. https://doi.org/10.1007/978-3-319-59608-2_44

Download citation

DOI: https://doi.org/10.1007/978-3-319-59608-2_44
Published: 14 June 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59607-5
Online ISBN: 978-3-319-59608-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Privacy Preserving Data Classification Using Inner Product Encryption

Abstract

Similar content being viewed by others

Improvement on a privacy-preserving outsourced classification protocol over encrypted data

Outsourced Privacy-Preserving SVM Classifier Model over Encrypted Data in IoT

Privacy-Preserving Machine Learning Using Cryptography

Keywords

1 Introduction

2 Privacy Preserving Classification

3 Performance

4 Classification Security

5 Conclusion and Future Work

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Privacy Preserving Data Classification Using Inner Product Encryption

Abstract

Similar content being viewed by others

Improvement on a privacy-preserving outsourced classification protocol over encrypted data

Outsourced Privacy-Preserving SVM Classifier Model over Encrypted Data in IoT

Privacy-Preserving Machine Learning Using Cryptography

Keywords

1 Introduction

2 Privacy Preserving Classification

3 Performance

4 Classification Security

5 Conclusion and Future Work

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation