About SensorSift

SensorSift: Balancing Sensor Data Privacy and Utility in Automated Face Understanding. Miro Enev, Jaeyeon Jung, Liefeng Bo, Xiaofeng Ren, Tadayoshi Kohno. Annual Computer Security Applications Conference (ACSAC), 2012
[bibtex] [pdf] [ slides ]

Algorithm

Privacy Partial Least Squares (PPLS)

our paper

Code

here

BSD license


			
				X: NxD1 matrix, N is the number of raw data samples and D1 is their feature dimensionality
				pub_labels: NxD2 matrix, indicating the presence/absence of the D2 public attributes (via binary labels) 
				priv_labels: NxD3 matrix, indicating the presence/absence of the D3 private attributes (via binary labels) 
				ncomp: the maximum sift dimensionality desired (default is 10)
				lambda: tradeoff parameter between privacy/utility used in the objective function (default is 1)

sift_mat

Example Workflow:

train_samples

test_samples

% generate sift using training samples sift_mat = PPLS_sift_gen ( X(train_samples, :), ... pub_labels(train_samples, :), ... priv_labels(train_samples, :) ); % apply sift to test data X_sifted = X(test_samples,:) * sift_mat; % evaluate performance using ensemble of classifiers for method_num = 1:num_methods accuracy(method_num) = ML_method( method_num, X_sifted, ... pub_labels(test_samples, :), ... priv_labels(test_samples, :) );

Measuring Performance // Results

PubLoss - how much classification accuracy is sacrificed on sifted public attribute(s) relative to raw (unsifted) data
PrivLoss - increase in classification performance of private attributes relative to random guessing

ClassAvgAccuracy = ( tP/(tP+fP) + tN/(tN+fN) )/2

Neural Net (Feed Forward)
Random Forest (Decision Trees)
Linear Support Vector Machine
Non-Linear Support Vector Machine
Nearest Neighbors Clustering

our paper

Case Study

http://www.cs.columbia.edu/CAVE/databases/pubfig/

Feedback/Contact

Miro Enev

About SensorSift

Algorithm

Code

Example Workflow:

Measuring Performance // Results

Case Study

Feedback/Contact

Supporting Organizations