15
share sensi)ve data with confidence Latanya Sweeney [email protected] latanyasweeney.org

DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney

Embed Size (px)

Citation preview

 share  sensi)ve  data  with  confidence  

Latanya  Sweeney  [email protected]    latanyasweeney.org  

Gender, Race Ethnicity Micro-ethnicity (sub groups) Median Household income** Attended elite boarding school? Hometown Foreign Country? Hometown State* Primary academic major* Secondary academic major* Dorm Freshman Year* Dorm Neighborhood, 4 zones Network linkages of roommates

On Facebook? Facebook: Political view Facebook: Interested In Facebook: number of friends (school) Facebook: total number of friends Facebook: number of Picture Friends Facebook: Favorite Movies* Facebook: Favorite Music* Facebook: Favorite Books* Facebook: linkages of friends

Jason’s  Dataset  

Joe’s  Dataset  

Description* Date of visit (month, day and year)

Transaction# Unique patient identifier* Patient 5-digit ZIP code* Month, day and Year of Birth* Gender

Unique Provider ID Provider 5-digit ZIP code* ICD9 diagnosis code 1* ICD9 diagnosis code 2* ICD9 diagnosis code 3* ICD9 diagnosis code 4* ICD9 diagnosis code 5* ICD9 diagnosis code 6

Francesca’s Combined Data  

Chevron  Refinery  

Liberty/  Atchison  Villages  Interstate  

Levin-­‐Richmond  Terminal  Corp  (marine)  

General  Chemical  Corp  Rail  yard  

Julia’s  Interviews  

N  l

All  these  people  need  to  store  data    in  a  manner  that  respects    

legal  and  ethical  commitments.    

#1    Readme  Files  

#2    Uniform  storage  and  handling  

How    does  a  

researcher  comply    

with  more  than  2000  privacy  laws?  

Legal  Experts  Codify  Jurisprudence    

into  the  six  levels.  

Set  of  computer  rules  for  tagging  data  on  

ingesUon  

Data  with  its  dataTag  deposit  into  a  dataTags-­‐compliant  repository  

Wiki  approach  

HarmonizaUon  

Modeling  

Legal  Experts  Codify  Jurisprudence    

into  the  six  levels.  

Set  of  computer  rules  for  tagging  data  on  

ingesUon  

Data  with  its  dataTag  deposit  into  a  dataTags-­‐compliant  repository  

Expert  System:  decision  tree  

Expert  System:  rule-­‐based  

Legal  Experts  Codify  Jurisprudence    

into  the  six  levels.  

Set  of  computer  rules  for  tagging  data  on  

ingesUon  

Data  with  its  dataTag  deposit  into  a  dataTags-­‐compliant  repository  

Interview  Server  

Remote  API  Q&A  

Binary,  Remote  Exec  

Legal  Experts  Codify  Jurisprudence    

into  the  six  levels.  

Set  of  computer  rules  for  tagging  data  on  

ingesUon  

Data  with  its  dataTag  deposit  into  a  dataTags-­‐compliant  repository  

Dataverse  

iRods  

Legal  Experts  Codify  Jurisprudence    

into  the  six  levels.  

Set  of  computer  rules  for  tagging  data  on  

ingesUon  

Data  with  its  dataTag  deposit  into  a  dataTags-­‐compliant  repository  

Interview  datatags.org