skip to main content
research-article

Privacy-Preserving Linkage of Genomic and Clinical Data Sets

Published: 01 July 2019 Publication History

Abstract

The capacity to link records associated with the same individual across data sets is a key challenge for data-driven research. The challenge is exacerbated by the potential inclusion of both genomic and clinical data in data sets that may span multiple legal jurisdictions, and by the need to enable re-identification in limited circumstances. Privacy-Preserving Record Linkage PPRL methods address these challenges. In 2016, the Interdisciplinary Committee of the International Rare Diseases Research Consortium IRDiRC launched a task team to explore approaches to PPRL. The task team is a collaboration with the Global Alliance for Genomics and Health GA4GH Regulatory and Ethics and Data Security Work Streams, and aims to prepare policy and technology standards to enable highly reliable linking of records associated with the same individual without disclosing their identity except under conditions in which the use of the data has led to information of importance to the individual's safety or health, and applicable law allows or requires the return of results. The PPRL Task Force has examined the ethico-legal requirements, constraints, and implications of PPRL, and has applied this knowledge to the exploration of technology methods and approaches to PPRL. This paper reports and justifies the findings and recommendations thus far.

References

[1]
R. Schnell, "Privacy-preserving data linkage," Methodological Developments in Data Linkage, K. Harron, H. Goldstein, and C. Dibben, eds. Hoboken, NJ, USA: Wiley, 2016, pp. 201-225.
[2]
Global Alliance for Genomics and Health, "Framework for responsible sharing of genomic and health-related data," 2014, [Online]. Available: https://www.ga4gh.org/ga4ghtoolkit/regulatoryandethics/framework-for-responsible-sharing-genomic-and-health-related-data/.
[3]
Freedom of Information and Protection of Privacy Act, Revised Statutes of British Columbia 1996, chapter 165, sections 35(1)(b), 36.1 (1).
[4]
E-Health Act, Statutes of British Columbia 2008, chapter 38, section 14(2.1)(d).
[5]
Personal Information Protection Act, Statutes of British Columbia 2003, chapter 63, section 21(1)(c).
[6]
Health Insurance Portability and Accountability Act of 1996 (U.S.), Public Law No. 104-191, US Statutes at Large, vol. 110, pp. 1936ff, 1996.
[7]
General Data Protection Regulation, Official Journal of the European Union, vol. 59, L 119/1. 2016.
[8]
D. Vatsalan, Z. Sehili, P. Christen, and E. Rahm, "Privacy-preserving record linkage for big data: Current approaches and research challenges," Handbook of Big Data Technologies. Berlin, Germany: Springer, 2016.
[9]
P. Christen, "Privacy-preserving record linkage," ScaDS Leipzig, 2016, [Online]. Available: http://users.cecs.anu.edu.au/~christen/publications/christen2016scads.pdf.
[10]
National Institutes of Health, "Global unique identifier (GUID)," 2016, [Online]. Available: https://data-archive.nimh.nih.gov/guid/.
[11]
Australian Institute of Health and Welfare, "SLK-581 Guide for use," Australian government, Jul. 2016, [Online]. Available: https://www.aihw.gov.au/getmedia/cf980d57-c72f-4925-b1fc-36be6e80d3cd/aodts-nmds-2017-18-slk-581-guide.pdf.aspx.
[12]
G. van Grootheest, M. C. H. de Groot, D. J. van der Laan, J. H. Smit, and B. F. M. Bakker, "Record linkage for health studies: Three demonstration projects," 2015, [Online]. Available: http://www.biolink-nl.eu/public/2015_recordlinkageforhealthstudies.pdf.
[13]
L. Hardesty, "Securing the cloud: A new algorithm solves a major pproblem with homomorphic encryption, which would let web servers process data without decrypting it," MIT News, 2013, [Online]. Available: http://news.mit.edu/2013/algorithm-solves-homomorphic-encryption-problem-0610.
[14]
M. Lablans, A. Borg, F. Ückert, "A RESTful interface to pseudonymization services in modern web applications," BMC Med. Inf. Decision Making, vol. 15, no. 2, 2015.
[15]
M. Nitzlnader, G. Schreier, "Patient identity management for secondary use of biomedical research data in a distributed computing environment," eHealth2014 - Health Informatics Meets eHealth, A. Hörbst, et al., eds., Amsterdam, The Netherlands: IOS Press, 2014, [Online]. Available: https://eupid.eu/assets/downloads/nitzlnader2014.pdf.
[16]
"Mainzelliste," [Online]. Available: https://mainzelliste.de.
[17]
Institute of Medical Biostatics, Epidemiology, and Informatics (IMBEI), "Mainzelliste as an open source service," University Medical Center of the Johannes Gutenberg University Mainz 2013, [Online]. Archived at: https://web.archive.org/web/20160815024055/http://www.unimedizin-mainz.de:80/imbei/medicalinformatics/ag-verbundforschung/mainzelliste.html?L=1.
[18]
FORCE11, "Guiding principles for findable, accessible, interoperable and re-usable data publishing version b1.0," [Online]. Available: https://www.force11.org/node/6062.

Cited By

View all
  • (2022)Generating-Set Evaluation of Bloom Filter Hardening Techniques in Private Record LinkageInformation Systems Security10.1007/978-3-031-23690-7_3(44-63)Online publication date: 16-Dec-2022
  • (2022)Privacy-Preserving Record Linkage Using Local Sensitive Hash and Private Set IntersectionApplied Cryptography and Network Security Workshops10.1007/978-3-031-16815-4_22(398-424)Online publication date: 20-Jun-2022
  • (2021)Recent Developments in Privacy-preserving Mining of Clinical DataACM/IMS Transactions on Data Science10.1145/34477742:4(1-32)Online publication date: 15-Nov-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 16, Issue 4
July 2019
360 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 July 2019
Published in TCBB Volume 16, Issue 4

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Generating-Set Evaluation of Bloom Filter Hardening Techniques in Private Record LinkageInformation Systems Security10.1007/978-3-031-23690-7_3(44-63)Online publication date: 16-Dec-2022
  • (2022)Privacy-Preserving Record Linkage Using Local Sensitive Hash and Private Set IntersectionApplied Cryptography and Network Security Workshops10.1007/978-3-031-16815-4_22(398-424)Online publication date: 20-Jun-2022
  • (2021)Recent Developments in Privacy-preserving Mining of Clinical DataACM/IMS Transactions on Data Science10.1145/34477742:4(1-32)Online publication date: 15-Nov-2021
  • (2020)P-Signature-Based Blocking to Improve the Scalability of Privacy-Preserving Record LinkageData Privacy Management, Cryptocurrencies and Blockchain Technology10.1007/978-3-030-66172-4_3(35-51)Online publication date: 17-Sep-2020

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media