privacy
Data Study Group Final Report: Department for Work and Pensions
A summary report of methods to gauge the suitability and privacy of synthetic datasets, produced for the Alan Turing Institute’s Data Study Group programme.
Richard Plant, Oleksandr Deineha, Charlotte Grace, Samruddhi Mhatre, Tanut Treetanthiploet, Daniel Valdenegro, Liang Zhou, Oliver Strickson
PDF
Project
DOI
CAPE: Context-Aware Private Embeddings for Private Language Learning
Large pre-trained language models have pushed the boundaries of the state of the art in NLP, but they risk encoding private personal information derived from their input texts. Our proposed system adds calibrated noise and an adversarial training objective to reduce this leakage of private information.
Richard Plant, Dimitra Gkatzia, Valerio Giuffrida
PDF
Code