Publications:
2022) Neural Text Sanitization with Explicit Measures of Privacy Risk. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). (
2022) Automatic Evaluation of Disclosure Risks of Text Anonymization Methods. In Privacy in Statistical Databases (PSD 2022). Paris, France. (
2022) GDPR and unstructured data: is anonymization possible? International Data Privacy Law, 12(3). (
2022) The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization. Computational Linguistics, 48(4): 1053-1101.
2022) Bootstrapping Text Anonymization Models with Distant Supervision
. In Proceedings of the Language Resources and Evaluation Conference. ELRA, Marseille, France. ( Pierre Lison, Ildikó Pilán, David Sánchez, Montserrat Batet, and Lilja Øvrelid. 2021. Anonymisation Models for Text Data: State of the Art, Challenges and Future Directions. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pages 4188–4203, Online. Association for Computational Linguistics. [pdf] Pierre Lison, Jeremy Barnes, and Aliaksandr Hubin. 2021. skweak: Weak supervision made easy for NLP. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, pages 337–346, Online. Association for Computational Linguistics. [pdf] [code]