The CLEANUP Project


Welcome to the public webpage of the CLEANUP project! CLEANUP is a four-year research project funded by the Research Council of Norway. The goal of CLEANUP is to develop new machine learning methods to automatically anonymise (or at least strongly de-identify) text documents containing personal data, such as electronic health records, court rulings or chat-based interactions with customers.

The project brings together a consortium of researchers from machine learning, natural language processing, computational privacy, statistical modelling, health informatics and IT law. In addition, partners from the Norwegian public and private sectors (covering the fields of insurance, welfare, healthcare and legal publishing) contribute their data and domain knowledge to the project.

Oh, and if you were wondering what CLEANUP stands for: it's "Machine Learning for the Anonymisation of Unstructured Personal Data" (yes, we were a bit creative with the acronym).

News:

[2020-04-30]

The official website of the CLEANUP project is now up and running!

[2020-02-01]

The CLEANUP project has now officially started!