Abstract
We describe an anonymization tool that was commissioned by, and specified together with, Schultz, a publishing company specializing in Danish law-related publications. The unavailability of training data and the need to guarantee compliance with pre-existing anonymization guidelines led us to implement the tool using manually crafted rules. To meet the requirement that the XML structure of the input document pass unchanged through the whole workflow, we used Bracmat, a programming language specialized in transforming tree data structures. The tool attains reassuringly good recall, makes almost no chunk errors, and reduces the entity designators it finds to a nearly correct set of the entities that the input text refers to, minimizing the time needed for manual checking and post-editing.
Translated title of the contribution | Anonymisering af Dommerkendelser |
---|---|
Original language | English |
Publication date | 25 Jul 2016 |
Number of pages | 4 |
Publication status | Published - 25 Jul 2016 |
Event | Iberian Conference on Information Systems and Technologies, Gran Canaria, Spain. Duration: 15 Jun 2016 → 18 Jun 2016. Conference number: 11 |
Conference
Conference | Iberian Conference on Information Systems and Technologies |
---|---|
Number | 11 |
Location | Gran Canaria |
Country/Territory | Spain |
Period | 15/06/2016 → 18/06/2016 |
Keywords
- Faculty of Humanities
Activities
- 1 Lecture and oral contribution
- Automatisk anonymisering af fortrolige dokumenter (Automatic anonymization of confidential documents)
  Claus Povlsen (Lecturer)
  14 Aug 2012
  Activity: Talk or presentation types › Lecture and oral contribution