Abstract
We present LAIR: a domain-specific language that enables users to specify actions to be taken upon encountering specific semantic frames in a text, in particular to rephrase and redact the textual content. While LAIR presupposes superficial knowledge of frames and frame semantics, it requires only limited prior programming experience. It contains neither scripting nor I/O primitives, has no general loop constructs, and is not Turing-complete. We have implemented a LAIR compiler and integrated it into a pipeline for automated redaction of web pages. We detail our experience with automated redaction of web pages for subjectively undesirable content; initial experiments suggest that a small language based on semantic recognition of undesirable terms can be highly useful as a supplement to traditional methods of text sanitization.
Original language | English |
---|---|
Title of host publication | Proceedings of the 3rd IEEE International Conference on Semantic Computing (ICSC 2009) |
Number of pages | 6 |
Publisher | IEEE Computer Society Press |
Publication date | 2009 |
Pages | 47-52 |
ISBN (Print) | 978-0-7695-3800-6 |
Publication status | Published - 2009 |
Event | IEEE International Conference on Semantic Computing, Berkeley, United States. Duration: 14 Sept 2009 → 16 Sept 2009. Conference number: 3 |
Conference
Conference | IEEE International Conference on Semantic Computing |
---|---|
Number | 3 |
Country/Territory | United States |
City | Berkeley |
Period | 14/09/2009 → 16/09/2009 |