Abstract
We investigate authorship attribution using
classifiers based on frame semantics. The purpose
is to discover whether adding semantic
information to lexical and syntactic methods
for authorship attribution will improve them,
specifically to address the difficult problem of
authorship attribution of translated texts. Our
results suggest (i) that frame-based classifiers
are usable for author attribution of both translated
and untranslated texts; (ii) that framebased
classifiers generally perform worse than
the baseline classifiers for untranslated texts,
but (iii) perform as well as, or superior to
the baseline classifiers on translated texts; (iv)
that—contrary to current belief—naïve classifiers
based on lexical markers may perform
tolerably on translated texts if the combination
of author and translator is present in the training
set of a classifier.
classifiers based on frame semantics. The purpose
is to discover whether adding semantic
information to lexical and syntactic methods
for authorship attribution will improve them,
specifically to address the difficult problem of
authorship attribution of translated texts. Our
results suggest (i) that frame-based classifiers
are usable for author attribution of both translated
and untranslated texts; (ii) that framebased
classifiers generally perform worse than
the baseline classifiers for untranslated texts,
but (iii) perform as well as, or superior to
the baseline classifiers on translated texts; (iv)
that—contrary to current belief—naïve classifiers
based on lexical markers may perform
tolerably on translated texts if the combination
of author and translator is present in the training
set of a classifier.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies: short papers |
Antal sider | 6 |
Vol/bind | 2 |
Forlag | Association for Computational Linguistics |
Publikationsdato | 2011 |
Sider | 65-70 |
ISBN (Trykt) | 978-1-932432-88-6 |
Status | Udgivet - 2011 |
Begivenhed | 49th Annual Meeting of the Association for Computational Linguistics: human language technologies - Portland, USA Varighed: 19 jun. 2011 → 24 jun. 2011 Konferencens nummer: 49 |
Konference
Konference | 49th Annual Meeting of the Association for Computational Linguistics |
---|---|
Nummer | 49 |
Land/Område | USA |
By | Portland |
Periode | 19/06/2011 → 24/06/2011 |