Temporal context for authorship attribution: a study of Danish secondary schools

Niels Dalum Hansen, Christina Lioma, Birger Larsen, Stephen Alstrup

5 Citationer (Scopus)

Abstract

We study temporal aspects of authorship attribution - a task which aims to distinguish automatically between texts written by different authors by measuring textual features. This task is important in a number of areas, including plagiarism detection in secondary education, which we study in this work. As the academic abilities of students evolve during their studies, so does their writing style. These changes in writing style form a type of temporal context, which we study for the authorship attribution process by focussing on the students’ more recent writing samples. Experiments with real world data from Danish secondary school students show 84% prediction accuracy when using all available material and 71.9% prediction accuracy when using only the five most recent writing samples from each student.

This type of authorship attribution with only few recent writing samples is significantly faster than conventional approaches using the complete writings of all authors. As such, it can be integrated into working interactive plagiarism detection systems for secondary education, which assist teachers by flagging automatically incoming student work that deviates significantly from the student’s previous work, even during scenarios requiring fast response and heavy data processing, like the period of national examinations.

OriginalsprogEngelsk
TitelMultidisciplinary information retrieval : 7th Information Retrieval Facility Conference, IRFC 2014, Copenhagen, Denmark, November 10-12, 2014, Proceedings
RedaktørerDavid Lamas, Paul Buitelaar
Antal sider19
ForlagSpringer
Publikationsdato2014
Sider22-40
ISBN (Trykt)978-3-319-12978-5
ISBN (Elektronisk)978-3-319-12979-2
DOI
StatusUdgivet - 2014
Begivenhed3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference - Copenhagen, Danmark
Varighed: 10 nov. 201412 nov. 2014

Konference

Konference3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference
Land/OmrådeDanmark
ByCopenhagen
Periode10/11/201412/11/2014
NavnLecture notes in computer science
Vol/bind8849
ISSN0302-9743

Emneord

  • Det Natur- og Biovidenskabelige Fakultet

Fingeraftryk

Dyk ned i forskningsemnerne om 'Temporal context for authorship attribution: a study of Danish secondary schools'. Sammen danner de et unikt fingeraftryk.

Citationsformater