Temporal context for authorship attribution: a study of Danish secondary schools

Niels Dalum Hansen, Christina Lioma, Birger Larsen, Stephen Alstrup

5 Citations (Scopus)

Abstract

We study temporal aspects of authorship attribution - a task which aims to distinguish automatically between texts written by different authors by measuring textual features. This task is important in a number of areas, including plagiarism detection in secondary education, which we study in this work. As the academic abilities of students evolve during their studies, so does their writing style. These changes in writing style form a type of temporal context, which we study for the authorship attribution process by focussing on the students’ more recent writing samples. Experiments with real world data from Danish secondary school students show 84% prediction accuracy when using all available material and 71.9% prediction accuracy when using only the five most recent writing samples from each student.

This type of authorship attribution with only few recent writing samples is significantly faster than conventional approaches using the complete writings of all authors. As such, it can be integrated into working interactive plagiarism detection systems for secondary education, which assist teachers by flagging automatically incoming student work that deviates significantly from the student’s previous work, even during scenarios requiring fast response and heavy data processing, like the period of national examinations.

Original languageEnglish
Title of host publicationMultidisciplinary information retrieval : 7th Information Retrieval Facility Conference, IRFC 2014, Copenhagen, Denmark, November 10-12, 2014, Proceedings
EditorsDavid Lamas, Paul Buitelaar
Number of pages19
PublisherSpringer
Publication date2014
Pages22-40
ISBN (Print)978-3-319-12978-5
ISBN (Electronic)978-3-319-12979-2
DOIs
Publication statusPublished - 2014
Event3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference - Copenhagen, Denmark
Duration: 10 Nov 201412 Nov 2014

Conference

Conference3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference
Country/TerritoryDenmark
CityCopenhagen
Period10/11/201412/11/2014
SeriesLecture notes in computer science
Volume8849
ISSN0302-9743

Keywords

  • Faculty of Science
  • authorship attribution, plagiarism detection

Fingerprint

Dive into the research topics of 'Temporal context for authorship attribution: a study of Danish secondary schools'. Together they form a unique fingerprint.

Cite this