Using sequence alignment and voting to improve optical music recognition from multiple recognizers

Esben Paul Bugge, Kim Lundsteen Juncher, Brian Søborg Mathiesen, Jakob Grue Simonsen

10 Citations (Scopus)

Abstract

Digitalizing sheet music using Optical Music Recognition (OMR) is error-prone, especially when using noisy images created from scanned prints. Inspired by DNA-sequence alignment, we devise a method to use multiple sequence alignment to automatically compare output from multiple third party OMR tools and perform automatic error-correction of pitch and duration of notes. We perform tests on a corpus of 49 one-page scores of varying quality. Our method on average reduces the amount of errors from an ensemble of 4 commercial OMR tools. The method achieves, on average, fewer errors than each recognizer by itself, but statistical tests show that it is significantly better than only 2 of the 4 commercial recognizers. The results suggest that recognizers may be improved somewhat by sequence alignment and voting, but that more elaborate methods may be needed to obtain substantial improvements. All software, scanned music data used for testing, and experiment protocols are open source and available at: http://code.google.com/p/omr-errorcorrection/.

Original languageEnglish
Title of host publicationProceedings of the 12th International Society for Music Information Retrieval Conference
EditorsAnssi Klapuri, Colby Leider
Number of pages6
PublisherUniversity of Miami
Publication date2011
Pages405-410
ISBN (Print)978-0-615-54865-4
Publication statusPublished - 2011
Event12th International Society for Music Information Retrieval Conference - Miami, Florida, United States
Duration: 24 Oct 201128 Oct 2011
Conference number: 12

Conference

Conference12th International Society for Music Information Retrieval Conference
Number12
Country/TerritoryUnited States
CityMiami, Florida
Period24/10/201128/10/2011

Fingerprint

Dive into the research topics of 'Using sequence alignment and voting to improve optical music recognition from multiple recognizers'. Together they form a unique fingerprint.

Cite this