Overview of the 2017 ALTA Shared Task: correcting OCR errors

Research output: Chapter in Book/Report/Conference proceeding > Conference proceeding contribution

Abstract

This paper presents an overview of the 8th ALTA shared task that ran in 2017.
The task was to correct OCR errors from scans of newspapers stored in the Trove
database maintained by the National Library of Australia. We introduce the task,
describe the data and present the results of the participating teams.
Original language: English
Title of host publication: Australasian Language Technology Association Workshop 2017
Subtitle of host publication: Proceedings of the Workshop
Editors: Jojo Sze-Meng Wong, Gholamreza Haffari
Pages: 115-118
Number of pages: 4
Publication status: Published - 2017
Event: Australasian Language Technology Association Workshop 2017 - Brisbane, Australia
Duration: 6 Dec 2017 to 8 Dec 2017

Conference

Conference: Australasian Language Technology Association Workshop 2017
Country/Territory: Australia
City: Brisbane
Period: 6/12/17 to 8/12/17
