Overview of the 2017 ALTA Shared Task: correcting OCR errors

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearch

Abstract

This paper presents an overview of the 8th ALTA shared task that ran in 2017.
The task was to correct OCR errors from scans of newspapers stored in the Trove
database maintained by the National Library of Australia. We introduce the task,
describe the data and present the results of the participating teams.
LanguageEnglish
Title of host publicationAustralasian Language Technology Association Workshop 2017
Subtitle of host publicationProceedings of the Workshop
EditorsJojo Sze-Meng Wong, Gholamreza Haffari
Pages115-118
Number of pages4
Publication statusPublished - 2017
EventAustralasian Language Technology Association Workshop 2017 - Brisbane, Australia
Duration: 6 Dec 20178 Dec 2017

Conference

ConferenceAustralasian Language Technology Association Workshop 2017
CountryAustralia
CityBrisbane
Period6/12/178/12/17

Fingerprint

Optical character recognition

Cite this

Molla, D., & Cassidy, S. (2017). Overview of the 2017 ALTA Shared Task: correcting OCR errors. In J. S-M. Wong, & G. Haffari (Eds.), Australasian Language Technology Association Workshop 2017: Proceedings of the Workshop (pp. 115-118)
Molla, Diego ; Cassidy, Stephen. / Overview of the 2017 ALTA Shared Task : correcting OCR errors. Australasian Language Technology Association Workshop 2017: Proceedings of the Workshop. editor / Jojo Sze-Meng Wong ; Gholamreza Haffari. 2017. pp. 115-118
@inproceedings{be4c91a16e80496e8b13138c246effc0,
title = "Overview of the 2017 ALTA Shared Task: correcting OCR errors",
abstract = "This paper presents an overview of the 8th ALTA shared task that ran in 2017.The task was to correct OCR errors from scans of newspapers stored in the Trovedatabase maintained by the National Library of Australia. We introduce the task,describe the data and present the results of the participating teams.",
author = "Diego Molla and Stephen Cassidy",
year = "2017",
language = "English",
pages = "115--118",
editor = "Wong, {Jojo Sze-Meng} and Gholamreza Haffari",
booktitle = "Australasian Language Technology Association Workshop 2017",

}

Molla, D & Cassidy, S 2017, Overview of the 2017 ALTA Shared Task: correcting OCR errors. in JS-M Wong & G Haffari (eds), Australasian Language Technology Association Workshop 2017: Proceedings of the Workshop. pp. 115-118, Australasian Language Technology Association Workshop 2017, Brisbane, Australia, 6/12/17.

Overview of the 2017 ALTA Shared Task : correcting OCR errors. / Molla, Diego; Cassidy, Stephen.

Australasian Language Technology Association Workshop 2017: Proceedings of the Workshop. ed. / Jojo Sze-Meng Wong; Gholamreza Haffari. 2017. p. 115-118.

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionResearch

TY - GEN

T1 - Overview of the 2017 ALTA Shared Task

T2 - correcting OCR errors

AU - Molla, Diego

AU - Cassidy, Stephen

PY - 2017

Y1 - 2017

N2 - This paper presents an overview of the 8th ALTA shared task that ran in 2017.The task was to correct OCR errors from scans of newspapers stored in the Trovedatabase maintained by the National Library of Australia. We introduce the task,describe the data and present the results of the participating teams.

AB - This paper presents an overview of the 8th ALTA shared task that ran in 2017.The task was to correct OCR errors from scans of newspapers stored in the Trovedatabase maintained by the National Library of Australia. We introduce the task,describe the data and present the results of the participating teams.

M3 - Conference proceeding contribution

SP - 115

EP - 118

BT - Australasian Language Technology Association Workshop 2017

A2 - Wong, Jojo Sze-Meng

A2 - Haffari, Gholamreza

ER -

Molla D, Cassidy S. Overview of the 2017 ALTA Shared Task: correcting OCR errors. In Wong JS-M, Haffari G, editors, Australasian Language Technology Association Workshop 2017: Proceedings of the Workshop. 2017. p. 115-118