Multi-level annotation in the Emu speech database management system

Steve Cassidy, Jonathan Harrington

Research output: Contribution to journal › Article › Research › peer-review

Abstract

Researchers in various fields, from acoustic phonetics to child language development, rely on digitized collections of spoken language data as raw material for research. Access to this data has, in the past, been provided in an ad-hoc manner, with labelling standards and software tools developed to serve only one or two projects. A few attempts have been made at providing generalized access to speech corpora, but none of these have gained widespread popularity. The Emu system, described here, is a general-purpose speech database management system which supports complex multi-level annotations. Emu can read a number of popular label and data file formats and supports overlaying additional annotation with inter-token relations on existing time-aligned label files. Emu provides a graphical labelling tool which can be extended to provide special-purpose displays. The software is easily extended via the Tcl/Tk scripting language, which can be used, for example, to manipulate annotations and build graphical tools for database creation. This paper discusses the design of the Emu system, giving a detailed description of the annotation structures that it supports. It is argued that these structures are sufficiently general to allow Emu to read potentially any time-aligned linguistic annotation.
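As an illustration only of what a multi-level annotation with inter-token relations over time-aligned labels might look like, here is a minimal Python sketch. It is not Emu's data model or API: the `Token` class, its fields, the level names, and the example times are all hypothetical, chosen to show a word-level token that carries no times of its own and derives them from the time-aligned phoneme tokens it dominates.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Token:
    """One annotation token on a named level (hypothetical, not Emu's structure)."""
    level: str
    label: str
    start: Optional[float] = None  # seconds; None if the token is not time-aligned
    end: Optional[float] = None
    children: List["Token"] = field(default_factory=list)  # tokens on a lower level

    def span(self) -> Tuple[float, float]:
        """Return (start, end), derived from the children when this token has no times."""
        if self.start is not None and self.end is not None:
            return (self.start, self.end)
        if not self.children:
            raise ValueError(f"token {self.label!r} has no times and no children")
        spans = [child.span() for child in self.children]
        return (min(s for s, _ in spans), max(e for _, e in spans))


# Time-aligned phoneme labels, as they might come from an existing label file.
phonemes = [
    Token("Phoneme", "k", 0.10, 0.18),
    Token("Phoneme", "ae", 0.18, 0.33),
    Token("Phoneme", "t", 0.33, 0.43),
]

# A word-level token overlaid on the phonemes via a dominance relation.
word = Token("Word", "cat", children=phonemes)
print(word.span())
```

In this sketch the higher level stores only relations to the lower level, so the existing time-aligned labels remain the single source of timing information.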

Language: English
Pages: 61-77
Number of pages: 17
Journal: Speech Communication
Volume: 33
Issue number: 1-2
DOI: 10.1016/S0167-6393(00)00069-8
Publication status: Published - Jan 2001

Fingerprint

Labeling
Annotation
Labels
Speech analysis
management
Linguistics
Raw materials
Acoustics
Display devices
spoken language
language
raw materials
phonetics
acoustics
popularity
linguistics
Software Tools
Display
Speech
Software

Cite this

Cassidy, Steve ; Harrington, Jonathan. / Multi-level annotation in the Emu speech database management system. In: Speech Communication. 2001 ; Vol. 33, No. 1-2. pp. 61-77.
@article{ee5429b15a0a44a7a231f89e2213b9ac,
title = "Multi-level annotation in the Emu speech database management system",
abstract = "Researchers in various fields, from acoustic phonetics to child language development, rely on digitized collections of spoken language data as raw material for research. Access to this data had, in the past, been provided in an ad-hoc manner with labelling standards and software tools developed to serve only one or two projects. A few attempts have been made at providing generalized access to speech corpora but none of these have gained widespread popularity. The Emu system, described here, is a general purpose speech database management system which supports complex multi-level annotations. Emu can read a number of popular label and data file formats and supports overlaying additional annotation with inter-token relations on existing time-aligned label files. Emu provides a graphical labelling tool which can be extended to provide special purpose displays. The software is easily extended via the Tcl/Tk scripting language which can be used, for example, to manipulate annotations and build graphical tools for database creation. This paper discusses the design of the Emu system, giving a detailed description of the annotation structures that it supports. It is argued that these structures are sufficiently general to allow Emu to read potentially any time-aligned linguistic annotation.",
author = "Steve Cassidy and Jonathan Harrington",
year = "2001",
month = "1",
doi = "10.1016/S0167-6393(00)00069-8",
language = "English",
volume = "33",
pages = "61--77",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "1-2",

}

TY - JOUR

T1 - Multi-level annotation in the Emu speech database management system

AU - Cassidy, Steve

AU - Harrington, Jonathan

PY - 2001/1

Y1 - 2001/1

N2 - Researchers in various fields, from acoustic phonetics to child language development, rely on digitized collections of spoken language data as raw material for research. Access to this data had, in the past, been provided in an ad-hoc manner with labelling standards and software tools developed to serve only one or two projects. A few attempts have been made at providing generalized access to speech corpora but none of these have gained widespread popularity. The Emu system, described here, is a general purpose speech database management system which supports complex multi-level annotations. Emu can read a number of popular label and data file formats and supports overlaying additional annotation with inter-token relations on existing time-aligned label files. Emu provides a graphical labelling tool which can be extended to provide special purpose displays. The software is easily extended via the Tcl/Tk scripting language which can be used, for example, to manipulate annotations and build graphical tools for database creation. This paper discusses the design of the Emu system, giving a detailed description of the annotation structures that it supports. It is argued that these structures are sufficiently general to allow Emu to read potentially any time-aligned linguistic annotation.

UR - http://www.scopus.com/inward/record.url?scp=0035156352&partnerID=8YFLogxK

U2 - 10.1016/S0167-6393(00)00069-8

DO - 10.1016/S0167-6393(00)00069-8

M3 - Article

VL - 33

SP - 61

EP - 77

JO - Speech Communication

T2 - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 1-2

ER -