Querying databases of annotated speech

S. Cassidy, S. Bird

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

8 Citations (Scopus)
6 Downloads (Pure)


Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic 'transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds an additional layer of complexity. This paper presents and harmonises two independent efforts to model annotated speech databases, one at Macquarie University, and one at the University of Pennsylvania. Various query languages are described along with illustrative applications to a variety of analytical problems. The research reported here forms a part of several ongoing projects to develop platform-independent open-source tools for creating, browsing, searching, querying and transforming linguistic databases, and to disseminate large linguistic databases over the Internet.

Original languageEnglish
Title of host publicationProceedings - 11th Australasian Database Conference, ADC 2000
EditorsMaria E. Orlowska
Place of PublicationCanberra
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages9
ISBN (Electronic)9780769505282
ISBN (Print)0769505287
Publication statusPublished - 2000
Event11th Australasian Database Conference, ADC 2000 - Canberra, Australia
Duration: 31 Jan 20003 Feb 2000


Other11th Australasian Database Conference, ADC 2000

Bibliographical note

Copyright 2000 IEEE. Reprinted from Proceedings : 11th Australasian Database Conference, ADC 2000, 31 January-3 February 2000, Canberra, Australia. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of Macquarie University’s products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Fingerprint Dive into the research topics of 'Querying databases of annotated speech'. Together they form a unique fingerprint.

Cite this