Querying databases of annotated speech

S. Cassidy, S. Bird

Research output: Chapter in Book/Report/Conference proceeding › Conference proceeding contribution › Research › peer-review

Abstract

Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic 'transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds a further layer of complexity. This paper presents and harmonises two independent efforts to model annotated speech databases, one at Macquarie University and one at the University of Pennsylvania. Various query languages are described along with illustrative applications to a variety of analytical problems. The research reported here forms part of several ongoing projects to develop platform-independent open-source tools for creating, browsing, searching, querying and transforming linguistic databases, and to disseminate large linguistic databases over the Internet.

Language: English
Title of host publication: Proceedings - 11th Australasian Database Conference, ADC 2000
Editors: Maria E. Orlowska
Place of Publication: Canberra
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 12-20
Number of pages: 9
ISBN (Electronic): 9780769505282
ISBN (Print): 0769505287
DOI: https://doi.org/10.1109/ADC.2000.819808
Publication status: Published - 2000
Event: 11th Australasian Database Conference, ADC 2000 - Canberra, Australia
Duration: 31 Jan 2000 to 3 Feb 2000

Other

Other: 11th Australasian Database Conference, ADC 2000
Country: Australia
City: Canberra
Period: 31/01/00 to 3/02/00

Fingerprint

Linguistics
Query languages
Transcription
Internet

Bibliographical note

Copyright 2000 IEEE. Reprinted from Proceedings : 11th Australasian Database Conference, ADC 2000, 31 January-3 February 2000, Canberra, Australia. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of Macquarie University’s products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Cite this

Cassidy, S., & Bird, S. (2000). Querying databases of annotated speech. In M. E. Orlowska (Ed.), Proceedings - 11th Australasian Database Conference, ADC 2000 (pp. 12-20). [819808] Canberra: Institute of Electrical and Electronics Engineers (IEEE). https://doi.org/10.1109/ADC.2000.819808

