Annotating a corpus of spoken English: The Engineering Lecture Corpus (ELC)

Research output: Chapter in Book/Report/Conference proceedingConference proceeding

Abstract

This paper describes an approach to what we are calling the ‘pragmatic’ annotation of the Engineering Lecture Corpus (ELC). The ELC contains 70 English-medium engineering lectures from across the world, currently including Malaysia, New Zealand, the United Kingdom and Italy (www.coventry.ac.uk/elc). The lectures are in the form of videos, raw text transcripts and XML files encoded using traditional TEI methods, but also marked for a limited number of features which shed light on the specific nature of lecture discourse. These functions will be discussed in terms of: how the current working list was reached, markup and annotation processes, and possible uses of the complete corpus
Original languageEnglish
Title of host publicationProceedings of the VIIth GSCP International Conference
Subtitle of host publicationSpeech and Corpora
EditorsHeliana Mello, Massimo Pettorino, Tommaso Raso
Place of PublicationFlorence
PublisherFirenze University Press
Pages58-62
Number of pages5
ISBN (Electronic)9788866553519
Publication statusPublished - 29 Feb 2012

Fingerprint

XML

Cite this

Alsop, S., & Nesi, H. (2012). Annotating a corpus of spoken English: The Engineering Lecture Corpus (ELC). In H. Mello, M. Pettorino, & T. Raso (Eds.), Proceedings of the VIIth GSCP International Conference: Speech and Corpora (pp. 58-62). Florence: Firenze University Press.

Annotating a corpus of spoken English : The Engineering Lecture Corpus (ELC). / Alsop, Sian; Nesi, Hilary.

Proceedings of the VIIth GSCP International Conference: Speech and Corpora. ed. / Heliana Mello; Massimo Pettorino; Tommaso Raso. Florence : Firenze University Press, 2012. p. 58-62.

Research output: Chapter in Book/Report/Conference proceedingConference proceeding

Alsop, S & Nesi, H 2012, Annotating a corpus of spoken English: The Engineering Lecture Corpus (ELC). in H Mello, M Pettorino & T Raso (eds), Proceedings of the VIIth GSCP International Conference: Speech and Corpora. Firenze University Press, Florence, pp. 58-62.
Alsop S, Nesi H. Annotating a corpus of spoken English: The Engineering Lecture Corpus (ELC). In Mello H, Pettorino M, Raso T, editors, Proceedings of the VIIth GSCP International Conference: Speech and Corpora. Florence: Firenze University Press. 2012. p. 58-62
Alsop, Sian ; Nesi, Hilary. / Annotating a corpus of spoken English : The Engineering Lecture Corpus (ELC). Proceedings of the VIIth GSCP International Conference: Speech and Corpora. editor / Heliana Mello ; Massimo Pettorino ; Tommaso Raso. Florence : Firenze University Press, 2012. pp. 58-62
@inproceedings{5beba002d49241de8fc4f118735188e6,
title = "Annotating a corpus of spoken English: The Engineering Lecture Corpus (ELC)",
abstract = "This paper describes an approach to what we are calling the ‘pragmatic’ annotation of the Engineering Lecture Corpus (ELC). The ELC contains 70 English-medium engineering lectures from across the world, currently including Malaysia, New Zealand, the United Kingdom and Italy (www.coventry.ac.uk/elc). The lectures are in the form of videos, raw text transcripts and XML files encoded using traditional TEI methods, but also marked for a limited number of features which shed light on the specific nature of lecture discourse. These functions will be discussed in terms of: how the current working list was reached, markup and annotation processes, and possible uses of the complete corpus",
author = "Sian Alsop and Hilary Nesi",
year = "2012",
month = "2",
day = "29",
language = "English",
pages = "58--62",
editor = "Mello, {Heliana } and Pettorino, {Massimo } and Raso, {Tommaso }",
booktitle = "Proceedings of the VIIth GSCP International Conference",
publisher = "Firenze University Press",

}

TY - GEN

T1 - Annotating a corpus of spoken English

T2 - The Engineering Lecture Corpus (ELC)

AU - Alsop, Sian

AU - Nesi, Hilary

PY - 2012/2/29

Y1 - 2012/2/29

N2 - This paper describes an approach to what we are calling the ‘pragmatic’ annotation of the Engineering Lecture Corpus (ELC). The ELC contains 70 English-medium engineering lectures from across the world, currently including Malaysia, New Zealand, the United Kingdom and Italy (www.coventry.ac.uk/elc). The lectures are in the form of videos, raw text transcripts and XML files encoded using traditional TEI methods, but also marked for a limited number of features which shed light on the specific nature of lecture discourse. These functions will be discussed in terms of: how the current working list was reached, markup and annotation processes, and possible uses of the complete corpus

AB - This paper describes an approach to what we are calling the ‘pragmatic’ annotation of the Engineering Lecture Corpus (ELC). The ELC contains 70 English-medium engineering lectures from across the world, currently including Malaysia, New Zealand, the United Kingdom and Italy (www.coventry.ac.uk/elc). The lectures are in the form of videos, raw text transcripts and XML files encoded using traditional TEI methods, but also marked for a limited number of features which shed light on the specific nature of lecture discourse. These functions will be discussed in terms of: how the current working list was reached, markup and annotation processes, and possible uses of the complete corpus

UR - https://www.torrossa.com/pages/ipplatform/itemDetails.faces

M3 - Conference proceeding

SP - 58

EP - 62

BT - Proceedings of the VIIth GSCP International Conference

A2 - Mello, Heliana

A2 - Pettorino, Massimo

A2 - Raso, Tommaso

PB - Firenze University Press

CY - Florence

ER -