The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university

Michael P. Stevens, Yu-Hua Chen, Simon Harrison

Research output: Chapter in Book/Report/Conference proceeding (Chapter)

Abstract

This chapter reports on the motivations, design, and challenges of building the Corpus of Chinese Academic Written and Spoken English (CAWSE), an open-access corpus of Chinese students’ English samples collected from a Sino-British university in China. To date, the corpus comprises over 1.5 million words in the subcorpus of written assessment, a subcorpus of 63.8 hours of spoken assessment, and a multimodal subcorpus totaling 24 hours of audio/video classroom data, including 107 tasks already or in process of being transcribed and annotated. Focusing on the design and construction of the multimodal subcorpus comprised of video recordings of student interaction in classroom settings, we report specifically on the challenges that arise from the institutional and material realities of working with gatekeepers, handling data ethically, and selecting and processing data that is representative. We also explain the motivations behind focusing on student language, which we characterize as an emerging variety of L2 academic English as a Lingua Franca (ELF). We hope to provide insight into the complexities of developing a multimodal corpus from the unique setting of a foreign-based EMI campus.
Original language: English
Title of host publication: Variation in Time and Space: Observing the World through Corpora
Editors: Anna Čermáková, Markéta Malá
Publisher: De Gruyter
ISBN (Print): 978-3-11-060471-9
Publication status: Accepted/In press - 2020

Publication series

Name: Diskursmuster - Discourse Patterns

Fingerprint

English-medium instruction (EMI)
university
classroom
gatekeeper
video recording
student
open access
video
China
interaction
language

Cite this

Stevens, M. P., Chen, Y-H., & Harrison, S. (Accepted/In press). The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university. In A. Čermáková, & M. Malá (Eds.), Variation in Time and Space: Observing the World through Corpora (Diskursmuster - Discourse Patterns). De Gruyter.

The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university. / Stevens, Michael P.; Chen, Yu-Hua; Harrison, Simon.

Variation in Time and Space: Observing the World through Corpora. ed. / Anna Čermáková; Markéta Malá. De Gruyter, 2020. (Diskursmuster - Discourse Patterns).


Stevens, MP, Chen, Y-H & Harrison, S 2020, The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university. in A Čermáková & M Malá (eds), Variation in Time and Space: Observing the World through Corpora. Diskursmuster - Discourse Patterns, De Gruyter.
Stevens MP, Chen Y-H, Harrison S. The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university. In Čermáková A, Malá M, editors, Variation in Time and Space: Observing the World through Corpora. De Gruyter. 2020. (Diskursmuster - Discourse Patterns).
Stevens, Michael P.; Chen, Yu-Hua; Harrison, Simon. / The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university. Variation in Time and Space: Observing the World through Corpora. editor / Anna Čermáková; Markéta Malá. De Gruyter, 2020. (Diskursmuster - Discourse Patterns).
@inbook{0bc99e08b88943848cae60c0e1c2727d,
title = "The EMI campus as site and source for a multimodal corpus: Issues and challenges of corpus construction at a Sino-British university",
abstract = "This chapter reports on the motivations, design, and challenges of building the Corpus of Chinese Academic Written and Spoken English (CAWSE), an open-access corpus of Chinese students’ English samples collected from a Sino-British university in China. To date, the corpus comprises over 1.5 million words in the subcorpus of written assessment, a subcorpus of 63.8 hours of spoken assessment, and a multimodal subcorpus totaling 24 hours of audio/video classroom data, including 107 tasks already or in process of being transcribed and annotated. Focusing on the design and construction of the multimodal subcorpus comprised of video recordings of student interaction in classroom settings, we report specifically on the challenges that arise from the institutional and material realities of working with gatekeepers, handling data ethically, and selecting and processing data that is representative. We also explain the motivations behind focusing on student language, which we characterize as an emerging variety of L2 academic English as a Lingua Franca (ELF). We hope to provide insight into the complexities of developing a multimodal corpus from the unique setting of a foreign-based EMI campus.",
author = "Stevens, {Michael P.} and Yu-Hua Chen and Simon Harrison",
year = "2020",
language = "English",
isbn = "978-3-11-060471-9",
series = "Diskursmuster - Discourse Patterns",
publisher = "De Gruyter",
editor = "Anna {\v{C}}erm{\'a}kov{\'a} and Mark{\'e}ta Mal{\'a}",
booktitle = "Variation in Time and Space: Observing the World through Corpora",

}

TY - CHAP

T1 - The EMI campus as site and source for a multimodal corpus

T2 - Issues and challenges of corpus construction at a Sino-British university

AU - Stevens, Michael P.

AU - Chen, Yu-Hua

AU - Harrison, Simon

PY - 2020

Y1 - 2020

AB - This chapter reports on the motivations, design, and challenges of building the Corpus of Chinese Academic Written and Spoken English (CAWSE), an open-access corpus of Chinese students’ English samples collected from a Sino-British university in China. To date, the corpus comprises over 1.5 million words in the subcorpus of written assessment, a subcorpus of 63.8 hours of spoken assessment, and a multimodal subcorpus totaling 24 hours of audio/video classroom data, including 107 tasks already or in process of being transcribed and annotated. Focusing on the design and construction of the multimodal subcorpus comprised of video recordings of student interaction in classroom settings, we report specifically on the challenges that arise from the institutional and material realities of working with gatekeepers, handling data ethically, and selecting and processing data that is representative. We also explain the motivations behind focusing on student language, which we characterize as an emerging variety of L2 academic English as a Lingua Franca (ELF). We hope to provide insight into the complexities of developing a multimodal corpus from the unique setting of a foreign-based EMI campus.

M3 - Chapter

SN - 978-3-11-060471-9

T3 - Diskursmuster - Discourse Patterns

BT - Variation in Time and Space: Observing the World through Corpora

A2 - Čermáková, Anna

A2 - Malá, Markéta

PB - De Gruyter

ER -