english corpus


Our  spoken corpora contain audio data with phonological annotations that focus on three areas of segmental features (vowels, consonants and syllable structures) and four areas of suprasegmental features (lexical stress, pause, linking and intonation).

The corpus has the following characteristics:

1. It provides high-quality recordings that are ideally suited for phonetic and acoustic analysis by researchers around the world.

2. It produces recordings and phonological annotations that are easily accessible and immediately available to all learners, teachers and researchers, both in and outside  EdUHK.

3. It provides a platform for learners to access and rate the corpus data in order to discover the linguistic features on their own and to enhance their active engagement in their own learning.

4. It describes the distinctive linguistic features of English production from Hong Kong and Mainland university students.