Assessing spoken lexical and lexicogrammatical proficiency using features of word, bigram, and dependency bigram use

Kristopher Kyle*, Masaki Eguchi

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

The measurement of second language (L2) productive lexical proficiency has driven a great deal of research over the past two decades. Research has indicated that more proficient speakers and writers tend to use a wider range of words and that more proficient writers tend to use words that are more sophisticated (less frequent in reference corpora). Research over the past 15 years has also demonstrated that the way words are used in context (i.e., collocation use) is also an important indicator of both written and spoken proficiency. In this study, we extend recent research that has modeled writing proficiency using collocation indices based on grammatical dependencies (e.g., verb–direct object) to spoken contexts. In particular, we model speaking proficiency scores from a large corpus of oral proficiency interview responses using a range of well-known indices of productive proficiency and newly developed grammatical dependency indices. The results indicated that all index types demonstrated small to moderate correlations with speaking proficiency individually but explained a large proportion of the variance when used in a multivariate model that included dependency collocation indices.

Original languageEnglish
Pages (from-to)531-564
Number of pages34
JournalModern Language Journal
Volume107
Issue number2
DOIs
Publication statusPublished - 2023 Jun 1
Externally publishedYes

Keywords

  • collocations
  • corpus linguistics
  • learner corpus research
  • lexicogrammatical use
  • natural language processing
  • oral proficiency

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Assessing spoken lexical and lexicogrammatical proficiency using features of word, bigram, and dependency bigram use'. Together they form a unique fingerprint.

Cite this