Skip to main navigation Skip to search Skip to main content

Predicting CEFR levels in learners of English: The use of microsystem criterial features in a machine learning approach

  • Thomas Gaillat
  • , Andrew Simpkin
  • , Nicolas Ballier
  • , Bernardo Stearns
  • , Annanda Sousa
  • , Manon Bouyé
  • , Manel Zarrouk
  • Université Rennes 2
  • Université Paris Descartes-Sorbonne Paris Cité
  • University of Galway
  • University of Paris 13

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

24 Citations (Scopus)

Abstract

This paper focuses on automatically assessing language proficiency levels according to linguistic complexity in learner English. We implement a supervised learning approach as part of an automatic essay scoring system. The objective is to uncover Common European Framework of Reference for Languages (CEFR) criterial features in writings by learners of English as a foreign language. Our method relies on the concept of microsystems with features related to learner-specific linguistic systems in which several forms operate paradigmatically. Results on internal data show that different microsystems help classify writings from A1 to C2 levels (82% balanced accuracy). Overall results on external data show that a combination of lexical, syntactic, cohesive and accuracy features yields the most efficient classification across several corpora (59.2% balanced accuracy).

Original languageEnglish
Pages (from-to)130-146
Number of pages17
JournalReCALL
Volume34
Issue number2
DOIs
Publication statusPublished - 10 May 2022

Keywords

  • automatic essay scoring
  • criterial features
  • language functions
  • linguistic complexity
  • microsystem
  • supervised learning

Fingerprint

Dive into the research topics of 'Predicting CEFR levels in learners of English: The use of microsystem criterial features in a machine learning approach'. Together they form a unique fingerprint.

Cite this