Statistical modeling of student performance to improve Chinese dictation skills with an intelligent tutor

John Kowalski; Yanhui Zhang; Geoffrey Gordon

doi:10.5281/zenodo.3554679

Statistical modeling of student performance to improve Chinese dictation skills with an intelligent tutor

John Kowalski, Yanhui Zhang, Geoffrey Gordon

Research output: Journal Publication › Article › peer-review

Abstract

The Pinyin Tutor has been used the past few years at over thirty institutions around the world to teach students to transcribe spoken Chinese phrases into Pinyin. Large amounts of data have been collected from this program on the types of errors students make on this task. We analyze these data to discover what makes this task difficult and use our findings to iteratively improve the tutor. For instance, is a particular set of consonants, vowels, or tones causing the most difficulty? Or perhaps do certain challenges arise in the context in which these sounds are spoken? Since each Pinyin phrase can be broken down into a set of features (for example, consonants, vowel sounds, and tones), we apply machine learning techniques to uncover the most confounding aspects of this task. We then exploit what we learned to construct and maintain an accurate representation of what the student knows for best individual instruction. Our goal is to allow the learner to focus on the aspects of the task on which he or she is having most difficulty, thereby accelerating his or her understanding of spoken Chinese beyond what would be possible without such focused “intelligent” instruction.

Original language	English
Pages (from-to)	3-27
Number of pages	25
Journal	Journal of Educational Data Mining
Volume	6
Issue number	1
DOIs	https://doi.org/10.5281/zenodo.3554679
Publication status	Published - 2014
Externally published	Yes

Keywords

Pinyin Tutor
least angle regression (LARS)
LIBLINEAR-trained model
understanding of spoken Chinese
knowledge tracing
Hidden Markov Model

Access to Document

10.5281/zenodo.3554679

https://doi.org/10.5281/zenodo.3554679

Cite this

@article{a482e452060a405bbd785ed34b626d4a,

title = "Statistical modeling of student performance to improve Chinese dictation skills with an intelligent tutor",

abstract = "The Pinyin Tutor has been used the past few years at over thirty institutions around the world to teach students to transcribe spoken Chinese phrases into Pinyin. Large amounts of data have been collected from this program on the types of errors students make on this task. We analyze these data to discover what makes this task difficult and use our findings to iteratively improve the tutor. For instance, is a particular set of consonants, vowels, or tones causing the most difficulty? Or perhaps do certain challenges arise in the context in which these sounds are spoken? Since each Pinyin phrase can be broken down into a set of features (for example, consonants, vowel sounds, and tones), we apply machine learning techniques to uncover the most confounding aspects of this task. We then exploit what we learned to construct and maintain an accurate representation of what the student knows for best individual instruction. Our goal is to allow the learner to focus on the aspects of the task on which he or she is having most difficulty, thereby accelerating his or her understanding of spoken Chinese beyond what would be possible without such focused “intelligent” instruction.",

keywords = "Pinyin Tutor, least angle regression (LARS), LIBLINEAR-trained model, understanding of spoken Chinese, knowledge tracing, Hidden Markov Model",

author = "John Kowalski and Yanhui Zhang and Geoffrey Gordon",

year = "2014",

doi = "10.5281/zenodo.3554679",

language = "English",

volume = "6",

pages = "3--27",

journal = "Journal of Educational Data Mining",

issn = "2157-2100",

publisher = "International Educational Data Mining Society",

number = "1",

}

TY - JOUR

T1 - Statistical modeling of student performance to improve Chinese dictation skills with an intelligent tutor

AU - Kowalski, John

AU - Zhang, Yanhui

AU - Gordon, Geoffrey

PY - 2014

Y1 - 2014

N2 - The Pinyin Tutor has been used the past few years at over thirty institutions around the world to teach students to transcribe spoken Chinese phrases into Pinyin. Large amounts of data have been collected from this program on the types of errors students make on this task. We analyze these data to discover what makes this task difficult and use our findings to iteratively improve the tutor. For instance, is a particular set of consonants, vowels, or tones causing the most difficulty? Or perhaps do certain challenges arise in the context in which these sounds are spoken? Since each Pinyin phrase can be broken down into a set of features (for example, consonants, vowel sounds, and tones), we apply machine learning techniques to uncover the most confounding aspects of this task. We then exploit what we learned to construct and maintain an accurate representation of what the student knows for best individual instruction. Our goal is to allow the learner to focus on the aspects of the task on which he or she is having most difficulty, thereby accelerating his or her understanding of spoken Chinese beyond what would be possible without such focused “intelligent” instruction.

AB - The Pinyin Tutor has been used the past few years at over thirty institutions around the world to teach students to transcribe spoken Chinese phrases into Pinyin. Large amounts of data have been collected from this program on the types of errors students make on this task. We analyze these data to discover what makes this task difficult and use our findings to iteratively improve the tutor. For instance, is a particular set of consonants, vowels, or tones causing the most difficulty? Or perhaps do certain challenges arise in the context in which these sounds are spoken? Since each Pinyin phrase can be broken down into a set of features (for example, consonants, vowel sounds, and tones), we apply machine learning techniques to uncover the most confounding aspects of this task. We then exploit what we learned to construct and maintain an accurate representation of what the student knows for best individual instruction. Our goal is to allow the learner to focus on the aspects of the task on which he or she is having most difficulty, thereby accelerating his or her understanding of spoken Chinese beyond what would be possible without such focused “intelligent” instruction.

KW - Pinyin Tutor

KW - least angle regression (LARS)

KW - LIBLINEAR-trained model

KW - understanding of spoken Chinese

KW - knowledge tracing

KW - Hidden Markov Model

U2 - 10.5281/zenodo.3554679

DO - 10.5281/zenodo.3554679

M3 - Article

SN - 2157-2100

VL - 6

SP - 3

EP - 27

JO - Journal of Educational Data Mining

JF - Journal of Educational Data Mining

IS - 1

ER -

Statistical modeling of student performance to improve Chinese dictation skills with an intelligent tutor

Abstract

Keywords

Access to Document

Fingerprint

Cite this