This study examines the reliability of automated essay scoring (AES) and investigates the validity of the writing (sub)constructs that Criterion does and does not measure. Criterion's evaluations of test-takers' essays, written in response to TOEFL iBT independent writing tasks, were compared with human raters' evaluations. In particular, the study explored which essay features were most closely related to each of Criterion's six analytic dimensions. Five prompt types were used to create a writing test administered to fifty college students in Seoul. The results showed moderate agreement between the human raters and Criterion. Three essay features (development, organization, and grammar/usage) were the crucial predictors of the holistic score in human rating, whereas organization was the most powerful predictor of the AES overall score, followed by mechanics and then sentence variety/construction. This discrepancy in the features that predict writing scores may reflect that certain sentence constructions were evaluated differently by Criterion and the human raters. The result suggests that Criterion's feature dimensions need refinement with respect to the constructs they are intended to measure. The findings have implications for teaching process writing and for the use of AES.
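The abstract does not specify which statistics were used, but analyses of this kind are commonly run as a weighted kappa for human-machine agreement and a multiple regression of holistic scores on analytic feature scores. The sketch below is a minimal illustration under that assumption only; the data, variable names, and score scales are entirely hypothetical and are not taken from the study.

```python
# Hypothetical sketch of the two analyses the abstract describes:
# (1) human-Criterion agreement, (2) which analytic features predict scores.
# All data below are invented; the study's actual method is not given here.
import numpy as np
from sklearn.metrics import cohen_kappa_score
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)

# Invented 0-5 holistic scores for 50 essays from a human rater and from AES.
human = rng.integers(1, 6, size=50)
aes = np.clip(human + rng.integers(-1, 2, size=50), 0, 5)

# Quadratically weighted kappa is a common human-machine agreement index;
# values around 0.4-0.6 are conventionally read as "moderate" agreement.
kappa = cohen_kappa_score(human, aes, weights="quadratic")
print(f"quadratic weighted kappa: {kappa:.2f}")

# Invented analytic feature scores (e.g., development, organization,
# grammar/usage, mechanics, sentence variety, style), all on a common
# 0-5 scale so the regression coefficients are roughly comparable.
features = rng.uniform(0, 5, size=(50, 6))

# Regressing holistic scores on the six features indicates which
# dimensions carry the most predictive weight for the overall score.
model = LinearRegression().fit(features, human)
print("coefficients:", np.round(model.coef_, 2))
```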