SOTAVerified

Automated Essay Scoring

Essay scoring: Automated Essay Scoring is the task of assigning a score to an essay, usually in the context of assessing the language ability of a language learner. The quality of an essay is affected by the following four primary dimensions: topic relevance, organization and coherence, word usage and sentence complexity, and grammar and mechanics.

Source: A Joint Model for Multimodal Document Quality Assessment

Papers

Showing 76100 of 104 papers

TitleStatusHype
Using Active Learning Methods to Strategically Select Essays for Automated Scoring0
Analytic Automated Essay Scoring Based on Deep Neural Networks Integrating Multidimensional Item Response Theory0
Word Embedding for Response-To-Text Assessment of Evidence0
Are Large Language Models Good Essay Graders?0
ARWI: Arabic Write and Improve0
Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech0
Automated Essay Scoring by Maximizing Human-Machine Agreement0
Automated Essay Scoring for Swedish0
Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system0
Automated Essay Scoring in Argumentative Writing: DeBERTeachingAssistant0
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems0
Automated Essay Scoring in the Presence of Biased Ratings0
Automated Essay Scoring System for Nonnative Japanese Learners0
Automated essay scoring using efficient transformer-based language models0
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory0
Data Augmentation for Automated Essay Scoring using Transformer Models0
Automated Essay Scoring with Discourse-Aware Neural Models0
Automated essay scoring with string kernels and word embeddings0
Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring0
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression0
Automatic Features for Essay Scoring -- An Empirical Study0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards0
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring0
Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance0
Can GPT-4 do L2 analytic assessment?0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Neural Pairwise Contrastive Regression (NPCR)Quadratic Weighted Kappa0.82Unverified
2Tran-BERT-MS-ML-RQuadratic Weighted Kappa0.79Unverified
3Considering-Content-XLNetQuadratic Weighted Kappa0.79Unverified
4HISK+BOSWEQuadratic Weighted Kappa0.79Unverified
5SkipFlowQuadratic Weighted Kappa0.76Unverified
6MHMLWQuadratic Weighted Kappa0.76Unverified
7AFQuadratic Weighted Kappa0.73Unverified
8FDAQuadratic Weighted Kappa0.71Unverified