SOTAVerified

Automated Essay Scoring

Automated Essay Scoring is the task of assigning a score to an essay, usually to assess the language ability of a language learner. Essay quality depends on four primary dimensions: topic relevance, organization and coherence, word usage and sentence complexity, and grammar and mechanics.

Source: A Joint Model for Multimodal Document Quality Assessment

Papers

Showing 1–50 of 104 papers

| Title | Status | Hype |
|---|---|---|
| Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays | Code | 1 |
| Automated Essay Scoring via Pairwise Contrastive Regression | Code | 1 |
| Automated Essay Scoring Using Transformer Models | Code | 1 |
| Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs | Code | 1 |
| EXPATS: A Toolkit for Explainable Automated Text Scoring | Code | 1 |
| On the Use of BERT for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation | Code | 1 |
| A Prompt-independent and Interpretable Automated Essay Scoring Method for Chinese Second Language Writing | Code | 1 |
| Prompt- and Trait Relation-aware Cross-prompt Essay Trait Scoring | Code | 1 |
| Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring Systems | Code | 1 |
| Automated Essay Scoring based on Two-Stage Learning | Code | 1 |
| Countering the Influence of Essay Length in Neural Essay Scoring | Code | 1 |
| CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | | 0 |
| Automated Essay Scoring System for Nonnative Japanese Learners | | 0 |
| Automated essay scoring using efficient transformer-based language models | | 0 |
| Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech | | 0 |
| Enhancing Marker Scoring Accuracy through Ordinal Confidence Modelling in Educational Assessments | | 0 |
| Automatic Features for Essay Scoring -- An Empirical Study | | 0 |
| Exploring Automated Essay Scoring for Nonnative English Speakers | | 0 |
| Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring | | 0 |
| Automated Essay Scoring in Argumentative Writing: DeBERTeachingAssistant | | 0 |
| Automated essay scoring with string kernels and word embeddings | | 0 |
| Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression | | 0 |
| Equity Beyond Bias in Language Technologies for Education | | 0 |
| Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | | 0 |
| Automated Essay Scoring in the Presence of Biased Ratings | | 0 |
| Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems | | 0 |
| Automated Essay Scoring with Discourse-Aware Neural Models | | 0 |
| Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system | | 0 |
| Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance | | 0 |
| Can GPT-4 do L2 analytic assessment? | | 0 |
| ARWI: Arabic Write and Improve | | 0 |
| Corruption Is Not All Bad: Incorporating Discourse Structure into Pre-training via Corruption for Essay Scoring | | 0 |
| Constrained Multi-Task Learning for Automated Essay Scoring | | 0 |
| Computing with Subjectivity Lexicons | | 0 |
| Data Augmentation for Automated Essay Scoring using Transformer Models | | 0 |
| Automated Essay Scoring for Swedish | | 0 |
| Composable Cross-prompt Essay Scoring by Merging Models | | 0 |
| Are Large Language Models Good Essay Graders? | | 0 |
| Does the Prompt-based Large Language Model Recognize Students' Demographics and Introduce Bias in Essay Scoring? | | 0 |
| Do We Need a Detailed Rubric for Automated Essay Scoring using Large Language Models? | | 0 |
| DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing | | 0 |
| Empirical Study of Large Language Models as Automated Essay Scoring Tools in English Composition__Taking TOEFL Independent Writing Task for Example | | 0 |
| Encoder Decoder Approach to Automated Essay Scoring For Deeper Semantic Analysis | | 0 |
| Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection | | 0 |
| Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking | | 0 |
| Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling | | 0 |
| Centering-based Neural Coherence Modeling with Hierarchical Discourse Segments | | 0 |
| Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory | | 0 |
| EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models | | 0 |
| Automated Essay Scoring by Maximizing Human-Machine Agreement | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Neural Pairwise Contrastive Regression (NPCR) | Quadratic Weighted Kappa | 0.82 | | Unverified |
| 2 | Tran-BERT-MS-ML-R | Quadratic Weighted Kappa | 0.79 | | Unverified |
| 3 | Considering-Content-XLNet | Quadratic Weighted Kappa | 0.79 | | Unverified |
| 4 | HISK+BOSWE | Quadratic Weighted Kappa | 0.79 | | Unverified |
| 5 | SkipFlow | Quadratic Weighted Kappa | 0.76 | | Unverified |
| 6 | MHMLW | Quadratic Weighted Kappa | 0.76 | | Unverified |
| 7 | AF | Quadratic Weighted Kappa | 0.73 | | Unverified |
| 8 | FDA | Quadratic Weighted Kappa | 0.71 | | Unverified |
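
Every entry in the benchmark results reports Quadratic Weighted Kappa (QWK), which measures agreement between predicted and human scores and penalizes large disagreements more heavily than small ones. The sketch below shows one common way to compute it, assuming integer scores on a known scale. The function name, its parameters, and the sample scores are illustrative assumptions and are not taken from any of the listed papers.

```python
from collections import Counter


def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Return QWK for two equal-length lists of integer scores.

    1.0 means perfect agreement, 0.0 means chance-level agreement,
    and negative values mean agreement worse than chance.
    Hypothetical helper for illustration only.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("score lists must be non-empty and of equal length")
    n_ratings = max_rating - min_rating + 1
    if n_ratings < 2:
        raise ValueError("the score scale needs at least two distinct values")

    n = len(rater_a)
    observed = Counter(zip(rater_a, rater_b))  # joint counts of (a, b) pairs
    hist_a = Counter(rater_a)                  # marginal counts for rater A
    hist_b = Counter(rater_b)                  # marginal counts for rater B

    numerator = 0.0    # weighted observed disagreement
    denominator = 0.0  # weighted disagreement expected by chance
    for i in range(min_rating, max_rating + 1):
        for j in range(min_rating, max_rating + 1):
            weight = (i - j) ** 2 / (n_ratings - 1) ** 2
            expected = hist_a[i] * hist_b[j] / n
            numerator += weight * observed[(i, j)]
            denominator += weight * expected

    if denominator == 0:
        # Both raters gave the same single score to every essay.
        return 1.0
    return 1.0 - numerator / denominator


# Made-up scores on a 1-4 scale, for illustration only.
human = [1, 2, 3, 4, 4, 2]
model = [1, 2, 3, 3, 4, 2]
print(round(quadratic_weighted_kappa(human, model, 1, 4), 3))
```

For real evaluations, an established implementation such as scikit-learn's `cohen_kappa_score` with `weights="quadratic"` is usually the safer choice.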