SOTAVerified

Automated Essay Scoring

Essay scoring: Automated Essay Scoring is the task of assigning a score to an essay, usually in the context of assessing the language ability of a language learner. The quality of an essay is affected by the following four primary dimensions: topic relevance, organization and coherence, word usage and sentence complexity, and grammar and mechanics.

Source: A Joint Model for Multimodal Document Quality Assessment

Papers

Showing 150 of 104 papers

TitleStatusHype
On the Use of BERT for Automated Essay Scoring: Joint Learning of Multi-Scale Essay RepresentationCode1
EXPATS: A Toolkit for Explainable Automated Text ScoringCode1
Prompt- and Trait Relation-aware Cross-prompt Essay Trait ScoringCode1
Automated Essay Scoring based on Two-Stage LearningCode1
Automated Essay Scoring Using Transformer ModelsCode1
Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMsCode1
Automated Essay Scoring via Pairwise Contrastive RegressionCode1
A Prompt-independent and Interpretable Automated Essay Scoring Method for Chinese Second Language WritingCode1
Many Hands Make Light Work: Using Essay Traits to Automatically Score EssaysCode1
Countering the Influence of Essay Length in Neural Essay ScoringCode1
Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring SystemsCode1
H-AES: Towards Automated Essay Scoring for HindiCode0
Language models and Automated Essay ScoringCode0
Co-Attention Based Neural Network for Source-Dependent Essay ScoringCode0
SkipFlow: Incorporating Neural Coherence Features for End-to-End Automatic Text ScoringCode0
Prompt Agnostic Essay Scorer: A Domain Generalization Approach to Cross-prompt Automated Essay ScoringCode0
WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in WikipediaCode0
Learning to love diligent trolls: Accounting for rater effects in the dialogue safety taskCode0
Modeling Structural Similarities between Documents for Coherence Assessment with Graph Convolutional NetworksCode0
VerAs: Verify then Assess STEM Lab ReportsCode0
Exploring LLM Prompting Strategies for Joint Essay Scoring and Feedback GenerationCode0
Unveiling the Tapestry of Automated Essay Scoring: A Comprehensive Investigation of Accuracy, Fairness, and GeneralizabilityCode0
A Neural Approach to Automated Essay ScoringCode0
Can Large Language Models Automatically Score Proficiency of Written Essays?Code0
Autoregressive Score Generation for Multi-trait Essay ScoringCode0
Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed CounterfactualsCode0
Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted InputCode0
Can GPT-4 do L2 analytic assessment?0
Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance0
Automated essay scoring using efficient transformer-based language models0
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring0
Automated Essay Scoring System for Nonnative Japanese Learners0
Automated Essay Scoring in the Presence of Biased Ratings0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards0
Enhancing Marker Scoring Accuracy through Ordinal Confidence Modelling in Educational Assessments0
Automatic Features for Essay Scoring -- An Empirical Study0
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems0
Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech0
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression0
Encoder Decoder Approach to Automated Essay Scoring For Deeper Semantic Analysis0
Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring0
Automated Essay Scoring in Argumentative Writing: DeBERTeachingAssistant0
Empirical Study of Large Language Models as Automated Essay Scoring Tools in English Composition__Taking TOEFL Independent Writing Task for Example0
Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection0
Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking0
Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling0
DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing0
Equity Beyond Bias in Language Technologies for Education0
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models0
Automated essay scoring with string kernels and word embeddings0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Neural Pairwise Contrastive Regression (NPCR)Quadratic Weighted Kappa0.82Unverified
2Tran-BERT-MS-ML-RQuadratic Weighted Kappa0.79Unverified
3Considering-Content-XLNetQuadratic Weighted Kappa0.79Unverified
4HISK+BOSWEQuadratic Weighted Kappa0.79Unverified
5SkipFlowQuadratic Weighted Kappa0.76Unverified
6MHMLWQuadratic Weighted Kappa0.76Unverified
7AFQuadratic Weighted Kappa0.73Unverified
8FDAQuadratic Weighted Kappa0.71Unverified