SOTAVerified

Automated Essay Scoring

Essay scoring: Automated Essay Scoring is the task of assigning a score to an essay, usually in the context of assessing the language ability of a language learner. The quality of an essay is affected by the following four primary dimensions: topic relevance, organization and coherence, word usage and sentence complexity, and grammar and mechanics.

Source: A Joint Model for Multimodal Document Quality Assessment

Papers

Showing 51100 of 104 papers

TitleStatusHype
Investigating neural architectures for short answer scoring0
Is GPT-4 Alone Sufficient for Automated Essay Scoring?: A Comparative Judgment Approach Based on Rater Cognition0
LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models0
LinggleWrite: a Coaching System for Essay Writing0
Multiple Instance Learning for Content Feedback Localization without Annotation0
Multi-task Learning for Automated Essay Scoring with Sentiment Analysis0
MWE for Essay Scoring English as a Foreign Language0
Neural Automated Essay Scoring Incorporating Handcrafted Features0
Neural Multi-task Learning in Automated Assessment0
On the Suitability of pre-trained foundational LLMs for Analysis in German Legal Education0
Predicting Grammaticality on an Ordinal Scale0
Unleashing Large Language Models' Proficiency in Zero-shot Essay Scoring0
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs0
Regression or classification? Automated Essay Scoring for Norwegian0
Review of feedback in Automated Essay Scoring0
Rubric-Specific Approach to Automated Essay Scoring with Augmentation Training0
Should You Fine-Tune BERT for Automated Essay Scoring?0
TDNN: A Two-stage Deep Neural Network for Prompt-independent Automated Essay Scoring0
The Effectiveness of a Dynamic Loss Function in Neural Network Based Automated Essay Scoring0
The effects of data size on Automated Essay Scoring engines0
The Impact of Example Selection in Few-Shot Prompting on Automated Essay Scoring Using GPT Models0
Toward Educator-focused Automated Scoring Systems for Reading and Writing0
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection0
TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring0
UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia0
Using Active Learning Methods to Strategically Select Essays for Automated Scoring0
Analytic Automated Essay Scoring Based on Deep Neural Networks Integrating Multidimensional Item Response Theory0
Word Embedding for Response-To-Text Assessment of Evidence0
Are Large Language Models Good Essay Graders?0
ARWI: Arabic Write and Improve0
Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech0
Automated Essay Scoring by Maximizing Human-Machine Agreement0
Automated Essay Scoring for Swedish0
Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system0
Automated Essay Scoring in Argumentative Writing: DeBERTeachingAssistant0
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems0
Automated Essay Scoring in the Presence of Biased Ratings0
Automated Essay Scoring System for Nonnative Japanese Learners0
Automated essay scoring using efficient transformer-based language models0
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory0
Data Augmentation for Automated Essay Scoring using Transformer Models0
Automated Essay Scoring with Discourse-Aware Neural Models0
Automated essay scoring with string kernels and word embeddings0
Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring0
Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression0
Automatic Features for Essay Scoring -- An Empirical Study0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards0
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring0
Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance0
Can GPT-4 do L2 analytic assessment?0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Neural Pairwise Contrastive Regression (NPCR)Quadratic Weighted Kappa0.82Unverified
2Tran-BERT-MS-ML-RQuadratic Weighted Kappa0.79Unverified
3Considering-Content-XLNetQuadratic Weighted Kappa0.79Unverified
4HISK+BOSWEQuadratic Weighted Kappa0.79Unverified
5SkipFlowQuadratic Weighted Kappa0.76Unverified
6MHMLWQuadratic Weighted Kappa0.76Unverified
7AFQuadratic Weighted Kappa0.73Unverified
8FDAQuadratic Weighted Kappa0.71Unverified