SOTAVerified

Automated Essay Scoring

Essay scoring: Automated Essay Scoring is the task of assigning a score to an essay, usually in the context of assessing the language ability of a language learner. The quality of an essay is affected by the following four primary dimensions: topic relevance, organization and coherence, word usage and sentence complexity, and grammar and mechanics.

Source: A Joint Model for Multimodal Document Quality Assessment

Papers

Showing 51100 of 104 papers

TitleStatusHype
Automated Essay Scoring in the Presence of Biased Ratings0
Automated Essay Scoring System for Nonnative Japanese Learners0
Unleashing Large Language Models' Proficiency in Zero-shot Essay Scoring0
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs0
Regression or classification? Automated Essay Scoring for Norwegian0
Review of feedback in Automated Essay Scoring0
Rubric-Specific Approach to Automated Essay Scoring with Augmentation Training0
Should You Fine-Tune BERT for Automated Essay Scoring?0
TDNN: A Two-stage Deep Neural Network for Prompt-independent Automated Essay Scoring0
The Effectiveness of a Dynamic Loss Function in Neural Network Based Automated Essay Scoring0
The effects of data size on Automated Essay Scoring engines0
The Impact of Example Selection in Few-Shot Prompting on Automated Essay Scoring Using GPT Models0
Toward Educator-focused Automated Scoring Systems for Reading and Writing0
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection0
TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring0
UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia0
Using Active Learning Methods to Strategically Select Essays for Automated Scoring0
Word Embedding for Response-To-Text Assessment of Evidence0
Exploring Automated Essay Scoring for Nonnative English Speakers0
LLM-as-a-tutor in EFL Writing Education: Focusing on Evaluation of Student-LLM Interaction0
Flexible Domain Adaptation for Automated Essay Scoring Using Correlated Linear Regression0
Frustratingly Simple Prompting-based Text Denoising0
Give Me More Feedback: Annotating Argument Persuasiveness and Related Attributes in Student Essays0
Give Me More Feedback II: Annotating Thesis Strength and Related Attributes in Student Essays0
Graded Relevance Scoring of Written Essays with Dense Retrieval0
How well can LLMs Grade Essays in Arabic?0
Improving Performance of Automated Essay Scoring by using back-translation essays and adjusted scores0
Investigating neural architectures for short answer scoring0
Is GPT-4 Alone Sufficient for Automated Essay Scoring?: A Comparative Judgment Approach Based on Rater Cognition0
LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models0
LinggleWrite: a Coaching System for Essay Writing0
Multiple Instance Learning for Content Feedback Localization without Annotation0
Multi-task Learning for Automated Essay Scoring with Sentiment Analysis0
MWE for Essay Scoring English as a Foreign Language0
Analytic Automated Essay Scoring Based on Deep Neural Networks Integrating Multidimensional Item Response Theory0
Neural Multi-task Learning in Automated Assessment0
On the Suitability of pre-trained foundational LLMs for Analysis in German Legal Education0
Predicting Grammaticality on an Ordinal Scale0
Co-Attention Based Neural Network for Source-Dependent Essay ScoringCode0
SkipFlow: Incorporating Neural Coherence Features for End-to-End Automatic Text ScoringCode0
Exploring LLM Prompting Strategies for Joint Essay Scoring and Feedback GenerationCode0
Learning to love diligent trolls: Accounting for rater effects in the dialogue safety taskCode0
Autoregressive Score Generation for Multi-trait Essay ScoringCode0
Can Large Language Models Automatically Score Proficiency of Written Essays?Code0
Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed CounterfactualsCode0
Modeling Structural Similarities between Documents for Coherence Assessment with Graph Convolutional NetworksCode0
WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in WikipediaCode0
Prompt Agnostic Essay Scorer: A Domain Generalization Approach to Cross-prompt Automated Essay ScoringCode0
H-AES: Towards Automated Essay Scoring for HindiCode0
A Neural Approach to Automated Essay ScoringCode0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Neural Pairwise Contrastive Regression (NPCR)Quadratic Weighted Kappa0.82Unverified
2Tran-BERT-MS-ML-RQuadratic Weighted Kappa0.79Unverified
3Considering-Content-XLNetQuadratic Weighted Kappa0.79Unverified
4HISK+BOSWEQuadratic Weighted Kappa0.79Unverified
5SkipFlowQuadratic Weighted Kappa0.76Unverified
6MHMLWQuadratic Weighted Kappa0.76Unverified
7AFQuadratic Weighted Kappa0.73Unverified
8FDAQuadratic Weighted Kappa0.71Unverified