SOTAVerified

Legal Reasoning

Papers

Showing 5175 of 92 papers

TitleStatusHype
Deconstructing Legal Text_Object Oriented Design in Legal Adjudication0
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains0
Modelling Value-oriented Legal Reasoning in LogiKEy0
Engineering the Law-Machine Learning Translation Problem: Developing Legally Aligned Models0
Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications0
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond0
Explainable machine learning multi-label classification of Spanish legal judgements0
Exploiting Domain-Specific Knowledge for Judgment Prediction Is No Panacea0
Exploring the psychology of LLMs' Moral and Legal Reasoning0
Formalising Anti-Discrimination Law in Automated Decision Systems0
IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders0
KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding0
KRAG Framework for Enhancing LLMs in the Legal Domain0
LAPIS: Language Model-Augmented Police Investigation System0
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights0
Large Language Models Acing Chartered Accountancy0
Large Language Models in Cryptocurrency Securities Cases: Can a GPT Model Meaningfully Assist Lawyers?0
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans0
Claim Extraction and Law Matching for COVID-19-related LegislationCode0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-CommerceCode0
Designing Normative Theories for Ethical and Legal Reasoning: LogiKEy Framework, Methodology, and Tool SupportCode0
Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language ModelsCode0
Chain of Logic: Rule-Based Reasoning with Large Language ModelsCode0
LegiLM: A Fine-Tuned Legal Language Model for Data ComplianceCode0
Show:102550
← PrevPage 3 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy82.9Unverified
2GPT-3.5Balanced Accuracy60.9Unverified
3Claude-1Balanced Accuracy58.1Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy59.2Unverified