SOTAVerified

Legal Reasoning

Papers

Showing 2650 of 92 papers

TitleStatusHype
Investigating the Shortcomings of LLMs in Step-by-Step Legal ReasoningCode0
Modeling Legal Reasoning: LM Annotation at the Edge of Human AgreementCode0
Weak-to-Strong Generalization beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal ReasoningCode0
Exploring the psychology of LLMs' Moral and Legal Reasoning0
Formalising Anti-Discrimination Law in Automated Decision Systems0
IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders0
KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding0
KRAG Framework for Enhancing LLMs in the Legal Domain0
LAPIS: Language Model-Augmented Police Investigation System0
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights0
Large Language Models Acing Chartered Accountancy0
Large Language Models in Cryptocurrency Securities Cases: Can a GPT Model Meaningfully Assist Lawyers?0
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans0
Law to Binary Tree -- An Formal Interpretation of Legal Natural Language0
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models0
LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning0
An Argumentation-Based Legal Reasoning Approach for DL-Ontology0
Legal Evalutions and Challenges of Large Language Models0
Legal Judgment Prediction (LJP) Amid the Advent of Autonomous AI Legal Reasoning0
Legal Prompting: Teaching a Language Model to Think Like a Lawyer0
Legal Sentiment Analysis and Opinion Mining (LSAOM): Assimilating Advances in Autonomous AI Legal Reasoning0
LEXam: Benchmarking Legal Reasoning on 340 Law Exams0
Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York0
Multidimensionality of Legal Singularity: Parametric Analysis and the Autonomous Levels of AI Legal Reasoning0
New Algebraic Normative Theories for Ethical and Legal Reasoning in the LogiKEy Framework0
Show:102550
← PrevPage 2 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy82.9Unverified
2GPT-3.5Balanced Accuracy60.9Unverified
3Claude-1Balanced Accuracy58.1Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy59.2Unverified