SOTAVerified

Legal Reasoning

Papers

Showing 51-92 of 92 papers

| Title | Status | Hype |
|---|---|---|
| Deconstructing Legal Text_Object Oriented Design in Legal Adjudication | | 0 |
| Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | | 0 |
| Modelling Value-oriented Legal Reasoning in LogiKEy | | 0 |
| Engineering the Law-Machine Learning Translation Problem: Developing Legally Aligned Models | | 0 |
| Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications | | 0 |
| Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond | | 0 |
| Explainable machine learning multi-label classification of Spanish legal judgements | | 0 |
| Exploiting Domain-Specific Knowledge for Judgment Prediction Is No Panacea | | 0 |
| Exploring the psychology of LLMs' Moral and Legal Reasoning | | 0 |
| Formalising Anti-Discrimination Law in Automated Decision Systems | | 0 |
| IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders | | 0 |
| KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding | | 0 |
| KRAG Framework for Enhancing LLMs in the Legal Domain | | 0 |
| LAPIS: Language Model-Augmented Police Investigation System | | 0 |
| LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights | | 0 |
| Large Language Models Acing Chartered Accountancy | | 0 |
| Large Language Models in Cryptocurrency Securities Cases: Can a GPT Model Meaningfully Assist Lawyers? | | 0 |
| Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans | | 0 |
| Claim Extraction and Law Matching for COVID-19-related Legislation | Code | 0 |
| NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Code | 0 |
| TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce | Code | 0 |
| Designing Normative Theories for Ethical and Legal Reasoning: LogiKEy Framework, Methodology, and Tool Support | Code | 0 |
| Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models | Code | 0 |
| Chain of Logic: Rule-Based Reasoning with Large Language Models | Code | 0 |
| LegiLM: A Fine-Tuned Legal Language Model for Data Compliance | Code | 0 |
| ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights | Code | 0 |
| Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning | Code | 0 |
| LeKUBE: A Legal Knowledge Update BEnchmark | Code | 0 |
| One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial Support | Code | 0 |
| Software Engineering Methods For AI-Driven Deductive Legal Reasoning | Code | 0 |
| LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements | Code | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Code | 0 |
| LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models | Code | 0 |
| LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning | Code | 0 |
| Causality and Responsibility for Formal Verification and Beyond | Code | 0 |
| Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement | Code | 0 |
| Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent Collaboration | Code | 0 |
| Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use | Code | 0 |
| Passing the Brazilian OAB Exam: data preparation and some experiments | Code | 0 |
| Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning | Code | 0 |
| Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer? | Code | 0 |
| Weak-to-Strong Generalization beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal Reasoning | Code | 0 |

Benchmark Results

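| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 | Balanced Accuracy | 82.9 | | Unverified |
| 2 | GPT-3.5 | Balanced Accuracy | 60.9 | | Unverified |
| 3 | Claude-1 | Balanced Accuracy | 58.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 | Balanced Accuracy | 59.2 | | Unverified |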