SOTAVerified

Legal Reasoning

Papers

Showing 125 of 92 papers

TitleStatusHype
GPT-4 Technical ReportCode6
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal ServicesCode2
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal DomainCode1
CaseGen: A Benchmark for Multi-Stage Legal Case Documents GenerationCode1
A Comprehensive Evaluation of Large Language Models on Legal Judgment PredictionCode1
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLMCode1
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal ReasoningCode1
LeKUBE: A Legal Knowledge Update BEnchmarkCode0
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language ModelsCode0
LLM-based HSE Compliance Assessment: Benchmark, Performance, and AdvancementsCode0
Chain of Logic: Rule-Based Reasoning with Large Language ModelsCode0
Causality and Responsibility for Formal Verification and BeyondCode0
LegalBench: Prototyping a Collaborative Benchmark for Legal ReasoningCode0
LegiLM: A Fine-Tuned Legal Language Model for Data ComplianceCode0
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?Code0
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair UseCode0
Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language ModelsCode0
ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human RightsCode0
Claim Extraction and Law Matching for COVID-19-related LegislationCode0
Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent CollaborationCode0
Designing Normative Theories for Ethical and Legal Reasoning: LogiKEy Framework, Methodology, and Tool SupportCode0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
Investigating the Shortcomings of LLMs in Step-by-Step Legal ReasoningCode0
Software Engineering Methods For AI-Driven Deductive Legal ReasoningCode0
Modeling Legal Reasoning: LM Annotation at the Edge of Human AgreementCode0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy82.9Unverified
2GPT-3.5Balanced Accuracy60.9Unverified
3Claude-1Balanced Accuracy58.1Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy59.2Unverified