SOTAVerified

Legal Reasoning

Papers

Showing 150 of 92 papers

TitleStatusHype
GPT-4 Technical ReportCode6
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal ServicesCode2
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal DomainCode1
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLMCode1
A Comprehensive Evaluation of Large Language Models on Legal Judgment PredictionCode1
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal ReasoningCode1
CaseGen: A Benchmark for Multi-Stage Legal Case Documents GenerationCode1
Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language ModelsCode0
Investigating the Shortcomings of LLMs in Step-by-Step Legal ReasoningCode0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian LawsCode0
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?Code0
Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent CollaborationCode0
ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human RightsCode0
One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial SupportCode0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
Causality and Responsibility for Formal Verification and BeyondCode0
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-CommerceCode0
Chain of Logic: Rule-Based Reasoning with Large Language ModelsCode0
Software Engineering Methods For AI-Driven Deductive Legal ReasoningCode0
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language ModelsCode0
LegalBench: Prototyping a Collaborative Benchmark for Legal ReasoningCode0
Weak-to-Strong Generalization beyond Accuracy: a Pilot Study in Safety, Toxicity, and Legal ReasoningCode0
Claim Extraction and Law Matching for COVID-19-related LegislationCode0
Passing the Brazilian OAB Exam: data preparation and some experimentsCode0
Designing Normative Theories for Ethical and Legal Reasoning: LogiKEy Framework, Methodology, and Tool SupportCode0
LegiLM: A Fine-Tuned Legal Language Model for Data ComplianceCode0
LeKUBE: A Legal Knowledge Update BEnchmarkCode0
LLM-based HSE Compliance Assessment: Benchmark, Performance, and AdvancementsCode0
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair UseCode0
Modeling Legal Reasoning: LM Annotation at the Edge of Human AgreementCode0
KRAG Framework for Enhancing LLMs in the Legal Domain0
LAPIS: Language Model-Augmented Police Investigation System0
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights0
Large Language Models Acing Chartered Accountancy0
Large Language Models in Cryptocurrency Securities Cases: Can a GPT Model Meaningfully Assist Lawyers?0
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans0
Law to Binary Tree -- An Formal Interpretation of Legal Natural Language0
An Argumentation-Based Legal Reasoning Approach for DL-Ontology0
Legal Evalutions and Challenges of Large Language Models0
Legal Judgment Prediction (LJP) Amid the Advent of Autonomous AI Legal Reasoning0
Legal Prompting: Teaching a Language Model to Think Like a Lawyer0
Legal Sentiment Analysis and Opinion Mining (LSAOM): Assimilating Advances in Autonomous AI Legal Reasoning0
LEXam: Benchmarking Legal Reasoning on 340 Law Exams0
Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York0
Multidimensionality of Legal Singularity: Parametric Analysis and the Autonomous Levels of AI Legal Reasoning0
New Algebraic Normative Theories for Ethical and Legal Reasoning in the LogiKEy Framework0
Non-Determinism and the Lawlessness of Machine Learning Code0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?0
Proceedings First Workshop on Causal Reasoning for Embedded and safety-critical Systems Technologies0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy82.9Unverified
2GPT-3.5Balanced Accuracy60.9Unverified
3Claude-1Balanced Accuracy58.1Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy59.2Unverified