SOTAVerified

Legal Reasoning

Papers

Showing 125 of 92 papers

TitleStatusHype
IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders0
Large Language Models Acing Chartered Accountancy0
CHANCERY: Evaluating Corporate Governance Reasoning Capabilities in Language Models0
When Fairness Isn't Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian LawsCode0
LLM-based HSE Compliance Assessment: Benchmark, Performance, and AdvancementsCode0
LEXam: Benchmarking Legal Reasoning on 340 Law Exams0
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair UseCode0
SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning0
Engineering the Law-Machine Learning Translation Problem: Developing Legally Aligned Models0
Continual Pre-Training is (not) What You Need in Domain Adaption0
KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding0
An Explicit Syllogistic Legal Reasoning Framework for Large Language Models0
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond0
Adaptively profiling models with task elicitation0
CaseGen: A Benchmark for Multi-Stage Legal Case Documents GenerationCode1
Towards Robust Legal Reasoning: Harnessing Logical LLMs in Law0
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal ReasoningCode1
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLMCode1
Investigating the Shortcomings of LLMs in Step-by-Step Legal ReasoningCode0
Artificial Intelligence and Legal Analysis: Implications for Legal Education and the Profession0
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy82.9Unverified
2GPT-3.5Balanced Accuracy60.9Unverified
3Claude-1Balanced Accuracy58.1Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4Balanced Accuracy59.2Unverified