SOTAVerified

Legal Reasoning

Papers

Showing 1-25 of 92 papers

Title | Status | Hype
GPT-4 Technical Report | Code | 6
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Code | 2
CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation | Code | 1
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning | Code | 1
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM | Code | 1
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Code | 1
A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction | Code | 1
IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders | | 0
Large Language Models Acing Chartered Accountancy | | 0
CHANCERY: Evaluating Corporate Governance Reasoning Capabilities in Language Models | | 0
When Fairness Isn't Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning | | 0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Code | 0
LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements | Code | 0
LEXam: Benchmarking Legal Reasoning on 340 Law Exams | | 0
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use | Code | 0
SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning | | 0
Engineering the Law-Machine Learning Translation Problem: Developing Legally Aligned Models | | 0
Continual Pre-Training is (not) What You Need in Domain Adaption | | 0
KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding | | 0
An Explicit Syllogistic Legal Reasoning Framework for Large Language Models | | 0
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond | | 0
Adaptively profiling models with task elicitation | | 0
Towards Robust Legal Reasoning: Harnessing Logical LLMs in Law | | 0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Code | 0
Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Balanced Accuracy | 82.9 | | Unverified
2 | GPT-3.5 | Balanced Accuracy | 60.9 | | Unverified
3 | Claude-1 | Balanced Accuracy | 58.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Balanced Accuracy | 59.2 | | Unverified