| Continual Pre-Training is (not) What You Need in Domain Adaption | Apr 18, 2025 | Decision Making, Domain Adaptation | —Unverified | 0 |
| KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding | Apr 17, 2025 | Diagnostic, Legal Reasoning | —Unverified | 0 |
| An Explicit Syllogistic Legal Reasoning Framework for Large Language Models | Apr 5, 2025 | Legal Reasoning | —Unverified | 0 |
| Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond | Mar 20, 2025 | Legal Reasoning | —Unverified | 0 |
| Adaptively profiling models with task elicitation | Mar 3, 2025 | Hallucination, Language Modeling | —Unverified | 0 |
| CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation | Feb 25, 2025 | Legal Reasoning | Code Available | 1 |
| Towards Robust Legal Reasoning: Harnessing Logical LLMs in Law | Feb 24, 2025 | Legal Reasoning, Natural Language Understanding | —Unverified | 0 |
| JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning | Feb 24, 2025 | Legal Reasoning | Code Available | 1 |
| NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Feb 15, 2025 | Chunking, Information Retrieval | Code Available | 0 |
| Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York | Feb 13, 2025 | Legal Reasoning, Logical Reasoning | —Unverified | 0 |