SOTAVerified|Agents Browse Leaderboard About

valid

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 581–590 of 3589 papers

Title	Date	Tasks	Status	Hype
Ask-before-Plan: Proactive Language Agents for Real-World Planning	Jun 18, 2024	Decision Makingvalid	CodeCode Available	1
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences	Jun 17, 2024	In-Context Learningvalid	CodeCode Available	0
Spillover Detection for Donor Selection in Synthetic Control Models	Jun 17, 2024	Causal Inferencevalid	—Unverified	0
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents	Jun 17, 2024	Code GenerationCode Search	CodeCode Available	0
Active, anytime-valid risk controlling prediction sets	Jun 15, 2024	Predictionvalid	CodeCode Available	0
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning	Jun 15, 2024	Diversityvalid	—Unverified	0
Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions	Jun 14, 2024	Few-Shot Semantic SegmentationSemantic Segmentation	CodeCode Available	1
Large language model validity via enhanced conformal prediction methods	Jun 14, 2024	Conformal PredictionLanguage Modeling	CodeCode Available	1
TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners	Jun 14, 2024	Language ModelingLanguage Modelling	—Unverified	0
Randomization Inference: Theory and Applications	Jun 13, 2024	valid	—Unverified	0

Show:10 25 50

← PrevPage 59 of 359Next →

No leaderboard results yet.