SOTAVerified

Decision Making

Papers

Showing 181-190 of 12311 papers

Title | Status | Hype
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models | Code | 2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR | Code | 2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Code | 2
Concept Bottleneck Language Models For protein design | Code | 2
ADAPT: Action-aware Driving Caption Transformer | Code | 2
LingoQA: Visual Question Answering for Autonomous Driving | Code | 2
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Code | 2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Code | 2
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | Code | 2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified