SOTAVerified

Decision Making

Papers

Showing 801825 of 12311 papers

TitleStatusHype
Linguistic Calibration of Long-Form GenerationsCode1
LITE: Modeling Environmental Ecosystems with Multimodal Large Language ModelsCode1
auto-sktime: Automated Time Series ForecastingCode1
Active Inference and Behavior Trees for Reactive Action Planning and Execution in RoboticsCode1
Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive NegotiationCode1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
Active Fire Detection in Landsat-8 Imagery: a Large-Scale Dataset and a Deep-Learning StudyCode1
A View From Somewhere: Human-Centric Face RepresentationsCode1
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal DataCode1
Auto-GPT for Online Decision Making: Benchmarks and Additional OpinionsCode1
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
MAGIC: Learning Macro-Actions for Online POMDP PlanningCode1
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent ConversationCode1
Analyzing Epistemic and Aleatoric Uncertainty for Drusen Segmentation in Optical Coherence Tomography ImagesCode1
Masked Trajectory Models for Prediction, Representation, and ControlCode1
Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement LearningCode1
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious PlayCode1
MCPNet: An Interpretable Classifier via Multi-Level Concept PrototypesCode1
Balancing Biases and Preserving Privacy on Balanced Faces in the WildCode1
Attention to Fires: Multi-Channel Deep Learning Models for Wildfire Severity PredictionCode1
Measuring Implicit Bias in Explicitly Unbiased Large Language ModelsCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
Attention-based Bidirectional LSTM for Deceptive Opinion Spam ClassificationCode1
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
Show:102550
← PrevPage 33 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified