SOTAVerified

Decision Making

Papers

Showing 901925 of 12311 papers

TitleStatusHype
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent ConversationCode1
Auto-GPT for Online Decision Making: Benchmarks and Additional OpinionsCode1
Algorithmic Stability and Generalization of an Unsupervised Feature Selection AlgorithmCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
A User's Guide to Calibrating Robotics SimulatorsCode1
Decision-Focused Learning: Through the Lens of Learning to RankCode1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming VideosCode1
Attention to Fires: Multi-Channel Deep Learning Models for Wildfire Severity PredictionCode1
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive LossCode1
Atari-HEAD: Atari Human Eye-Tracking and Demonstration DatasetCode1
Attention-based Bidirectional LSTM for Deceptive Opinion Spam ClassificationCode1
Bayesian Safety Validation for Failure Probability Estimation of Black-Box SystemsCode1
Probabilistic 3D segmentation for aleatoric uncertainty quantification in full 3D medical dataCode1
Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record DataCode1
ProoFVer: Natural Logic Theorem Proving for Fact VerificationCode1
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image RecognitionCode1
ProTo: Program-Guided Transformer for Program-Guided TasksCode1
Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and ForecastingCode1
PyTouch: A Machine Learning Library for Touch ProcessingCode1
Analyzing Epistemic and Aleatoric Uncertainty for Drusen Segmentation in Optical Coherence Tomography ImagesCode1
QPLEX: Duplex Dueling Multi-Agent Q-LearningCode1
A Survey on Interpretable Cross-modal ReasoningCode1
A Survey on Session-based Recommender SystemsCode1
A Survey of World Models for Autonomous DrivingCode1
Show:102550
← PrevPage 37 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified