SOTAVerified

Decision Making

Papers

Showing 801850 of 12311 papers

TitleStatusHype
A friendly introduction to triangular transportCode1
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene UnderstandingCode1
Bayesian Safety Validation for Failure Probability Estimation of Black-Box SystemsCode1
A deep active learning system for species identification and counting in camera trap imagesCode1
LLM Guided Evolution - The Automation of Models Advancing ModelsCode1
Benchmarking LLMs for Political Science: A United Nations PerspectiveCode1
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
Building a Scalable and Interpretable Bayesian Deep Learning Framework for Quality Control of Free Form SurfacesCode1
Balancing Biases and Preserving Privacy on Balanced Faces in the WildCode1
Machine Explanations and Human UnderstandingCode1
Active Inference and Behavior Trees for Reactive Action Planning and Execution in RoboticsCode1
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous DrivingCode1
Market Making with Deep Reinforcement Learning from Limit Order BooksCode1
Markup-to-Image Diffusion Models with Scheduled SamplingCode1
Active Fire Detection in Landsat-8 Imagery: a Large-Scale Dataset and a Deep-Learning StudyCode1
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious PlayCode1
A View From Somewhere: Human-Centric Face RepresentationsCode1
MCPNet: An Interpretable Classifier via Multi-Level Concept PrototypesCode1
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical AnalysisCode1
Measuring Implicit Bias in Explicitly Unbiased Large Language ModelsCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
Medical Dead-ends and Learning to Identify High-risk States and TreatmentsCode1
Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement LearningCode1
MEME: Generating RNN Model Explanations via Model ExtractionCode1
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of AgentsCode1
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
A Comparative Visual Analytics Framework for Evaluating Evolutionary Processes in Multi-objective OptimizationCode1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model FrameworkCode1
auto-sktime: Automated Time Series ForecastingCode1
milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion SensingCode1
BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential EquationsCode1
A User's Guide to Calibrating Robotics SimulatorsCode1
ml_edm package: a Python toolkit for Machine Learning based Early Decision MakingCode1
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge DistillationCode1
Analyzing Fairness in Deepfake Detection With Massively Annotated DatabasesCode1
Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging SegmentationCode1
A Model-Driven Approach to Machine Learning and Software Modeling for the IoTCode1
Model Agnostic Defence against Backdoor Attacks in Machine LearningCode1
Algorithmic Stability and Generalization of an Unsupervised Feature Selection AlgorithmCode1
Attention to Fires: Multi-Channel Deep Learning Models for Wildfire Severity PredictionCode1
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable UncertaintyCode1
Motif: Intrinsic Motivation from Artificial Intelligence FeedbackCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
Multi-Agent Distributed Reinforcement Learning for Making Decentralized Offloading DecisionsCode1
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent ConversationCode1
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language ModelsCode1
Towards Rationality in Language and Multimodal Agents: A SurveyCode1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming VideosCode1
Show:102550
← PrevPage 17 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified