| Self-Calibrating Conformal Prediction | Feb 11, 2024 | Binary ClassificationConformal Prediction | CodeCode Available | 1 |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Feb 9, 2024 | Computational Efficiencycontinuous-control | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| Conformal Convolution and Monte Carlo Meta-learners for Predictive Inference of Individual Treatment Effects | Feb 7, 2024 | Decision MakingMarketing | CodeCode Available | 1 |
| Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Feb 7, 2024 | Decision Makingregression | CodeCode Available | 1 |
| Measuring Implicit Bias in Explicitly Unbiased Large Language Models | Feb 6, 2024 | Decision MakingDiagnostic | CodeCode Available | 1 |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Feb 5, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Deep hybrid models: infer and plan in a dynamic world | Feb 1, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| LLM Voting: Human Choices and AI Collective Decision Making | Jan 31, 2024 | Decision MakingDiversity | CodeCode Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 1 |