| AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Dec 18, 2024 | Benchmarking, World Knowledge | Code Available | 0 |
| MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Dec 18, 2024 | Instruction Following, MORPH | Unverified | 0 |
| HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Dec 17, 2024 | Prediction, Trajectory Prediction | Unverified | 0 |
| QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs | Dec 16, 2024 | Benchmarking, Common Sense Reasoning | Code Available | 0 |
| GaGA: Towards Interactive Global Geolocation Assistant | Dec 12, 2024 | World Knowledge | Unverified | 0 |
| AltFS: Agency-light Feature Selection with Large Language Models in Deep Recommender Systems | Dec 11, 2024 | Feature Importance, feature selection | Unverified | 0 |
| Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Dec 9, 2024 | Autonomous Driving, Decision Making | Unverified | 0 |
| World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Dec 9, 2024 | Autonomous Driving, World Knowledge | Unverified | 0 |
| Balancing Efficiency and Effectiveness: An LLM-Infused Approach for Optimized CTR Prediction | Dec 9, 2024 | Click-Through Rate Prediction, World Knowledge | Unverified | 0 |
| A surprisal oracle for when every layer counts | Dec 4, 2024 | Common Sense Reasoning, Language Modeling | Code Available | 0 |