| MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation | Feb 12, 2025 | parameter-efficient fine-tuningRecommendation Systems | —Unverified | 0 |
| Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Feb 9, 2025 | DiversityFairness | CodeCode Available | 0 |
| Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| LAST SToP For Modeling Asynchronous Time Series | Feb 4, 2025 | Anomaly DetectionImputation | —Unverified | 0 |
| Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning | Jan 27, 2025 | World Knowledge | —Unverified | 0 |
| Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Jan 25, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| A Collection of Question Answering Datasets for Norwegian | Jan 19, 2025 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| Distilling Multi-modal Large Language Models for Autonomous Driving | Jan 16, 2025 | Autonomous DrivingMotion Planning | —Unverified | 0 |
| LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Jan 16, 2025 | Mixture-of-ExpertsWorld Knowledge | —Unverified | 0 |
| Dynamic Knowledge Integration for Enhanced Vision-Language Reasoning | Jan 15, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| HALoGEN: Fantastic LLM Hallucinations and Where to Find Them | Jan 14, 2025 | HallucinationWorld Knowledge | —Unverified | 0 |
| A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jan 6, 2025 | AttributeOptical Character Recognition | —Unverified | 0 |
| Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering | Jan 1, 2025 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble | Dec 30, 2024 | knowledge editingLanguage Modeling | —Unverified | 0 |
| VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Dec 24, 2024 | Common Sense ReasoningTransfer Learning | —Unverified | 0 |
| Interweaving Memories of a Siamese Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models | Dec 21, 2024 | World Knowledge | —Unverified | 0 |
| Logical Consistency of Large Language Models in Fact-checking | Dec 20, 2024 | Fact CheckingHallucination | —Unverified | 0 |
| GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | Dec 19, 2024 | Efficient ExplorationEmbodied Question Answering | —Unverified | 0 |
| MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Dec 18, 2024 | Instruction FollowingMORPH | —Unverified | 0 |
| AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Dec 18, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Dec 17, 2024 | PredictionTrajectory Prediction | —Unverified | 0 |
| QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs | Dec 16, 2024 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 |
| GaGA: Towards Interactive Global Geolocation Assistant | Dec 12, 2024 | World Knowledge | —Unverified | 0 |