| MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation | Feb 12, 2025 | parameter-efficient fine-tuningRecommendation Systems | —Unverified | 0 |
| Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Feb 9, 2025 | DiversityFairness | CodeCode Available | 0 |
| Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| LAST SToP For Modeling Asynchronous Time Series | Feb 4, 2025 | Anomaly DetectionImputation | —Unverified | 0 |
| Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning | Jan 27, 2025 | World Knowledge | —Unverified | 0 |
| Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Jan 25, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| A Collection of Question Answering Datasets for Norwegian | Jan 19, 2025 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Jan 16, 2025 | Mixture-of-ExpertsWorld Knowledge | —Unverified | 0 |
| Distilling Multi-modal Large Language Models for Autonomous Driving | Jan 16, 2025 | Autonomous DrivingMotion Planning | —Unverified | 0 |
| Dynamic Knowledge Integration for Enhanced Vision-Language Reasoning | Jan 15, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| HALoGEN: Fantastic LLM Hallucinations and Where to Find Them | Jan 14, 2025 | HallucinationWorld Knowledge | —Unverified | 0 |
| A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jan 6, 2025 | AttributeOptical Character Recognition | —Unverified | 0 |
| Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering | Jan 1, 2025 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble | Dec 30, 2024 | knowledge editingLanguage Modeling | —Unverified | 0 |
| VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Dec 24, 2024 | Common Sense ReasoningTransfer Learning | —Unverified | 0 |
| Interweaving Memories of a Siamese Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models | Dec 21, 2024 | World Knowledge | —Unverified | 0 |
| Logical Consistency of Large Language Models in Fact-checking | Dec 20, 2024 | Fact CheckingHallucination | —Unverified | 0 |
| GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | Dec 19, 2024 | Efficient ExplorationEmbodied Question Answering | —Unverified | 0 |
| AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Dec 18, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Dec 18, 2024 | Instruction FollowingMORPH | —Unverified | 0 |
| HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | Dec 17, 2024 | PredictionTrajectory Prediction | —Unverified | 0 |
| QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs | Dec 16, 2024 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 |
| GaGA: Towards Interactive Global Geolocation Assistant | Dec 12, 2024 | World Knowledge | —Unverified | 0 |
| AltFS: Agency-light Feature Selection with Large Language Models in Deep Recommender Systems | Dec 11, 2024 | Feature Importancefeature selection | —Unverified | 0 |
| Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Dec 9, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Balancing Efficiency and Effectiveness: An LLM-Infused Approach for Optimized CTR Prediction | Dec 9, 2024 | Click-Through Rate PredictionWorld Knowledge | —Unverified | 0 |
| World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Dec 9, 2024 | Autonomous DrivingWorld Knowledge | —Unverified | 0 |
| A surprisal oracle for when every layer counts | Dec 4, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 0 |
| SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model | Nov 29, 2024 | Autonomous VehiclesLanguage Modeling | —Unverified | 0 |
| Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising | Nov 28, 2024 | DenoisingLanguage Modeling | —Unverified | 0 |
| Functionality understanding and segmentation in 3D scenes | Nov 25, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making | Nov 24, 2024 | Decision MakingWorld Knowledge | —Unverified | 0 |
| GOT4Rec: Graph of Thoughts for Sequential Recommendation | Nov 22, 2024 | General KnowledgeSequential Recommendation | —Unverified | 0 |
| LEADRE: Multi-Faceted Knowledge Enhanced LLM Empowered Display Advertisement Recommender System | Nov 21, 2024 | Learning-To-RankPrompt Engineering | —Unverified | 0 |
| GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping | Nov 19, 2024 | Common Sense ReasoningHuman-Object Interaction Detection | —Unverified | 0 |
| Past, Present, and Future of Sensor-Based Human Activity Recognition Using Wearables: A Surveying Tutorial on a Still Challenging Task | Nov 11, 2024 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 |
| Vision Language Models are In-Context Value Learners | Nov 7, 2024 | In-Context LearningWorld Knowledge | —Unverified | 0 |
| Gradient Localization Improves Lifelong Pretraining of Language Models | Nov 7, 2024 | Continual LearningWorld Knowledge | —Unverified | 0 |
| Pre-trained Visual Dynamics Representations for Efficient Policy Learning | Nov 5, 2024 | Reinforcement Learning (RL)Video Prediction | —Unverified | 0 |
| ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation | Nov 1, 2024 | EpidemiologyKnowledge Distillation | —Unverified | 0 |
| On the Exploration of LM-Based Soft Modular Robot Design | Nov 1, 2024 | World Knowledge | —Unverified | 0 |
| EMMA: End-to-End Multimodal Model for Autonomous Driving | Oct 30, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| ADAM: An Embodied Causal Agent in Open-World Environments | Oct 29, 2024 | Lifelong learningMinecraft | —Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| GRADE: Quantifying Sample Diversity in Text-to-Image Models | Oct 29, 2024 | AttributeDiversity | —Unverified | 0 |
| ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Oct 24, 2024 | Image RetrievalRetrieval | CodeCode Available | 0 |