| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models | Feb 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Feb 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications | Feb 16, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model | Feb 15, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Large Language Model Agents Balance Energy Systems? | Feb 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence | Feb 12, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Feb 11, 2025 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Implicit Language Models are RNNs: Balancing Parallelization and Expressivity | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding | Feb 8, 2025 | DenoisingImage Generation | CodeCode Available | 1 |
| Gemstones: A Model Suite for Multi-Faceted Scaling Laws | Feb 7, 2025 | Experimental DesignLanguage Modeling | CodeCode Available | 1 |
| Position-aware Automatic Circuit Discovery | Feb 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient On-Device Agents | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Great Models Think Alike and this Undermines AI Oversight | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Robotouille: An Asynchronous Planning Benchmark for LLM Agents | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Feb 5, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Intent Representation Learning with Large Language Model for Recommendation | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |