| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Apr 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning to Attribute with Attention | Apr 18, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging | Apr 14, 2025 | Anomaly DetectionDiagnostic | CodeCode Available | 1 |
| Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics Simulations | Apr 13, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Parameterized Synthetic Text Generation with SimpleStories | Apr 12, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models | Apr 10, 2025 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization | Apr 6, 2025 | BenchmarkingCombinatorial Optimization | CodeCode Available | 1 |
| MSL: Not All Tokens Are What You Need for Tuning LLM as a Recommender | Apr 5, 2025 | AllLanguage Modeling | CodeCode Available | 1 |
| SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding | Apr 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking | Apr 4, 2025 | Document RankingInformation Retrieval | CodeCode Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 |
| STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection | Apr 3, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling | Apr 3, 2025 | Grapheme-to-Phoneme ConversionLanguage Modeling | CodeCode Available | 1 |
| MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities | Apr 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model | Apr 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining | Apr 2, 2025 | Continual LearningContinual Pretraining | CodeCode Available | 1 |
| STPNet: Scale-aware Text Prompt Network for Medical Image Segmentation | Apr 2, 2025 | Image SegmentationLanguage Modeling | CodeCode Available | 1 |
| Representation Bending for Large Language Model Safety | Apr 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving | Mar 31, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd CountingLanguage Modeling | CodeCode Available | 1 |
| Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages | Mar 30, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 1 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | AllLanguage Modeling | CodeCode Available | 1 |
| OpenHuEval: Evaluating Large Language Model on Hungarian Specifics | Mar 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding | Mar 27, 2025 | FormLanguage Modeling | CodeCode Available | 1 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code CompletionLanguage Modeling | CodeCode Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Model Uncertainty Quantification with Attention Chain | Mar 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Sun-Shine: A Large Language Model for Tibetan Culture | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| What Makes a Reward Model a Good Teacher? An Optimization Perspective | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? | Mar 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Mar 14, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 |
| Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Mar 10, 2025 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion | Mar 8, 2025 | 3D Semantic Scene CompletionAutonomous Driving | CodeCode Available | 1 |
| Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices | Mar 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| L^2M: Mutual Information Scaling Law for Long-Context Language Modeling | Mar 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-enLanguage Modeling | CodeCode Available | 1 |
| Words or Vision: Do Vision-Language Models Have Blind Faith in Text? | Mar 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Superscopes: Amplifying Internal Feature Representations for Language Model Interpretation | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Mar 2, 2025 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |