| Outlier dimensions favor frequent tokens in language models | Mar 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VALLR: Visual ASR Language Model for Lip Reading | Mar 27, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation | Mar 27, 2025 | Autonomous NavigationLanguage Modeling | —Unverified | 0 |
| Controlling Large Language Model with Latent Actions | Mar 27, 2025 | CoLALanguage Modeling | CodeCode Available | 0 |
| Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing | Mar 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models | Mar 27, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification | Mar 27, 2025 | Cell SegmentationContrastive Learning | CodeCode Available | 0 |
| MoRE-LLM: Mixture of Rule Experts Guided by a Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| InfoBid: A Simulation Framework for Studying Information Disclosure in Auctions with Large Language Model-based Agents | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Large Language Models Predict Associations Among Human Attitudes? | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| D4R -- Exploring and Querying Relational Graphs Using Natural Language and Large Language Models -- the Case of Historical Documents | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Pyramid Network for Efficient Multimodal Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction | Mar 26, 2025 | Computed Tomography (CT)cross-modal alignment | —Unverified | 0 |
| ASGO: Adaptive Structured Gradient Optimization | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation | Mar 26, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The cell as a token: high-dimensional geometry in language models and cell embeddings | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RALLRec+: Retrieval Augmented Large Language Model Recommendation with Reasoning | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions | Mar 26, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Mar 26, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 2 |
| Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping | Mar 25, 2025 | Computational PhenotypingLanguage Modeling | —Unverified | 0 |
| Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive LearningIntent Recognition | —Unverified | 0 |
| Improved Alignment of Modalities in Large Vision Language Models | Mar 25, 2025 | GPUImage Captioning | —Unverified | 0 |
| SemEval-2025 Task 9: The Food Hazard Detection Challenge | Mar 25, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Mar 25, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Optimizing Language Models for Inference Time Objectives using Reinforcement Learning | Mar 25, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis | Mar 25, 2025 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code CompletionLanguage Modeling | CodeCode Available | 1 |
| CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model | Mar 25, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model | Mar 25, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Uncertainty Quantification with Attention Chain | Mar 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| A Survey of Large Language Model Agents for Question Answering | Mar 24, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ClinText-SP and RigoBERTa Clinical: a new set of open resources for Spanish Clinical NLP | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving Situation Puzzles with Large Language Model and External Reformulation | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MMCR: Advancing Visual Language Model in Multimodal Multi-Turn Contextual Reasoning | Mar 24, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics | Mar 24, 2025 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |