CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Language Model Uncertainty Quantification with Attention Chain Mar 24, 2025 Computational Efficiency Language Modeling
Code Code Available 1Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling Mar 24, 2025 Continual Pretraining Language Modeling
— Unverified 0A Survey of Large Language Model Agents for Question Answering Mar 24, 2025 Answer Generation Information Retrieval
— Unverified 0Sun-Shine: A Large Language Model for Tibetan Culture Mar 24, 2025 Language Modeling Language Modelling
Code Code Available 1Solving Situation Puzzles with Large Language Model and External Reformulation Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Breaking the Encoder Barrier for Seamless Video-Language Understanding Mar 24, 2025 Decoder Language Modeling
— Unverified 0MC-LLaVA: Multi-Concept Personalized Vision-Language Model Mar 24, 2025 Language Modeling Language Modelling
Code Code Available 2Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain Mar 24, 2025 Knowledge Base Population Language Modeling
— Unverified 0Unsupervised Acquisition of Discrete Grammatical Categories Mar 24, 2025 Language Acquisition Language Modeling
— Unverified 0TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics Mar 24, 2025 Human-Object Interaction Detection Language Modeling
— Unverified 0MMCR: Advancing Visual Language Model in Multimodal Multi-Turn Contextual Reasoning Mar 24, 2025 Diagnostic Language Modeling
— Unverified 0LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Discriminative protein sequence modelling with Latent Space Diffusion Mar 24, 2025 Denoising Language Modeling
— Unverified 0PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model Mar 24, 2025 Language Modeling Language Modelling
Code Code Available 1ClinText-SP and RigoBERTa Clinical: a new set of open resources for Spanish Clinical NLP Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures Mar 24, 2025 Language Modeling Language Modelling
— Unverified 0Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors Mar 24, 2025 Chatbot Language Modeling
— Unverified 0ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation Mar 24, 2025 Code Generation Language Modeling
— Unverified 0Simulating Filter Bubble on Short-video Recommender System with Large Language Model Agents Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0Payload-Aware Intrusion Detection with CMAE and Large Language Models Mar 23, 2025 Intrusion Detection Language Modeling
— Unverified 0MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0LakotaBERT: A Transformer-based Model for Low Resource Lakota Language Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training Mar 23, 2025 Language Modeling Language Modelling
— Unverified 0CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model Mar 22, 2025 Language Modeling Language Modelling
— Unverified 0Large Language Model Compression via the Nested Activation-Aware Decomposition Mar 21, 2025 Language Modeling Language Modelling
— Unverified 0CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement Mar 21, 2025 Dimensionality Reduction Language Modeling
— Unverified 0Modifying Large Language Model Post-Training for Diverse Creative Writing Mar 21, 2025 Diversity Language Modeling
Code Code Available 2Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation Mar 21, 2025 Click-Through Rate Prediction Contrastive Learning
— Unverified 0Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models Mar 21, 2025 Language Modeling Language Modelling
— Unverified 0Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion Mar 21, 2025 Active Learning Language Modeling
— Unverified 0Efficient Knowledge Distillation via Curriculum Extraction Mar 21, 2025 Knowledge Distillation Language Modeling
— Unverified 0CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities Mar 21, 2025 Language Modeling Language Modelling
Code Code Available 2FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models Mar 21, 2025 Language Modeling Language Modelling
Code Code Available 2Variance Control via Weight Rescaling in LLM Pre-training Mar 21, 2025 Language Modeling Language Modelling
Code Code Available 0Field-Mediated Semantic Organization in Large Language Models: Evidence for Quantum-Like Properties in Artificial Neural Systems Mar 21, 2025 Language Modeling Language Modelling
— Unverified 0How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities Mar 20, 2025 General Knowledge Language Modeling
Code Code Available 0A Comprehensive Survey on Long Context Language Modeling Mar 20, 2025 Language Modeling Language Modelling
Code Code Available 3Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms Mar 20, 2025 Language Modeling Language Modelling
— Unverified 0Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture Mar 20, 2025 Language Modeling Language Modelling
— Unverified 0Using Language Models to Decipher the Motivation Behind Human Behaviors Mar 20, 2025 Language Modeling Language Modelling
— Unverified 0Entropy-based Exploration Conduction for Multi-step Reasoning Mar 20, 2025 Language Modeling Language Modelling
— Unverified 0Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model Mar 20, 2025 Language Modeling Language Modelling
Code Code Available 2Exploring the Reliability of Self-explanation and its Relationship with Classification in Language Model-driven Financial Analysis Mar 20, 2025 Classification Financial Analysis
Code Code Available 0Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction Mar 20, 2025 Image Generation Language Modeling
— Unverified 0Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Mar 20, 2025 Decision Making Language Modeling
Code Code Available 4