Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs Oct 10, 2024 Active Learning Language Modeling
Code Code Available 2Q-VLM: Post-training Quantization for Large Vision-Language Models Oct 10, 2024 Language Modeling Language Modelling
Code Code Available 2OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling Oct 10, 2024 Language Modeling Language Modelling
Code Code Available 2Sylber: Syllabic Embedding Representation of Speech from Raw Audio Oct 9, 2024 Language Modeling Language Modelling
Code Code Available 2Towards Interpreting Visual Information Processing in Vision-Language Models Oct 9, 2024 Language Modeling Language Modelling
Code Code Available 2Compositional Entailment Learning for Hyperbolic Vision-Language Models Oct 9, 2024 Language Modelling Representation Learning
Code Code Available 2Think While You Generate: Discrete Diffusion with Planned Denoising Oct 8, 2024 Denoising Image Generation
Code Code Available 2BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation Oct 8, 2024 Language Modeling Language Modelling
Code Code Available 2PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling Oct 8, 2024 document understanding Language Modeling
Code Code Available 2TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens Oct 7, 2024 Language Modeling Language Modelling
Code Code Available 2Differential Transformer Oct 7, 2024 Hallucination In-Context Learning
Code Code Available 2Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality Oct 7, 2024 Causal Inference counterfactual
Code Code Available 2GenSim: A General Social Simulation Platform with Large Language Model based Agents Oct 6, 2024 Language Modeling Language Modelling
Code Code Available 2SyllableLM: Learning Coarse Semantic Units for Speech Language Models Oct 5, 2024 Clustering Language Modeling
Code Code Available 2A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models Oct 5, 2024 Language Modeling Language Modelling
Code Code Available 2Autoregressive Action Sequence Learning for Robotic Manipulation Oct 4, 2024 Chunking Language Modeling
Code Code Available 2NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator Oct 3, 2024 Language Modeling Language Modelling
Code Code Available 2Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 2Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning Sep 30, 2024 Instruction Following Language Modeling
Code Code Available 2FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" Sep 30, 2024 counterfactual Hallucination
Code Code Available 2DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data Sep 30, 2024 Instruction Following Language Modeling
Code Code Available 2LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation Sep 30, 2024 Attribute Collaborative Filtering
Code Code Available 2One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Sep 29, 2024 All Image Segmentation
Code Code Available 2Control Industrial Automation System with Large Language Model Agents Sep 26, 2024 Language Modeling Language Modelling
Code Code Available 2Empirical Asset Pricing with Large Language Model Agents Sep 25, 2024 Language Modeling Language Modelling
Code Code Available 2Small Language Models: Survey, Measurements, and Insights Sep 24, 2024 Benchmarking Decoder
Code Code Available 2EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG Model Sep 24, 2024 EEG Electroencephalogram (EEG)
Code Code Available 2MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding Sep 23, 2024 Language Modeling Language Modelling
Code Code Available 2Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management Sep 20, 2024 Language Modeling Language Modelling
Code Code Available 2Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization Sep 19, 2024 GPU Language Modeling
Code Code Available 2Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning Sep 19, 2024 Language Modeling Language Modelling
Code Code Available 2Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework Sep 19, 2024 Autonomous Vehicles Decision Making
Code Code Available 2AutoVerus: Automated Proof Generation for Rust Code Sep 19, 2024 Code Generation Language Modeling
Code Code Available 2LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment Sep 13, 2024 Language Modeling Language Modelling
Code Code Available 2Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Synthetic continued pretraining Sep 11, 2024 Data Augmentation Language Modelling
Code Code Available 2MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving Sep 11, 2024 Autonomous Driving Feature Engineering
Code Code Available 2DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks Sep 10, 2024 Contrastive Learning Image Reconstruction
Code Code Available 2TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks Sep 9, 2024 Classification Language Modeling
Code Code Available 2The AdEMAMix Optimizer: Better, Faster, Older Sep 5, 2024 image-classification Image Classification
Code Code Available 2Language Model Powered Digital Biology with BRAD Sep 4, 2024 Chatbot Code Generation
Code Code Available 2EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance Sep 2, 2024 AudioCaps Audio captioning
Code Code Available 2Sample-Efficient Diffusion for Text-To-Speech Synthesis Sep 1, 2024 Language Modeling Language Modelling
Code Code Available 2SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation Sep 1, 2024 Language Modeling Language Modelling
Code Code Available 2MemLong: Memory-Augmented Retrieval for Long Text Modeling Aug 30, 2024 4k Decoder
Code Code Available 2Law of Vision Representation in MLLMs Aug 29, 2024 cross-modal alignment Language Modeling
Code Code Available 2Efficient LLM Scheduling by Learning to Rank Aug 28, 2024 Blocking Chatbot
Code Code Available 2LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet Aug 27, 2024 Language Modeling Language Modelling
Code Code Available 2MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents Aug 26, 2024 Language Modeling Language Modelling
Code Code Available 2LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
Code Code Available 2