Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0
Recycled Attention: Efficient inference for long-context language models | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 0
LBPE: Long-token-first Tokenization to Improve Large Language Models | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0
Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learning | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 2
LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision Making, Language Modelling | Code Available | 2
A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion Diagnosis | Nov 8, 2024 | Disease Prediction, Language Modeling | Code Available | 0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0
AgentOps: Enabling Observability of LLM Agents | Nov 8, 2024 | AI Agent, Language Modeling | Unverified | 0
Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language Modeling, Language Modelling | Code Available | 1
Watermarking Language Models through Language Models | Nov 7, 2024 | Language Modeling, Language Modelling | Unverified | 0
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoML, Hyperparameter Optimization | Code Available | 1
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Nov 7, 2024 | Code Generation, Language Modeling | Code Available | 3
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Nov 7, 2024 | Decoder, Language Modeling | Unverified | 0
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Nov 7, 2024 | Contrastive Learning, Image Captioning | Code Available | 4
DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 1
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | Nov 7, 2024 | Language Modelling, Large Language Model | Unverified | 0
PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 2
A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | Unverified | 0
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models | Nov 7, 2024 | Language Modeling, Language Modelling | Unverified | 0
When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun | Nov 7, 2024 | Cross-Lingual Transfer, Language Modeling | Code Available | 0
Scaling Laws for Pre-training Agents and World Models | Nov 7, 2024 | Imitation Learning, Language Modeling | Unverified | 0
BendVLM: Test-Time Debiasing of Vision-Language Embeddings | Nov 7, 2024 | Attribute, Image Generation | Code Available | 0
Benchmarking Large Language Models with Integer Sequence Generation Tasks | Nov 7, 2024 | Benchmarking, Computational Efficiency | Unverified | 0
Fine-Tuning Vision-Language Model for Automated Engineering Drawing Information Extraction | Nov 6, 2024 | Hallucination, Language Modeling | Unverified | 0
Large Generative Model-assisted Talking-face Semantic Communication System | Nov 6, 2024 | Language Modeling, Language Modelling | Unverified | 0
The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation | Nov 6, 2024 | Language Modeling, Language Modelling | Unverified | 0
NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA | Nov 6, 2024 | Federated Learning, Language Modelling | Unverified | 0
Deploying Multi-task Online Server with Large Language Model | Nov 6, 2024 | Language Modeling, Language Modelling | Unverified | 0
Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-Awareness | Nov 6, 2024 | Bayesian Optimization, GPU | Code Available | 0
Unified Pathological Speech Analysis with Prompt Tuning | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1
ChatGPT in Research and Education: Exploring Benefits and Threats | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
HumanVLM: Foundation for Human-Scene Vision-Language Model | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Nov 5, 2024 | Diversity, Language Modeling | Unverified | 0
[Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Nov 5, 2024 | Chatbot, Language Modeling | Unverified | 0
Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Nov 5, 2024 | Causal Inference, Language Modeling | Unverified | 0
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Nov 5, 2024 | Abstractive Text Summarization, Language Modeling | Unverified | 0
PersianRAG: A Retrieval-Augmented Generation System for Persian Language | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Nov 5, 2024 | Hallucination, Language Modeling | Code Available | 2
The Evolution of RWKV: Advancements in Efficient Language Modeling | Nov 5, 2024 | Language Modeling, Language Modelling | Unverified | 0
AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis | Nov 4, 2024 | Language Modeling, Language Modelling | Unverified | 0
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | Diagnostic, Language Modeling | Code Available | 1
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | Chunking, Language Modelling | Code Available | 1
Wave Network: An Ultra-Small Language Model | Nov 4, 2024 | Language Modeling, Language Modelling | Unverified | 0
GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery | Nov 4, 2024 | Language Modeling, Language Modelling | Unverified | 0
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Nov 4, 2024 | Language Modeling, Language Modelling | Unverified | 0
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension | Nov 4, 2024 | Keypoint Detection, Language Modeling | Unverified | 0