| The Empirical Impact of Data Sanitization on Language Models | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Recycled Attention: Efficient inference for long-context language models | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Nov 8, 2024 | Fact CheckingLanguage Modeling | —Unverified | 0 |
| Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Early FIRST Reproduction and Improvements to Single-Token Decoding for Fast Listwise Reranking | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SSSD: Simply-Scalable Speculative Decoding | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LBPE: Long-token-first Tokenization to Improve Large Language Models | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Assessing the Answerability of Queries in Retrieval-Augmented Code Generation | Nov 8, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent | Nov 8, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Unmasking the Shadows: Pinpoint the Implementations of Anti-Dynamic Analysis Techniques in Malware Using LLM | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion Diagnosis | Nov 8, 2024 | Disease PredictionLanguage Modeling | CodeCode Available | 0 |
| AgentOps: Enabling Observability of LLM Agents | Nov 8, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learning | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Watermarking Language Models through Language Models | Nov 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models | Nov 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Nov 7, 2024 | Contrastive LearningImage Captioning | CodeCode Available | 4 |
| BendVLM: Test-Time Debiasing of Vision-Language Embeddings | Nov 7, 2024 | AttributeImage Generation | CodeCode Available | 0 |
| Benchmarking Large Language Models with Integer Sequence Generation Tasks | Nov 7, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Scaling Laws for Pre-training Agents and World Models | Nov 7, 2024 | Imitation LearningLanguage Modeling | —Unverified | 0 |
| SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Nov 7, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 3 |
| PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoMLHyperparameter Optimization | CodeCode Available | 1 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Nov 7, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun | Nov 7, 2024 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 0 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Deploying Multi-task Online Server with Large Language Model | Nov 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Generative Model-assisted Talking-face Semantic Communication System | Nov 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation | Nov 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-Awareness | Nov 6, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| Fine-Tuning Vision-Language Model for Automated Engineering Drawing Information Extraction | Nov 6, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Unified Pathological Speech Analysis with Prompt Tuning | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Nov 5, 2024 | Causal InferenceLanguage Modeling | —Unverified | 0 |
| The Evolution of RWKV: Advancements in Efficient Language Modeling | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Nov 5, 2024 | Abstractive Text SummarizationLanguage Modeling | —Unverified | 0 |
| Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Nov 5, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| ChatGPT in Research and Education: Exploring Benefits and Threats | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HumanVLM: Foundation for Human-Scene Vision-Language Model | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PersianRAG: A Retrieval-Augmented Generation System for Persian Language | Nov 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Nov 5, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 |
| [Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Nov 5, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Wave Network: An Ultra-Small Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 1 |