| DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Oct 10, 2024 | Image Generation, Language Modeling | Unverified | 0 |
| PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency | Oct 10, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data | Oct 10, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Q-VLM: Post-training Quantization for Large Vision-Language Models | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression | Oct 10, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Oct 10, 2024 | image-classification, Image Classification | Code Available | 0 |
| Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models | Oct 10, 2024 | Conformal Prediction, Language Modeling | Unverified | 0 |
| TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Evolutionary Contrastive Distillation for Language Model Alignment | Oct 10, 2024 | Contrastive Learning, Instruction Following | Unverified | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian Inference, Decision Making | Unverified | 0 |
| Disease Entity Recognition and Normalization is Improved with Large Language Model Derived Synthetic Normalized Mentions | Oct 10, 2024 | Data Augmentation, Knowledge Graphs | Unverified | 0 |
| OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Oct 10, 2024 | Entity Linking, Few-Shot Learning | Code Available | 1 |
| Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs | Oct 10, 2024 | Active Learning, Language Modeling | Code Available | 2 |
| Recent advancements in LLM Red-Teaming: Techniques, Defenses, and Ethical Considerations | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Generating long-horizon stock "buy" signals with a neural language model | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | Oct 9, 2024 | Causal Language Modeling, GPU | Unverified | 0 |
| Exploring Prompt Engineering: A Systematic Review with SWOT Analysis | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| TinyClick: Single-Turn Agent for Empowering GUI Automation | Oct 9, 2024 | Data Augmentation, GPU | Unverified | 0 |
| Enhancing Vision-Language Model Pre-training with Image-text Pair Pruning Based on Word Frequency | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| AuditWen:An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 |
| Exploring Efficient Foundational Multi-modal Models for Video Summarization | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Multi-Task Program Error Repair and Explanatory Diagnosis | Oct 9, 2024 | Graph Neural Network, Language Modeling | Unverified | 0 |
| β-calibration of Language Model Confidence Scores for Generative QA | Oct 9, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Large Language Model Compression with Neural Architecture Search | Oct 9, 2024 | Instruction Following, Language Modeling | Unverified | 0 |
| Pixtral 12B | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 11 |
| Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning | Oct 9, 2024 | Graph Neural Network, In-Context Learning | Unverified | 0 |
| Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards Interpreting Visual Information Processing in Vision-Language Models | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| TinyEmo: Scaling down Emotional Reasoning via Metric Projection | Oct 9, 2024 | Bias Detection, Classification | Code Available | 0 |
| Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling | Oct 9, 2024 | Attribute, Language Modeling | Unverified | 0 |
| FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models | Oct 9, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara | Oct 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Application of NotebookLM, a Large Language Model with Retrieval-Augmented Generation, for Lung Cancer Staging | Oct 8, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| Applying Refusal-Vector Ablation to Llama 3.1 70B Agents | Oct 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation | Oct 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question Answering, Language Modeling | Code Available | 0 |
| ParallelSpec: Parallel Drafter for Efficient Speculative Decoding | Oct 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Multi-Session Client-Centered Treatment Outcome Evaluation in Psychotherapy | Oct 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Training-free Diffusion Model Alignment with Sampling Demons | Oct 8, 2024 | Denoising, Image Generation | Code Available | 1 |
| FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning | Oct 8, 2024 | GSM8K, Hallucination | Unverified | 0 |
| Accelerated Preference Optimization for Large Language Model Alignment | Oct 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation | Oct 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Think While You Generate: Discrete Diffusion with Planned Denoising | Oct 8, 2024 | Denoising, Image Generation | Code Available | 2 |
| RL, but don't do anything I wouldn't do | Oct 8, 2024 | Language Modeling, Language Modelling | Code Available | 0 |