| Large Language Model Compression with Neural Architecture Search | Oct 9, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Pixtral 12B | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 11 |
| Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning | Oct 9, 2024 | Graph Neural NetworkIn-Context Learning | —Unverified | 0 |
| Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Interpreting Visual Information Processing in Vision-Language Models | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| TinyEmo: Scaling down Emotional Reasoning via Metric Projection | Oct 9, 2024 | Bias DetectionClassification | CodeCode Available | 0 |
| Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling | Oct 9, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models | Oct 9, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Application of NotebookLM, a Large Language Model with Retrieval-Augmented Generation, for Lung Cancer Staging | Oct 8, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Applying Refusal-Vector Ablation to Llama 3.1 70B Agents | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation | Oct 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question AnsweringLanguage Modeling | CodeCode Available | 0 |
| ParallelSpec: Parallel Drafter for Efficient Speculative Decoding | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Session Client-Centered Treatment Outcome Evaluation in Psychotherapy | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Training-free Diffusion Model Alignment with Sampling Demons | Oct 8, 2024 | DenoisingImage Generation | CodeCode Available | 1 |
| FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning | Oct 8, 2024 | GSM8KHallucination | —Unverified | 0 |
| Accelerated Preference Optimization for Large Language Model Alignment | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Think While You Generate: Discrete Diffusion with Planned Denoising | Oct 8, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| RL, but don't do anything I wouldn't do | Oct 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |