| The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models | Jun 14, 2024 | Fairness, Language Modeling | Code Available | 1 |
| Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Jun 14, 2024 | Image Quality Assessment, Language Modeling | Unverified | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPU, Language Modeling | Unverified | 0 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 |
| Group and Shuffle: Efficient Structured Orthogonal Parametrization | Jun 14, 2024 | Computational Efficiency, Language Modeling | Code Available | 0 |
| TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst | Jun 14, 2024 | Image Captioning, Language Modeling | Unverified | 0 |
| Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation | Jun 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| 3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding | Jun 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CarLLaVA: Vision language models for camera-only closed-loop driving | Jun 14, 2024 | Autonomous Driving, Bench2Drive | Code Available | 3 |
| Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting | Jun 14, 2024 | Dialogue Generation, Form | Code Available | 0 |
| CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer | Jun 13, 2024 | Domain Generalization, Knowledge Tracing | Unverified | 0 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | Articles, Entity Disambiguation | Code Available | 1 |
| Multi-Modal Retrieval For Large Language Model Based Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| StreamBench: Towards Benchmarking Continuous Improvement of Language Agents | Jun 13, 2024 | Benchmarking, Language Modeling | Code Available | 2 |
| DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | Jun 13, 2024 | Instruction Following, Language Modeling | Unverified | 0 |
| RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Investigating the translation capabilities of Large Language Models trained on parallel data only | Jun 13, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Autonomous Multi-Objective Optimization Using Large Language Model | Jun 13, 2024 | Evolutionary Algorithms, Language Modeling | Unverified | 0 |
| Chain-of-Though (CoT) prompting strategies for medical error detection and correction | Jun 13, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| On Softmax Direct Preference Optimization for Recommendation | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Unlearning with Control: Assessing Real-world Utility for Large Language Model Unlearning | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Generative AI-based Prompt Evolution Engineering Design Optimization With Vision-Language Model | Jun 13, 2024 | 3D Shape Representation, Language Modeling | Unverified | 0 |
| Explore the Limits of Omni-modal Pretraining at Scale | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Jun 13, 2024 | Decision Making, image-classification | Code Available | 0 |
| Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model | Jun 13, 2024 | Diagnostic, Image Retrieval | Code Available | 2 |
| Transformers meet Neural Algorithmic Reasoners | Jun 13, 2024 | Graph Neural Network, Language Modeling | Unverified | 0 |
| Enhancing Domain Adaptation through Prompt Gradient Alignment | Jun 13, 2024 | Domain Adaptation, Language Modeling | Code Available | 1 |
| ElicitationGPT: Text Elicitation Mechanisms via Language Models | Jun 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation | Jun 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Advancing High Resolution Vision-Language Models in Biomedicine | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation | Jun 12, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| Figuratively Speaking: Authorship Attribution via Multi-Task Figurative Language Modeling | Jun 12, 2024 | Authorship Attribution, Language Modeling | Code Available | 0 |
| Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks | Jun 12, 2024 | Benchmarking, Chatbot | Code Available | 3 |
| Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences | Jun 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Discovering Preference Optimization Algorithms with and for Large Language Models | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| An Empirical Study of Mamba-based Language Models | Jun 12, 2024 | 16k, In-Context Learning | Unverified | 0 |
| VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jun 12, 2024 | Image Generation, Language Modeling | Code Available | 5 |
| CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction | Jun 12, 2024 | Decoder, Language Modeling | Unverified | 0 |
| PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models | Jun 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference | Jun 12, 2024 | All, Language Modeling | Unverified | 0 |
| Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation | Jun 12, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents | Jun 12, 2024 | Benchmarking, Language Modeling | Unverified | 0 |
| Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Analyzing constrained LLM through PDFA-learning | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |