| A Perspective on Literary Metaphor in the Context of Generative AI | Sep 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Grounding Language Models in Autonomous Loco-manipulation Tasks | Sep 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction | Sep 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SCOPE: Sign Language Contextual Processing with Embedding from LLMs | Sep 2, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning | Sep 2, 2024 | Inference OptimizationLanguage Modeling | —Unverified | 0 |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Multimodal Multi-turn Conversation Stance Detection: A Challenge Dataset and Effective Model | Sep 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ContextCite: Attributing Model Generation to Context | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Sample-Efficient Diffusion for Text-To-Speech Synthesis | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Comparing Discrete and Continuous Space LLMs for Speech Recognition | Sep 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education | Aug 31, 2024 | Knowledge TracingLanguage Modeling | —Unverified | 0 |
| FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model | Aug 31, 2024 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 1 |
| Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness | Aug 31, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| OrthoDoc: Multimodal Large Language Model for Assisting Diagnosis in Computed Tomography | Aug 30, 2024 | Computed Tomography (CT)Diagnostic | —Unverified | 0 |
| Speaker Tagging Correction With Non-Autoregressive Language Models | Aug 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| OnlySportsLM: Optimizing Sports-Domain Language Models with SOTA Performance under Billion Parameters | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer | Aug 30, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging | Aug 30, 2024 | DiagnosticImage Segmentation | —Unverified | 0 |
| InkubaLM: A small language model for low-resource African languages | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4kDecoder | CodeCode Available | 2 |
| Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Retrieval-Augmented Natural Language Reasoning for Explainable Visual Question Answering | Aug 30, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning | Aug 30, 2024 | Causal Language ModelingContinual Learning | —Unverified | 0 |
| Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Aug 30, 2024 | Audio CompressionAudio Generation | CodeCode Available | 3 |
| Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChatSUMO: Large Language Model for Automating Traffic Scenario Generation in Simulation of Urban MObility | Aug 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition | Aug 29, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Plausible-Parrots @ MSP2023: Enhancing Semantic Plausibility Modeling using Entity and Event Knowledge | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Aug 29, 2024 | Autonomous DrivingDenoising | —Unverified | 0 |
| SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval | Aug 29, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Aug 29, 2024 | Data AugmentationImage Retrieval | —Unverified | 0 |
| Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Law of Vision Representation in MLLMs | Aug 29, 2024 | cross-modal alignmentLanguage Modeling | CodeCode Available | 2 |
| A Gradient Analysis Framework for Rewarding Good and Penalizing Bad Examples in Language Models | Aug 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation | Aug 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Legilimens: Practical and Unified Content Moderation for Large Language Model Services | Aug 28, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Towards Logically Sound Natural Language Reasoning with Logic-Enhanced Language Model Agents | Aug 28, 2024 | Decision MakingFormal Logic | CodeCode Available | 0 |
| LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models | Aug 28, 2024 | Continual LearningKnowledge Probing | —Unverified | 0 |
| Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization | Aug 28, 2024 | Extractive SummarizationExtractive Text Summarization | —Unverified | 0 |
| Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input | Aug 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient LLM Scheduling by Learning to Rank | Aug 28, 2024 | BlockingChatbot | CodeCode Available | 2 |