| MinMo: A Multimodal Large Language Model for Seamless Voice Interaction | Jan 10, 2025 | Instruction Following, Language Modeling | Unverified | 0 |
| Effective faking of verbal deception detection with target-aligned adversarial attacks | Jan 10, 2025 | Adversarial Attack, Deception Detection | Unverified | 0 |
| Automating Date Format Detection for Data Visualization | Jan 10, 2025 | Data Visualization, Language Modeling | Unverified | 0 |
| Large Language Models for Bioinformatics | Jan 10, 2025 | Domain Adaptation, Drug Discovery | Unverified | 0 |
| Personalized Language Model Learning on Text Data Without User Identifiers | Jan 10, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Scalable Vision Language Model Training via High Quality Data Curation | Jan 10, 2025 | Instruction Following, Language Modeling | Unverified | 0 |
| Towards a Probabilistic Framework for Analyzing and Improving LLM-Enabled Software | Jan 10, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts | Jan 10, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Jan 10, 2025 | Image Captioning, Language Modeling | Code Available | 3 |
| JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis | Jan 9, 2025 | Emotion Recognition, Language Modeling | Unverified | 0 |
| TreeKV: Smooth Key-Value Cache Compression with Tree Structures | Jan 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding | Jan 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Jan 9, 2025 | Decision Making, Language Modeling | Code Available | 2 |
| Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs | Jan 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model | Jan 9, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Jan 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jan 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Jan 8, 2025 | Decoder, Emotional Speech Synthesis | Code Available | 2 |
| Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning | Jan 8, 2025 | GPU, Language Modeling | Unverified | 0 |
| Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction | Jan 8, 2025 | Crop Yield Prediction, Language Modeling | Unverified | 0 |
| AI-Driven Reinvention of Hydrological Modeling for Accurate Predictions and Interpretation to Transform Earth System Modeling | Jan 7, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Jan 7, 2025 | 2k, Language Modeling | Code Available | 5 |
| Investigating the Impact of Data Selection Strategies on Language Model Performance | Jan 7, 2025 | Language Modeling, Language Modelling | Code Available | 0 |