| LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis | Jan 9, 2025 | Emotion RecognitionLanguage Modeling | —Unverified | 0 |
| Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TreeKV: Smooth Key-Value Cache Compression with Tree Structures | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning | Jan 8, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction | Jan 8, 2025 | Crop Yield PredictionLanguage Modeling | —Unverified | 0 |
| ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono | Jan 7, 2025 | Code GenerationComputational Efficiency | —Unverified | 0 |
| Investigating the Impact of Data Selection Strategies on Language Model Performance | Jan 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AI-Driven Reinvention of Hydrological Modeling for Accurate Predictions and Interpretation to Transform Earth System Modeling | Jan 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation | Jan 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds | Jan 7, 2025 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein | Jan 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders | Jan 6, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model | Jan 6, 2025 | DiagnosticEnsemble Learning | —Unverified | 0 |
| Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training | Jan 6, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment | Jan 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine | Jan 5, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| LLMPC: Large Language Model Predictive Control | Jan 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Decoding fMRI Data into Captions using Prefix Language Modeling | Jan 5, 2025 | Brain DecodingImage Captioning | CodeCode Available | 0 |
| Towards the Anonymization of the Language Modeling | Jan 5, 2025 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |
| From Superficial Patterns to Semantic Understanding: Fine-Tuning Language Models on Contrast Sets | Jan 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeServe: Towards Affordable Offline LLM Inference via Decentralization | Jan 4, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Learning Evolution via Optimization Knowledge Adaptation | Jan 4, 2025 | Evolutionary AlgorithmsLanguage Modeling | —Unverified | 0 |
| Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving | Jan 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption GenerationDenoising | —Unverified | 0 |
| Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions | Jan 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PersonaAI: Leveraging Retrieval-Augmented Generation and Personalized Context for AI-Driven Digital Avatars | Jan 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reading Between the Lines: A dataset and a study on why some texts are tougher than others | Jan 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models | Jan 3, 2025 | Binary ClassificationFace Anti-Spoofing | —Unverified | 0 |
| Integrating Domain Knowledge into Large Language Models for Enhanced Fashion Recommendations | Jan 3, 2025 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| CarbonChat: Large Language Model-Based Corporate Carbon Emission Analysis and Climate Knowledge Q&A System | Jan 3, 2025 | ChunkingHallucination | —Unverified | 0 |
| Abstractive Text Summarization for Contemporary Sanskrit Prose: Issues and Challenges | Jan 3, 2025 | Abstractive Text SummarizationLanguage Modeling | —Unverified | 0 |
| Does a Large Language Model Really Speak in Human-Like Language? | Jan 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability | Jan 2, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | Jan 2, 2025 | DiversityHallucination | —Unverified | 0 |
| Risks of Cultural Erasure in Large Language Models | Jan 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NeutraSum: A Language Model can help a Balanced Media Diet by Neutralizing News Summaries | Jan 2, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| MSWA: Refining Local Attention with Multi-ScaleWindow Attention | Jan 2, 2025 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model | Jan 2, 2025 | DescriptiveLanguage Modeling | —Unverified | 0 |
| MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output | Jan 1, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation | Jan 1, 2025 | Dialogue GenerationLanguage Modeling | CodeCode Available | 0 |
| Video Language Model Pretraining with Spatio-temporal Masking | Jan 1, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Navigating Nuance: In Quest for Political Truth | Jan 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Symbolic Representation for Any-to-Any Generative Tasks | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | Jan 1, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |