| Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis | Jan 9, 2025 | Emotion RecognitionLanguage Modeling | —Unverified | 0 |
| Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning | Jan 8, 2025 | GPULanguage Modeling | —Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Retrieval Based on Generative AI | Jan 8, 2025 | Large Language ModelMultiple-choice | —Unverified | 0 |
| Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction | Jan 8, 2025 | Crop Yield PredictionLanguage Modeling | —Unverified | 0 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis | Jan 7, 2025 | Computed Tomography (CT)Large Language Model | —Unverified | 0 |
| Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT | Jan 7, 2025 | Graph AttentionKnowledge Graphs | —Unverified | 0 |
| KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration | Jan 7, 2025 | Anomaly DetectionAnomaly Segmentation | —Unverified | 0 |
| Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation | Jan 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono | Jan 7, 2025 | Code GenerationComputational Efficiency | —Unverified | 0 |
| AI-Driven Reinvention of Hydrological Modeling for Accurate Predictions and Interpretation to Transform Earth System Modeling | Jan 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Developing an Artificial Intelligence Tool for Personalized Breast Cancer Treatment Plans based on the NCCN Guidelines | Jan 6, 2025 | Large Language ModelRAG | —Unverified | 0 |
| Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model | Jan 6, 2025 | DiagnosticEnsemble Learning | —Unverified | 0 |
| IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment | Jan 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine | Jan 5, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| LLMPC: Large Language Model Predictive Control | Jan 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems | Jan 5, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 |
| Can ChatGPT implement finite element models for geotechnical engineering applications? | Jan 4, 2025 | Large Language ModelPrompt Engineering | —Unverified | 0 |
| DeServe: Towards Affordable Offline LLM Inference via Decentralization | Jan 4, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving | Jan 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CarbonChat: Large Language Model-Based Corporate Carbon Emission Analysis and Climate Knowledge Q&A System | Jan 3, 2025 | ChunkingHallucination | —Unverified | 0 |
| AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Jan 3, 2025 | Large Language Model | —Unverified | 0 |
| Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models | Jan 3, 2025 | Binary ClassificationFace Anti-Spoofing | —Unverified | 0 |
| Integrating Domain Knowledge into Large Language Models for Enhanced Fashion Recommendations | Jan 3, 2025 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| PersonaAI: Leveraging Retrieval-Augmented Generation and Personalized Context for AI-Driven Digital Avatars | Jan 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption GenerationDenoising | —Unverified | 0 |
| MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model | Jan 2, 2025 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Does a Large Language Model Really Speak in Human-Like Language? | Jan 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | Jan 2, 2025 | DiversityHallucination | —Unverified | 0 |
| Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Jan 1, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding | Jan 1, 2025 | 3DGSLarge Language Model | —Unverified | 0 |
| Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models | Jan 1, 2025 | Large Language ModelObject | —Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | Jan 1, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines | Jan 1, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving | Jan 1, 2025 | Autonomous DrivingCARLA longest6 | —Unverified | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Jan 1, 2025 | Code GenerationImage Generation | —Unverified | 0 |
| Adjoint sharding for very long context training of state space models | Jan 1, 2025 | GPULarge Language Model | —Unverified | 0 |
| Video-Bench: Human-Aligned Video Generation Benchmark | Jan 1, 2025 | Large Language ModelVideo Generation | —Unverified | 0 |
| VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Jan 1, 2025 | Large Language ModelVideo Segmentation | —Unverified | 0 |
| Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene | Jan 1, 2025 | Graph GenerationLarge Language Model | —Unverified | 0 |
| HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | Jan 1, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers | Jan 1, 2025 | Bias DetectionLarge Language Model | —Unverified | 0 |
| ChatHuman: Chatting about 3D Humans with Tools | Jan 1, 2025 | Human-Object Interaction DetectionIn-Context Learning | —Unverified | 0 |