| AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Jan 3, 2025 | Large Language Model | Unverified | 0 |
| Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models | Jan 3, 2025 | Binary Classification, Face Anti-Spoofing | Unverified | 0 |
| Integrating Domain Knowledge into Large Language Models for Enhanced Fashion Recommendations | Jan 3, 2025 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| PersonaAI: Leveraging Retrieval-Augmented Generation and Personalized Context for AI-Driven Digital Avatars | Jan 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption Generation, Denoising | Unverified | 0 |
| MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model | Jan 2, 2025 | Descriptive, Language Modeling | Unverified | 0 |
| Does a Large Language Model Really Speak in Human-Like Language? | Jan 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | Jan 2, 2025 | Diversity, Hallucination | Unverified | 0 |
| Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Jan 1, 2025 | Data Augmentation, Language Modeling | Code Available | 0 |
| Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding | Jan 1, 2025 | 3DGS, Large Language Model | Unverified | 0 |
| Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models | Jan 1, 2025 | Large Language Model, Object | Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | Jan 1, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines | Jan 1, 2025 | Information Retrieval, Language Modeling | Unverified | 0 |
| DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving | Jan 1, 2025 | Autonomous Driving, CARLA longest6 | Unverified | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Jan 1, 2025 | Code Generation, Image Generation | Unverified | 0 |
| Adjoint sharding for very long context training of state space models | Jan 1, 2025 | GPU, Large Language Model | Unverified | 0 |
| Video-Bench: Human-Aligned Video Generation Benchmark | Jan 1, 2025 | Large Language Model, Video Generation | Unverified | 0 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Jan 1, 2025 | Large Language Model, Video Segmentation | Unverified | 0 |
| Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene | Jan 1, 2025 | Graph Generation, Large Language Model | Unverified | 0 |
| HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | Jan 1, 2025 | Attribute, Language Modeling | Unverified | 0 |
| Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers | Jan 1, 2025 | Bias Detection, Large Language Model | Unverified | 0 |
| ChatHuman: Chatting about 3D Humans with Tools | Jan 1, 2025 | Human-Object Interaction Detection, In-Context Learning | Unverified | 0 |