| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | Jan 1, 2025 | Attribute, Language Modeling | Unverified | 0 |
| SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis | Jan 1, 2025 | Large Language Model | Code Available | 1 |
| Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding | Jan 1, 2025 | 3DGS, Large Language Model | Unverified | 0 |
| Video-Bench: Human-Aligned Video Generation Benchmark | Jan 1, 2025 | Large Language Model, Video Generation | Unverified | 0 |
| DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving | Jan 1, 2025 | Autonomous Driving, CARLA longest6 | Unverified | 0 |
| Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering | Jan 1, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Jan 1, 2025 | Large Language Model, Video Segmentation | Unverified | 0 |
| ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models | Jan 1, 2025 | Large Language Model, Object | Unverified | 0 |
| HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ChatHuman: Chatting about 3D Humans with Tools | Jan 1, 2025 | Human-Object Interaction Detection, In-Context Learning | Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | Jan 1, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers | Jan 1, 2025 | Bias Detection, Large Language Model | Unverified | 0 |
| Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Jan 1, 2025 | Data Augmentation, Language Modeling | Code Available | 0 |
| Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines | Jan 1, 2025 | Information Retrieval, Language Modeling | Unverified | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Jan 1, 2025 | Code Generation, Image Generation | Unverified | 0 |
| Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Jan 1, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Adjoint sharding for very long context training of state space models | Jan 1, 2025 | GPU, Large Language Model | Unverified | 0 |
| Towards Sustainable Large Language Model Serving | Dec 31, 2024 | GPU, Language Modeling | Unverified | 0 |
| CancerKG.ORG A Web-scale, Interactive, Verifiable Knowledge Graph-LLM Hybrid for Assisting with Optimal Cancer Treatment and Care | Dec 31, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Efficient Standardization of Clinical Notes using Large Language Models | Dec 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation | Dec 31, 2024 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Generative Emergent Communication: Large Language Model is a Collective World Model | Dec 31, 2024 | Bayesian Inference, Language Modeling | Unverified | 0 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents | Dec 30, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| GroverGPT: A Large Language Model with 8 Billion Parameters for Quantum Searching | Dec 30, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble | Dec 30, 2024 | knowledge editing, Language Modeling | Unverified | 0 |
| Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model | Dec 30, 2024 | Edge-computing, Information Retrieval | Unverified | 0 |
| Enhancing Annotated Bibliography Generation with LLM Ensembles | Dec 30, 2024 | Diversity, Language Modeling | Unverified | 0 |
| Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Dec 30, 2024 | Cloud Computing, Code Generation | Code Available | 1 |
| Facilitating large language model Russian adaptation with Learned Embedding Propagation | Dec 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| ICLR: In-Context Learning of Representations | Dec 29, 2024 | In-Context Learning, Large Language Model | Unverified | 0 |
| Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs | Dec 29, 2024 | Large Language Model, Machine Translation | Unverified | 0 |
| HindiLLM: Large Language Model for Hindi | Dec 29, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Natural Language Fine-Tuning | Dec 29, 2024 | GSM8K, Large Language Model | Code Available | 2 |
| Multi-Objective Large Language Model Unlearning | Dec 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| ST^3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming | Dec 28, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion | Dec 28, 2024 | Large Language Model, text-guided-image-editing | Unverified | 0 |
| MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Dec 27, 2024 | Autonomous Driving, Language Modeling | Code Available | 0 |
| An Engorgio Prompt Makes Large Language Model Babble on | Dec 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization | Dec 27, 2024 | Face Swapping, Image Segmentation | Unverified | 0 |
| Xmodel-2 Technical Report | Dec 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| A Survey on Large Language Model Acceleration based on KV Cache Management | Dec 27, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities | Dec 26, 2024 | Domain Adaptation, Language Modeling | Code Available | 0 |
| From Interests to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries | Dec 26, 2024 | Fairness, Language Modeling | Code Available | 0 |
| SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis | Dec 26, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation | Dec 26, 2024 | Graph Generation, Large Language Model | Unverified | 0 |
| Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning | Dec 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Torque-Aware Momentum | Dec 25, 2024 | image-classification, Image Classification | Unverified | 0 |
| SAFLITE: Fuzzing Autonomous Systems via Large Language Models | Dec 25, 2024 | Large Language Model | Unverified | 0 |