| un^2CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP | May 30, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation | May 29, 2025 | Caption Generation, Language Modeling | Code Available | 1 |
| SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | May 29, 2025 | Adversarial Attack, Large Language Model | Code Available | 1 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8k, Avg | Code Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data Augmentation, Information Retrieval | Code Available | 1 |
| Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion | May 26, 2025 | Denoising, Image Generation | Code Available | 1 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering | May 26, 2025 | Chunking, Large Language Model | Code Available | 1 |
| Think or Not? Exploring Thinking Efficiency in Large Reasoning Models via an Information-Theoretic Lens | May 23, 2025 | Large Language Model | Code Available | 1 |
| UniTTS: An end-to-end TTS system without decoupling of acoustic and semantic information | May 23, 2025 | Large Language Model, Quantization | Code Available | 1 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial Optimization, Language Modeling | Code Available | 1 |
| ChemMLLM: Chemical Multimodal Large Language Model | May 22, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | May 21, 2025 | Large Language Model, Task Planning | Code Available | 1 |
| How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior | May 21, 2025 | Large Language Model, Management | Code Available | 1 |
| PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | May 21, 2025 | Large Language Model, scientific discovery | Code Available | 1 |
| U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | May 20, 2025 | cross-modal alignment, Language Modeling | Code Available | 1 |
| BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | May 19, 2025 | Binary Classification, DeepFake Detection | Code Available | 1 |
| Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation | May 17, 2025 | Dataset Generation, GPU | Code Available | 1 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | May 16, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual Learning, Language Modeling | Code Available | 1 |
| Measuring General Intelligence with Generated Games | May 12, 2025 | In-Context Learning, Large Language Model | Code Available | 1 |
| MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | May 11, 2025 | Emotion Classification, Large Language Model | Code Available | 1 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language Model, Navigate | Code Available | 1 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | May 2, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation | Apr 30, 2025 | Diagnostic, Large Language Model | Code Available | 1 |
| PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Apr 28, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Apr 25, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Apr 25, 2025 | Large Language Model | Code Available | 1 |
| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Apr 19, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem Solving, Large Language Model | Code Available | 1 |
| Retrieval-Augmented Generation with Conflicting Evidence | Apr 17, 2025 | Large Language Model, Misinformation | Code Available | 1 |
| SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding | Apr 17, 2025 | Image Generation, Large Language Model | Code Available | 1 |
| AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection | Apr 16, 2025 | Anomaly Detection, Large Language Model | Code Available | 1 |
| HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Apr 16, 2025 | High-Level Synthesis, Large Language Model | Code Available | 1 |
| Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes | Apr 14, 2025 | Image Generation, Large Language Model | Code Available | 1 |
| Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics Simulations | Apr 13, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 |
| Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric | Apr 10, 2025 | Fairness, Large Language Model | Code Available | 1 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPU, Large Language Model | Code Available | 1 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | Clustering, Hallucination | Code Available | 1 |
| Representation Bending for Large Language Model Safety | Apr 2, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving | Mar 31, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 |
| Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages | Mar 30, 2025 | Automatic Speech Recognition, Language Modeling | Code Available | 1 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | All, Language Modeling | Code Available | 1 |