| CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting | May 28, 2025 | Style Transfer | Code Available | 1 |
| IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction | May 28, 2025 | Missing Values, Self-Supervised Learning | Code Available | 1 |
| Scalable Parameter and Memory Efficient Pretraining for LLM: Recent Algorithmic Advances and Benchmarking | May 28, 2025 | Benchmarking | Code Available | 1 |
| Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs | May 28, 2025 | | Code Available | 1 |
| Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge | May 28, 2025 | Depression Detection, Diagnostic | Code Available | 1 |
| GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking | May 28, 2025 | Benchmarking, Text Spotting | Code Available | 1 |
| Update Your Transformer to the Latest Release: Re-Basin of Task Vectors | May 28, 2025 | Re-basin | Code Available | 1 |
| FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design | May 28, 2025 | Graph Neural Network | Code Available | 1 |
| SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem | May 28, 2025 | Benchmarking | Code Available | 1 |
| Pre-Training Curriculum for Multi-Token Prediction in Language Models | May 28, 2025 | Prediction | Code Available | 1 |
| ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge | May 28, 2025 | Imitation Learning, Math | Code Available | 1 |
| Self-orthogonalizing attractor neural networks emerging from the free energy principle | May 28, 2025 | | Code Available | 1 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Training Language Models to Generate Quality Code with Program Analysis Feedback | May 28, 2025 | Code Generation | Code Available | 1 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8k, Avg | Code Available | 1 |
| See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction | May 27, 2025 | Image Enhancement, Low-Light Image Enhancement | Code Available | 1 |
| DeSocial: Blockchain-based Decentralized Social Networks | May 27, 2025 | Model Selection, Prediction | Code Available | 1 |
| R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning | May 27, 2025 | Code Generation, Reinforcement Learning (RL) | Code Available | 1 |
| MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems | May 27, 2025 | | Code Available | 1 |
| Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility | May 27, 2025 | 3DGS, Scheduling | Code Available | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| RefAV: Towards Planning-Centric Scenario Mining | May 27, 2025 | Autonomous Vehicles, Motion Planning | Code Available | 1 |
| Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space | May 27, 2025 | Prompt Engineering | Code Available | 1 |
| ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval | May 27, 2025 | Image Retrieval, Retrieval | Code Available | 1 |
| Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations | May 27, 2025 | | Code Available | 1 |
| MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding | May 27, 2025 | Reinforcement Learning (RL), Video Understanding | Code Available | 1 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| LPOI: Listwise Preference Optimization for Vision Language Models | May 27, 2025 | Object | Code Available | 1 |
| AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Crop Mapping | May 27, 2025 | | Code Available | 1 |
| Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals | May 27, 2025 | Virtual Try-Off, Virtual Try-on | Code Available | 1 |
| Taylor expansion-based Kolmogorov-Arnold network for blind image quality assessment | May 27, 2025 | Blind Image Quality Assessment, Computational Efficiency | Code Available | 1 |
| Minute-Long Videos with Dual Parallelisms | May 27, 2025 | Denoising, GPU | Code Available | 1 |
| Bencher: Simple and Reproducible Benchmarking for Black-Box Optimization | May 27, 2025 | Benchmarking | Code Available | 1 |
| FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information | May 27, 2025 | Concept Alignment, Multi-class Classification | Code Available | 1 |
| Dual-Polarization Stacked Intelligent Metasurfaces for Holographic MIMO | May 27, 2025 | | Code Available | 1 |
| FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation | May 27, 2025 | Benchmarking, Decision Making | Code Available | 1 |
| Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | May 27, 2025 | Multi-hop Question Answering, Question Answering | Code Available | 1 |
| AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage | May 27, 2025 | | Code Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| DiMoSR: Feature Modulation via Multi-Branch Dilated Convolutions for Efficient Image Super-Resolution | May 27, 2025 | Computational Efficiency, Image Super-Resolution | Code Available | 1 |
| RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images | May 27, 2025 | Anomaly Detection, Binarization | Code Available | 1 |
| FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention | May 27, 2025 | | Code Available | 1 |
| Pretraining Language Models to Ponder in Continuous Space | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Music Source Restoration | May 27, 2025 | Music Source Separation | Code Available | 1 |
| FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models | May 26, 2025 | Token Reduction | Code Available | 1 |
| OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender | May 26, 2025 | 3DGS, 3D Reconstruction | Code Available | 1 |
| Efficient Multi-modal Long Context Learning for Training-free Adaptation | May 26, 2025 | | Code Available | 1 |
| Lifelong Safety Alignment for Language Models | May 26, 2025 | Safety Alignment | Code Available | 1 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data Augmentation, Information Retrieval | Code Available | 1 |
| Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs | May 26, 2025 | Code Generation, Recommendation Systems | Code Available | 1 |