| Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Jun 25, 2025 | GPUVisual Tracking | CodeCode Available | 1 |
| Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch | Jun 25, 2025 | Computational EfficiencyGPR | CodeCode Available | 1 |
| A foundation model with multi-variate parallel attention to generate neuronal activity | Jun 25, 2025 | Seizure DetectionTime Series | CodeCode Available | 1 |
| Q-resafe: Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models | Jun 25, 2025 | Quantization | CodeCode Available | 1 |
| WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads | Jun 25, 2025 | Benchmarking | CodeCode Available | 1 |
| DPLib: A Standard Benchmark Library for Distributed Power System Analysis and Optimization | Jun 25, 2025 | Distributed Optimization | CodeCode Available | 1 |
| MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Jun 24, 2025 | DiagnosticMedical Diagnosis | CodeCode Available | 1 |
| Augmenting Multi-Agent Communication with State Delta Trajectory | Jun 24, 2025 | | CodeCode Available | 1 |
| Self-Supervised Multimodal NeRF for Autonomous Driving | Jun 24, 2025 | Autonomous DrivingNeRF | CodeCode Available | 1 |
| HERCULES: Hierarchical Embedding-based Recursive Clustering Using LLMs for Efficient Summarization | Jun 24, 2025 | Clustering | CodeCode Available | 1 |
| SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing Images | Jun 24, 2025 | | CodeCode Available | 1 |
| Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models | Jun 24, 2025 | Camouflaged Object SegmentationSegmentation | CodeCode Available | 1 |
| ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model | Jun 24, 2025 | | CodeCode Available | 1 |
| Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting | Jun 24, 2025 | DenoisingWeather Forecasting | CodeCode Available | 1 |
| MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications | Jun 24, 2025 | | CodeCode Available | 1 |
| KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality | Jun 24, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| EvDetMAV: Generalized MAV Detection from Moving Event Cameras | Jun 24, 2025 | | CodeCode Available | 1 |
| Fast and Distributed Equivariant Graph Neural Networks by Virtual Node Learning | Jun 24, 2025 | Graph Learning | CodeCode Available | 1 |
| Introducing EG-IPT and ipt~: a novel electric guitar dataset and a new Max/MSP object for real-time classification of instrumental playing techniques | Jun 24, 2025 | | CodeCode Available | 1 |
| Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture | Jun 24, 2025 | Decoder | CodeCode Available | 1 |
| EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression | Jun 24, 2025 | Crowd CountingDensity Estimation | CodeCode Available | 1 |
| LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR | Jun 23, 2025 | Decoder | CodeCode Available | 1 |
| NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis | Jun 23, 2025 | Adversarial RobustnessImage Compression | CodeCode Available | 1 |
| The Within-Orbit Adaptive Leapfrog No-U-Turn Sampler | Jun 23, 2025 | | CodeCode Available | 1 |
| Riemannian generative decoder | Jun 23, 2025 | DecoderRepresentation Learning | CodeCode Available | 1 |
| Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models | Jun 23, 2025 | Image Generation | CodeCode Available | 1 |
| MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Jun 23, 2025 | 3D ReconstructionNeRF | CodeCode Available | 1 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 |
| Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection | Jun 23, 2025 | Anomaly DetectionVideo Anomaly Detection | CodeCode Available | 1 |
| Parallel Continuous Chain-of-Thought with Jacobi Iteration | Jun 23, 2025 | | CodeCode Available | 1 |
| Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions | Jun 23, 2025 | NeRFTalking Head Generation | CodeCode Available | 1 |
| What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language Models | Jun 23, 2025 | | CodeCode Available | 1 |
| Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention | Jun 23, 2025 | DecoderImage Segmentation | CodeCode Available | 1 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Jun 23, 2025 | GPUGSM8K | CodeCode Available | 1 |
| Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review | Jun 23, 2025 | Medical Image AnalysisPrompt Learning | CodeCode Available | 1 |
| DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling | Jun 23, 2025 | Motion Synthesis | CodeCode Available | 1 |
| DIP: Unsupervised Dense In-Context Post-training of Visual Representations | Jun 23, 2025 | GPUMeta-Learning | CodeCode Available | 1 |
| LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earth | Jun 23, 2025 | CPU | CodeCode Available | 1 |
| Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective | Jun 22, 2025 | In-Context LearningLarge Language Model | CodeCode Available | 1 |
| h-calibration: Rethinking Classifier Recalibration with Probabilistic Error-Bounded Objective | Jun 22, 2025 | scoring rule | CodeCode Available | 1 |
| MiCo: Multiple Instance Learning with Context-Aware Clustering for Whole Slide Image Analysis | Jun 22, 2025 | ClusteringMultiple Instance Learning | CodeCode Available | 1 |
| OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learning | Jun 22, 2025 | Parameter PredictionPrediction | CodeCode Available | 1 |
| DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Jun 21, 2025 | Autonomous DrivingDescriptive | CodeCode Available | 1 |
| AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody-Antigen Affinity Ranking | Jun 21, 2025 | Metric LearningProtein Language Model | CodeCode Available | 1 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Jun 21, 2025 | BenchmarkingCPU | CodeCode Available | 1 |
| TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs | Jun 20, 2025 | Code Generation | CodeCode Available | 1 |
| TextBraTS: Text-Guided Volumetric Brain Tumor Segmentation with Innovative Dataset Development and Fusion Module Exploration | Jun 20, 2025 | Brain Tumor SegmentationImage Segmentation | CodeCode Available | 1 |
| Visual-Instructed Degradation Diffusion for All-in-One Image Restoration | Jun 20, 2025 | AllDeblurring | CodeCode Available | 1 |
| Mesh-Informed Neural Operator : A Transformer Generative Approach | Jun 20, 2025 | Operator learning | CodeCode Available | 1 |
| A Minimalist Optimizer Design for LLM Pretraining | Jun 20, 2025 | | CodeCode Available | 1 |