| Is Micro-expression Ethnic Leaning? | Jul 14, 2025 | | CodeCode Available | 0 |
| Large Population Models | Jul 14, 2025 | | CodeCode Available | 0 |
| Demonstrating the Octopi-1.5 Visual-Tactile-Language Model | Jul 14, 2025 | | CodeCode Available | 0 |
| Boosting Multimodal Learning via Disentangled Gradient Learning | Jul 14, 2025 | | CodeCode Available | 0 |
| FTCFormer: Fuzzy Token Clustering Transformer for Image Classification | Jul 14, 2025 | | CodeCode Available | 0 |
| DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs | Jul 14, 2025 | | CodeCode Available | 0 |
| CWNet: Causal Wavelet Network for Low-Light Image Enhancement | Jul 14, 2025 | | CodeCode Available | 0 |
| Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal | Jul 14, 2025 | | CodeCode Available | 0 |
| Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval | Jul 14, 2025 | | CodeCode Available | 0 |
| A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images | Jul 14, 2025 | | CodeCode Available | 0 |
| ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users | Jul 14, 2025 | | CodeCode Available | 0 |
| RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction | Jul 14, 2025 | | CodeCode Available | 0 |
| BenchReAD: A systematic benchmark for retinal anomaly detection | Jul 14, 2025 | | CodeCode Available | 0 |
| Training Dynamics Underlying Language Model Scaling Laws: Loss Deceleration and Zero-Sum Learning | Jul 14, 2025 | | CodeCode Available | 0 |
| Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps | Jul 14, 2025 | | CodeCode Available | 0 |
| LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models | Jul 14, 2025 | Long-range modeling | CodeCode Available | 0 |
| CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks | Jul 14, 2025 | BenchmarkingCode Generation | —Unverified | 0 |
| CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance | Jul 14, 2025 | BenchmarkingCode Generation | —Unverified | 0 |
| Turning the Tide: Repository-based Code Reflection | Jul 14, 2025 | Code GenerationDiversity | —Unverified | 0 |
| A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Flight Computers | Jul 14, 2025 | Image SegmentationSegmentation | CodeCode Available | 0 |
| Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder | Jul 14, 2025 | Self-Supervised Learning | —Unverified | 0 |
| Vision Language Action Models in Robotic Manipulation: A Systematic Review | Jul 14, 2025 | Dataset GenerationNatural Language Understanding | CodeCode Available | 2 |
| Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis | Jul 14, 2025 | Decision MakingRAG | —Unverified | 0 |
| Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning | Jul 14, 2025 | Federated LearningKnowledge Distillation | —Unverified | 0 |
| MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Jul 14, 2025 | 2kImage Generation | CodeCode Available | 2 |
| Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry | Jul 14, 2025 | 3D Reconstruction | —Unverified | 0 |
| Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | Jul 14, 2025 | image-classificationImage Classification | —Unverified | 0 |
| 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Jul 14, 2025 | 3DGSAdversarial Attack | —Unverified | 0 |
| Lightweight Model for Poultry Disease Detection from Fecal Images Using Multi-Color Space Feature Optimization and Machine Learning | Jul 14, 2025 | Computational EfficiencyDimensionality Reduction | —Unverified | 0 |
| EmbRACE-3K: Embodied Reasoning and Action in Complex Environments | Jul 14, 2025 | Scene UnderstandingSpatial Reasoning | —Unverified | 0 |
| Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI | Jul 14, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation | Jul 14, 2025 | Federated LearningIndoor Localization | —Unverified | 0 |
| Efficient Federated Learning with Heterogeneous Data and Adaptive Dropout | Jul 14, 2025 | Federated Learning | —Unverified | 0 |
| MTF-Grasp: A Multi-tier Federated Learning Approach for Robotic Grasping | Jul 14, 2025 | Federated LearningRobotic Grasping | —Unverified | 0 |
| Cameras as Relative Positional Encoding | Jul 14, 2025 | Depth EstimationNovel View Synthesis | —Unverified | 0 |
| A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends | Jul 14, 2025 | document understandingOptical Character Recognition | —Unverified | 0 |
| Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation | Jul 14, 2025 | Domain AdaptationFederated Learning | CodeCode Available | 0 |
| Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix | Jul 14, 2025 | Privacy Preserving | CodeCode Available | 0 |
| Graph World Model | Jul 14, 2025 | Graph Learningmodel | CodeCode Available | 1 |
| Glance-MCMT: A General MCMT Framework with Glance Initialization and Progressive Association | Jul 14, 2025 | | CodeCode Available | 0 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | DecoderGPU | CodeCode Available | 0 |
| Test-Time Canonicalization by Foundation Models for Robust Perception | Jul 14, 2025 | | CodeCode Available | 0 |
| FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching | Jul 14, 2025 | Keypoint Detection | —Unverified | 0 |
| MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second | Jul 14, 2025 | Novel View SynthesisPoint Tracking | —Unverified | 0 |
| ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space | Jul 14, 2025 | Out of Distribution (OOD) Detection | CodeCode Available | 0 |
| Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code Understanding | Jul 14, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 9 |
| SentiDrop: A Multi Modal Machine Learning model for Predicting Dropout in Distance Learning | Jul 14, 2025 | Feature ImportanceSentiment Analysis | —Unverified | 0 |
| Convergence of Agnostic Federated Averaging | Jul 14, 2025 | Federated Learning | —Unverified | 0 |
| 4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos | Jul 14, 2025 | | CodeCode Available | 1 |
| Benchmarking and Evaluation of AI Models in Biology: Outcomes and Recommendations from the CZI Virtual Cells Workshop | Jul 14, 2025 | Benchmarking | —Unverified | 0 |