| EgoBlind: Towards Egocentric Visual Assistance for the Blind | Mar 11, 2025 | | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| SAS: Segment Any 3D Scene with Integrated 2D Priors | Mar 11, 2025 | Instance SegmentationSemantic Segmentation | CodeCode Available | 1 |
| NullFace: Training-Free Localized Face Anonymization | Mar 11, 2025 | AttributeFace Anonymization | CodeCode Available | 1 |
| MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution | Mar 11, 2025 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 1 |
| Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation | Mar 11, 2025 | | CodeCode Available | 1 |
| PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability | Mar 11, 2025 | Visual Reasoning | CodeCode Available | 1 |
| Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies | Mar 11, 2025 | Conformal PredictionImitation Learning | CodeCode Available | 1 |
| All That Glitters Is Not Gold: Key-Secured 3D Secrets within 3D Gaussian Splatting | Mar 10, 2025 | 3DGSAll | CodeCode Available | 1 |
| Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds | Mar 10, 2025 | Edge-computingGait Recognition | CodeCode Available | 1 |
| REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Mar 10, 2025 | Instruction FollowingKeypoint Detection | CodeCode Available | 1 |
| ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation | Mar 10, 2025 | Code Generation | CodeCode Available | 1 |
| SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks | Mar 10, 2025 | Person Re-IdentificationPerson Search | CodeCode Available | 1 |
| Performance-driven Constrained Optimal Auto-Tuner for MPC | Mar 10, 2025 | Autonomous RacingBayesian Optimization | CodeCode Available | 1 |
| Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Mar 10, 2025 | 4kBenchmarking | CodeCode Available | 1 |
| Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting | Mar 10, 2025 | 3DGS | CodeCode Available | 1 |
| Effective and Efficient Masked Image Generation Models | Mar 10, 2025 | Image Generation | CodeCode Available | 1 |
| Lshan-1.0 Technical Report | Mar 10, 2025 | Large Language Model | CodeCode Available | 1 |
| SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements | Mar 10, 2025 | Objectobject-detection | CodeCode Available | 1 |
| COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition | Mar 10, 2025 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 1 |
| RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing | Mar 10, 2025 | Code GenerationHumanEval | CodeCode Available | 1 |
| TokenButler: Token Importance is Predictable | Mar 10, 2025 | | CodeCode Available | 1 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models | Mar 10, 2025 | Computational Efficiency | CodeCode Available | 1 |
| SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud Registration | Mar 10, 2025 | Point Cloud Registration | CodeCode Available | 1 |
| Process-Supervised LLM Recommenders via Flow-guided Tuning | Mar 10, 2025 | DiversityFairness | CodeCode Available | 1 |
| A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning | Mar 10, 2025 | ObjectScene Understanding | CodeCode Available | 1 |
| Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data Classification | Mar 10, 2025 | Classification | CodeCode Available | 1 |
| SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models | Mar 10, 2025 | Model Editing | CodeCode Available | 1 |
| RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code | Mar 10, 2025 | Specificity | CodeCode Available | 1 |
| VisRL: Intention-Driven Visual Perception via Reinforced Reasoning | Mar 10, 2025 | Reinforcement Learning (RL)Visual Reasoning | CodeCode Available | 1 |
| On the Generalization of Representation Uncertainty in Earth Observation | Mar 10, 2025 | Earth ObservationMulti-Label Classification | CodeCode Available | 1 |
| Implicit Reasoning in Transformers is Reasoning through Shortcuts | Mar 10, 2025 | Mathematical Reasoning | CodeCode Available | 1 |
| HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions | Mar 10, 2025 | Point Cloud Registration | CodeCode Available | 1 |
| GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Mar 10, 2025 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| Interactive Medical Image Analysis with Concept-based Similarity Reasoning | Mar 10, 2025 | Medical Image Analysis | CodeCode Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly DetectionDeep Reinforcement Learning | CodeCode Available | 1 |
| Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation | Mar 10, 2025 | Text Generation | CodeCode Available | 1 |
| Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment | Mar 10, 2025 | Domain AdaptationImage Generation | CodeCode Available | 1 |
| TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models | Mar 10, 2025 | Contrastive LearningDenoising | CodeCode Available | 1 |
| AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models | Mar 10, 2025 | Style Transfer | CodeCode Available | 1 |
| ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model Competition | Mar 10, 2025 | Board Games | CodeCode Available | 1 |
| Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning | Mar 9, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated Learning | Mar 9, 2025 | Federated Learning | CodeCode Available | 1 |
| QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation | Mar 9, 2025 | QuantizationVideo Generation | CodeCode Available | 1 |
| One-Step Diffusion Model for Image Motion-Deblurring | Mar 9, 2025 | DeblurringDenoising | CodeCode Available | 1 |
| Dynamic Updates for Language Adaptation in Visual-Language Tracking | Mar 9, 2025 | Large Language Model | CodeCode Available | 1 |
| TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos | Mar 9, 2025 | Action LocalizationBoundary Detection | CodeCode Available | 1 |
| M^3amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification | Mar 9, 2025 | Computational EfficiencyHyperspectral Image Classification | CodeCode Available | 1 |
| Online Dense Point Tracking with Streaming Memory | Mar 9, 2025 | Optical Flow EstimationPoint Tracking | CodeCode Available | 1 |