| GS-IR: 3D Gaussian Splatting for Inverse Rendering | Nov 26, 2023 | Inverse RenderingNeRF | CodeCode Available | 2 | 5 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Sep 27, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 | 5 |
| An Unforgeable Publicly Verifiable Watermark for Large Language Models | Jul 30, 2023 | Computational Efficiency | CodeCode Available | 2 | 5 |
| Scene-Centric Unsupervised Panoptic Segmentation | Apr 2, 2025 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 | 5 |
| BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Aug 12, 2024 | Data AugmentationVirtual Try-on | CodeCode Available | 2 | 5 |
| Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Jan 31, 2025 | 3DGSNovel View Synthesis | CodeCode Available | 2 | 5 |
| SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp and motion optimization through diffusion | Sep 8, 2022 | Motion PlanningRobot Manipulation | CodeCode Available | 2 | 5 |
| Unified Generative Modeling of 3D Molecules via Bayesian Flow Networks | Mar 17, 2024 | 3D Molecule Generation | CodeCode Available | 2 | 5 |
| Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal Classification | Jun 15, 2022 | Lane DetectionOrdinal Classification | CodeCode Available | 2 | 5 |
| MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training | May 31, 2023 | Language ModellingQuantization | CodeCode Available | 2 | 5 |
| MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video | Mar 2, 2022 | 3D Human Pose EstimationClassification | CodeCode Available | 2 | 5 |
| Gradient Boosting Reinforcement Learning | Jul 11, 2024 | GPUreinforcement-learning | CodeCode Available | 2 | 5 |
| AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Feb 21, 2025 | Model Discovery | CodeCode Available | 2 | 5 |
| CompGS: Smaller and Faster Gaussian Splatting with Vector Quantization | Nov 30, 2023 | 3DGSNeRF | CodeCode Available | 2 | 5 |
| Diffusion Actor-Critic with Entropy Regulator | May 24, 2024 | Decision MakingMuJoCo | CodeCode Available | 2 | 5 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze TestDecoder | CodeCode Available | 2 | 5 |
| V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding | Dec 12, 2024 | Position | CodeCode Available | 2 | 5 |
| Exploration-Driven Generative Interactive Environments | Apr 3, 2025 | | CodeCode Available | 2 | 5 |
| PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Nov 26, 2024 | DenoisingImage Super-Resolution | CodeCode Available | 2 | 5 |
| Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation | Apr 2, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 | 5 |
| FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space | May 3, 2024 | Facial Expression RecognitionFacial Expression Recognition (FER) | CodeCode Available | 2 | 5 |
| DreamGaussian4D: Generative 4D Gaussian Splatting | Dec 28, 2023 | Video Generation | CodeCode Available | 2 | 5 |
| GraphKAN: Enhancing Feature Extraction with Graph Kolmogorov Arnold Networks | Jun 19, 2024 | Kolmogorov-Arnold Networks | CodeCode Available | 2 | 5 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Mar 31, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 | 5 |
| Referring to Any Person | Mar 11, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 2 | 5 |
| Training Language Models to Self-Correct via Reinforcement Learning | Sep 19, 2024 | HumanEvalMath | CodeCode Available | 2 | 5 |
| GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning | Feb 3, 2024 | Link PredictionNode Classification | CodeCode Available | 2 | 5 |
| Training on test proteins improves fitness, structure, and function prediction | Nov 4, 2024 | PredictionProtein Structure Prediction | CodeCode Available | 2 | 5 |
| mGPT: Few-Shot Learners Go Multilingual | Apr 15, 2022 | Cross-Lingual Natural Language InferenceCross-Lingual Paraphrase Identification | CodeCode Available | 2 | 5 |
| Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | May 30, 2024 | Semantic CommunicationVideo Compression | CodeCode Available | 2 | 5 |
| TimeFilter: Patch-Specific Spatial-Temporal Graph Filtration for Time Series Forecasting | Jan 22, 2025 | ClusteringTime Series | CodeCode Available | 2 | 5 |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | May 3, 2023 | Instance SegmentationObject | CodeCode Available | 2 | 5 |
| YOLOPv2: Better, Faster, Stronger for Panoptic Driving Perception | Aug 24, 2022 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 | 5 |
| CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions | May 24, 2025 | Benchmarking | CodeCode Available | 2 | 5 |
| Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Jun 14, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 | 5 |
| A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations | Jun 19, 2024 | Benchmarking | CodeCode Available | 2 | 5 |
| MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Mar 13, 2024 | 3D geometryAutonomous Vehicles | CodeCode Available | 2 | 5 |
| Self-Supervised Any-Point Tracking by Contrastive Random Walks | Sep 24, 2024 | Contrastive LearningData Augmentation | CodeCode Available | 2 | 5 |
| Click-Calib: A Robust Extrinsic Calibration Method for Surround-View Systems | Jan 2, 2025 | | CodeCode Available | 2 | 5 |
| ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Mar 11, 2024 | Question Answering | CodeCode Available | 2 | 5 |
| Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion | Feb 12, 2025 | | CodeCode Available | 2 | 5 |
| Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Apr 20, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 | 5 |
| MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks | Dec 29, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 2 | 5 |
| Generating Long Semantic IDs in Parallel for Recommendation | Jun 6, 2025 | | CodeCode Available | 2 | 5 |
| Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Jun 22, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Feb 28, 2024 | Semantic SegmentationTAG | CodeCode Available | 2 | 5 |
| Three New Validators and a Large-Scale Benchmark Ranking for Unsupervised Domain Adaptation | Aug 15, 2022 | Domain AdaptationUnsupervised Domain Adaptation | CodeCode Available | 2 | 5 |
| LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Feb 13, 2024 | BenchmarkingModel Selection | CodeCode Available | 2 | 5 |
| Learning from All Vehicles | Mar 22, 2022 | AllAutonomous Driving | CodeCode Available | 2 | 5 |
| LambdaNetworks: Modeling Long-Range Interactions Without Attention | Feb 17, 2021 | image-classificationImage Classification | CodeCode Available | 2 | 5 |