| SSD: Single Shot MultiBox Detector | Dec 8, 2015 | LIDAR Semantic Segmentation, Low-Light Image Enhancement | Code Available | 2 | 5 |
| Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Apr 1, 2024 | Cloud Removal, Diversity | Code Available | 2 | 5 |
| Text2Reward: Reward Shaping with Language Models for Reinforcement Learning | Sep 20, 2023 | MuJoCo, reinforcement-learning | Code Available | 2 | 5 |
| A New Outlier Removal Strategy Based on Reliability of Correspondence Graph for Fast Point Cloud Registration | May 16, 2022 | Point Cloud Registration | Code Available | 2 | 5 |
| Self-supervised Dataset Distillation: A Good Compression Is All You Need | Apr 11, 2024 | All, Dataset Distillation | Code Available | 2 | 5 |
| Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution | Apr 17, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| Towards Real-World Visual Tracking with Temporal Contexts | Aug 20, 2023 | Visual Tracking | Code Available | 2 | 5 |
| Evaluating Object Hallucination in Large Vision-Language Models | May 17, 2023 | Hallucination, Object | Code Available | 2 | 5 |
| Adaptable Logical Control for Large Language Models | Jun 19, 2024 | Math, Text Generation | Code Available | 2 | 5 |
| RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks | Jan 17, 2024 | Computational Efficiency, Time Series | Code Available | 2 | 5 |
| In-BoXBART: Get Instructions into Biomedical Multi-Task Learning | Apr 15, 2022 | Few-Shot Learning, Multi-Task Learning | Code Available | 2 | 5 |
| Vehicle: Bridging the Embedding Gap in the Verification of Neuro-Symbolic Programs | Jan 12, 2024 | | Code Available | 2 | 5 |
| OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement | Mar 21, 2025 | Multimodal Reasoning, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| PaPaGei: Open Foundation Models for Optical Physiological Signals | Oct 27, 2024 | Contrastive Learning, Domain Generalization | Code Available | 2 | 5 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | May 11, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Attack-Resilient Image Watermarking Using Stable Diffusion | Jan 8, 2024 | Denoising | Code Available | 2 | 5 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Nov 24, 2023 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| Semantic Guidance Tuning for Text-To-Image Diffusion Models | Dec 26, 2023 | Zero-shot Generalization | Code Available | 2 | 5 |
| Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion | Mar 25, 2022 | Diversity, Pedestrian Trajectory Prediction | Code Available | 2 | 5 |
| DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Sep 17, 2022 | Motion Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| Zero-Shot Tokenizer Transfer | May 13, 2024 | XLM-R | Code Available | 2 | 5 |
| BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra | Feb 27, 2024 | Question Answering | Code Available | 2 | 5 |
| Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds | Mar 21, 2022 | All, GPU | Code Available | 2 | 5 |
| BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework | May 27, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| RelPose++: Recovering 6D Poses from Sparse-view Observations | May 8, 2023 | 3D Reconstruction, Pose Estimation | Code Available | 2 | 5 |
| Block-Recurrent Transformers | Mar 11, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Global birdsong embeddings enable superior transfer learning for bioacoustic classification | Jul 12, 2023 | Audio Classification, Decision Making | Code Available | 2 | 5 |
| Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture | Jan 19, 2023 | Depth Estimation, Depth Prediction | Code Available | 2 | 5 |
| MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Mar 3, 2025 | Object Detection | Code Available | 2 | 5 |
| TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks | Sep 16, 2020 | Anomaly Detection, Benchmarking | Code Available | 2 | 5 |
| When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Jul 14, 2024 | 3D Object Detection, Multispectral Object Detection | Code Available | 2 | 5 |
| Differentiable All-pole Filters for Time-varying Audio Systems | Apr 11, 2024 | All, Audio Effects Modeling | Code Available | 2 | 5 |
| TorchOpt: An Efficient Library for Differentiable Optimization | Nov 13, 2022 | CPU, GPU | Code Available | 2 | 5 |
| Progressive Growing of GANs for Improved Quality, Stability, and Variation | Oct 27, 2017 | Face Generation, Image Generation | Code Available | 2 | 5 |
| CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities | Aug 23, 2024 | Denoising, Motion Generation | Code Available | 2 | 5 |
| ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization | May 16, 2025 | All, DeepFake Detection | Code Available | 2 | 5 |
| Cross-Task Generalization via Natural Language Crowdsourcing Instructions | Apr 18, 2021 | Question Answering | Code Available | 2 | 5 |
| MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control | Jan 4, 2025 | Attribute, Denoising | Code Available | 2 | 5 |
| From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization | Mar 2, 2025 | Cross-Modal Person Re-Identification, Person Re-Identification | Code Available | 2 | 5 |
| Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Jun 9, 2025 | Large Language Model, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery | Jan 3, 2025 | object-detection, Object Detection | Code Available | 2 | 5 |
| RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image Segmentation | Jan 14, 2025 | Computational Efficiency, Image Segmentation | Code Available | 2 | 5 |
| Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | Oct 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Mar 26, 2023 | Decoder, Panoptic Segmentation | Code Available | 2 | 5 |
| GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation | Dec 4, 2023 | Novel View Synthesis | Code Available | 2 | 5 |
| Supervised Learning for Analog and RF Circuit Design: Benchmarks and Comparative Insights | Jan 21, 2025 | | Code Available | 2 | 5 |
| Synchformer: Efficient Synchronization from Sparse Cues | Jan 29, 2024 | Audio-Visual Synchronization | Code Available | 2 | 5 |
| GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation | Nov 29, 2023 | 3D Generation, Text to 3D | Code Available | 2 | 5 |
| Implicit Diffusion Models for Continuous Super-Resolution | Mar 29, 2023 | Denoising, Image Super-Resolution | Code Available | 2 | 5 |
| Spherical Transformer for LiDAR-based 3D Recognition | Mar 22, 2023 | 3D Object Detection, 3D Semantic Segmentation | Code Available | 2 | 5 |