| Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation | May 25, 2024 | Autonomous Navigation, Deep Reinforcement Learning | Code Available | 2 |
| SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes | Dec 4, 2023 | Novel View Synthesis | Code Available | 2 |
| Fast ODE-based Sampling for Diffusion Models in Around 5 Steps | Nov 30, 2023 | Image Generation | Code Available | 2 |
| CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | Feb 27, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement | Mar 23, 2022 | Speech Enhancement | Code Available | 2 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Accelerated Quality-Diversity through Massive Parallelism | Feb 2, 2022 | Diversity, GPU | Code Available | 2 |
| Anomaly Detection with Conditioned Denoising Diffusion Models | May 25, 2023 | Anomaly Detection, Denoising | Code Available | 2 |
| VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance | Apr 18, 2022 | Image Generation | Code Available | 2 |
| MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | Sep 26, 2021 | Benchmarking, Decision Making | Code Available | 2 |
| S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training | Sep 13, 2024 | Quantization | Code Available | 2 |
| Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens | Jan 30, 2024 | Language Modelling | Code Available | 2 |
| Analyzing and Improving the Training Dynamics of Diffusion Models | Dec 5, 2023 | Image Generation, Philosophy | Code Available | 2 |
| Spectral-Refiner: Accurate Fine-Tuning of Spatiotemporal Fourier Neural Operator for Turbulent Flows | May 27, 2024 | Computational Efficiency, De-aliasing | Code Available | 2 |
| DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation | Jun 22, 2022 | Benchmarking, Recommendation Systems | Code Available | 2 |
| Dynamic Snake Convolution based on Topological Geometric Constraints for Tubular Structure Segmentation | Jul 17, 2023 | Segmentation, Specificity | Code Available | 2 |
| AnomalyDINO: Boosting Patch-based Few-shot Anomaly Detection with DINOv2 | May 23, 2024 | Anomaly Detection, Anomaly Segmentation | Code Available | 2 |
| Reaction-conditioned De Novo Enzyme Design with GENzyme | Nov 10, 2024 | | Code Available | 2 |
| PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization | Jun 8, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation | Jan 1, 2024 | General Knowledge, Navigate | Code Available | 2 |
| Mist: Towards Improved Adversarial Examples for Diffusion Models | May 22, 2023 | Adversarial Defense | Code Available | 2 |
| Improve Vision Language Model Chain-of-thought Reasoning | Oct 21, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Fast and Accurate Blind Flexible Docking | Feb 20, 2025 | Blind Docking, Computational Efficiency | Code Available | 2 |
| LaMAR: Benchmarking Localization and Mapping for Augmented Reality | Oct 19, 2022 | Benchmarking, Diversity | Code Available | 2 |
| The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA | Feb 28, 2024 | Natural Language Understanding, Question Answering | Code Available | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact Checking, Form | Code Available | 2 |
| Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Feb 6, 2024 | 3D Object Detection, Denoising | Code Available | 2 |
| ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation | Jul 19, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| Tucano: Advancing Neural Text Generation for Portuguese | Nov 12, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning | Jun 21, 2023 | Sentence, Text Generation | Code Available | 2 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Oct 1, 2022 | Grammatical Error Correction, Sentence | Code Available | 2 |
| SwinFuSR: an image fusion-inspired model for RGB-guided thermal image super-resolution | Apr 22, 2024 | Image Super-Resolution, SSIM | Code Available | 2 |
| SCIMAP: A Python Toolkit for Integrated Spatial Analysis of Multiplexed Imaging Data | May 3, 2024 | | Code Available | 2 |
| EditWorld: Simulating World Dynamics for Instruction-Following Image Editing | May 23, 2024 | Instruction Following | Code Available | 2 |
| Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling | Aug 30, 2024 | Chunking | Code Available | 2 |
| Robust Synthetic-to-Real Transfer for Stereo Matching | Mar 12, 2024 | Domain Generalization, Pseudo Label | Code Available | 2 |
| Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents | Feb 17, 2024 | Backdoor Attack, backdoor defense | Code Available | 2 |
| Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management | Sep 20, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides | Apr 9, 2022 | GPU | Code Available | 2 |
| pyVHR: a Python framework for remote photoplethysmography | Apr 15, 2022 | GPU, Heart rate estimation | Code Available | 2 |
| Agentic Knowledgeable Self-awareness | Apr 4, 2025 | Decision Making | Code Available | 2 |
| GS-IR: 3D Gaussian Splatting for Inverse Rendering | Nov 26, 2023 | Inverse Rendering, NeRF | Code Available | 2 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Sep 27, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2 |
| An Unforgeable Publicly Verifiable Watermark for Large Language Models | Jul 30, 2023 | Computational Efficiency | Code Available | 2 |
| Scene-Centric Unsupervised Panoptic Segmentation | Apr 2, 2025 | Instance Segmentation, Panoptic Segmentation | Code Available | 2 |
| BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Aug 12, 2024 | Data Augmentation, Virtual Try-on | Code Available | 2 |
| Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping | Jan 31, 2025 | 3DGS, Novel View Synthesis | Code Available | 2 |
| SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp and motion optimization through diffusion | Sep 8, 2022 | Motion Planning, Robot Manipulation | Code Available | 2 |
| Unified Generative Modeling of 3D Molecules via Bayesian Flow Networks | Mar 17, 2024 | 3D Molecule Generation | Code Available | 2 |
| Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal Classification | Jun 15, 2022 | Lane Detection, Ordinal Classification | Code Available | 2 |