| Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples | Feb 9, 2023 | | CodeCode Available | 2 |
| DaCapo: a modular deep learning framework for scalable 3D image segmentation | Aug 5, 2024 | Image SegmentationManagement | CodeCode Available | 2 |
| OmniSat: Self-Supervised Modality Fusion for Earth Observation | Apr 12, 2024 | DiversityEarth Observation | CodeCode Available | 2 |
| Datrics Text2SQL. A Framework for Natural Language to SQL Query Generation | Mar 15, 2025 | Natural Language QueriesRAG | CodeCode Available | 2 |
| 3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models | Jan 26, 2023 | 3D Shape RepresentationPoint Cloud Completion | CodeCode Available | 2 |
| RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Mar 13, 2025 | Computational EfficiencyMamba | CodeCode Available | 2 |
| An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Mar 21, 2023 | 3D Single Object TrackingAutonomous Driving | CodeCode Available | 2 |
| Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective | Mar 24, 2025 | Building Damage AssessmentChange Detection | CodeCode Available | 2 |
| PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning | Jan 22, 2022 | Contrastive LearningPartial Label Learning | CodeCode Available | 2 |
| Generative AI for Medical Imaging: extending the MONAI Framework | Jul 27, 2023 | Anomaly DetectionDenoising | CodeCode Available | 2 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Nov 13, 2024 | 3DGSCamera Localization | CodeCode Available | 2 |
| A Survey on LLM Inference-Time Self-Improvement | Dec 18, 2024 | Survey | CodeCode Available | 2 |
| Toward AI-Driven Digital Organism: Multiscale Foundation Models for Predicting, Simulating and Programming Biology at All Levels | Dec 9, 2024 | All | CodeCode Available | 2 |
| Language-only Training of Zero-shot Composed Image Retrieval | Jan 1, 2024 | Image RetrievalRetrieval | CodeCode Available | 2 |
| Generalizable, Fast, and Accurate DeepQSPR with fastprop | Apr 2, 2024 | Molecular Property PredictionProperty Prediction | CodeCode Available | 2 |
| T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings | Jun 27, 2024 | Cross-Lingual TransferTransfer Learning | CodeCode Available | 2 |
| SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories | Mar 11, 2025 | Decision MakingInteractive Segmentation | CodeCode Available | 2 |
| Denoising Diffusion Implicit Models | Oct 6, 2020 | DenoisingImage Generation | CodeCode Available | 2 |
| PyBOP: A Python package for battery model optimisation and parameterisation | Dec 20, 2024 | | CodeCode Available | 2 |
| FastFlows: Flow-Based Models for Molecular Graph Generation | Jan 28, 2022 | Graph GenerationMolecular Graph Generation | CodeCode Available | 2 |
| Intrinsic Image Decomposition via Ordinal Shading | Nov 21, 2023 | Intrinsic Image DecompositionInverse Rendering | CodeCode Available | 2 |
| Towards Real-Time Multi-Object Tracking | Sep 27, 2019 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 |
| PyRetri: A PyTorch-based Library for Unsupervised Image Retrieval by Deep Convolutional Neural Networks | May 2, 2020 | Content-Based Image RetrievalDeep Learning | CodeCode Available | 2 |
| PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS | Feb 24, 2023 | Decodertext-to-speech | CodeCode Available | 2 |
| Sparse Global Matching for Video Frame Interpolation with Large Motion | Apr 10, 2024 | Video Frame Interpolation | CodeCode Available | 2 |
| Detecting Pretraining Data from Large Language Models | Oct 25, 2023 | Machine Unlearning | CodeCode Available | 2 |
| RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models | Jul 6, 2024 | Medical DiagnosisRAG | CodeCode Available | 2 |
| WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents | Jul 4, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |
| Fast Algorithms for Convolutional Neural Networks | Sep 30, 2015 | GPUPedestrian Detection | CodeCode Available | 2 |
| Reasoning to Learn from Latent Thoughts | Mar 24, 2025 | MathText Generation | CodeCode Available | 2 |
| Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching | Mar 7, 2025 | | CodeCode Available | 2 |
| Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding | Jan 9, 2024 | Fact VerificationIn-Context Learning | CodeCode Available | 2 |
| TensorFlow Distributions | Nov 28, 2017 | Deep LearningProbabilistic Programming | CodeCode Available | 2 |
| GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism | Nov 16, 2018 | Fine-Grained Image Classificationimage-classification | CodeCode Available | 2 |
| Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications | Nov 15, 2022 | Physics-informed machine learning | CodeCode Available | 2 |
| Enhancing Blind Video Quality Assessment with Rich Quality-aware Features | May 14, 2024 | Video Quality Assessment | CodeCode Available | 2 |
| Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Nov 6, 2024 | 3DGSNeRF | CodeCode Available | 2 |
| An Expression Tree Decoding Strategy for Mathematical Equation Generation | Oct 14, 2023 | MathMathematical Reasoning | CodeCode Available | 2 |
| Interactive Language: Talking to Robots in Real Time | Oct 12, 2022 | | CodeCode Available | 2 |
| Cones 2: Customizable Image Synthesis with Multiple Subjects | May 30, 2023 | Image Generation | CodeCode Available | 2 |
| Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | Jul 7, 2022 | ObjectOpen Vocabulary Attribute Detection | CodeCode Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptationimage-classification | CodeCode Available | 2 |
| Mixture of Diffusers for scene composition and high resolution image generation | Feb 5, 2023 | Image GenerationVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Jul 2, 2023 | Biomedical Information RetrievalContrastive Learning | CodeCode Available | 2 |
| Dataset Distillation by Matching Training Trajectories | Mar 22, 2022 | Dataset DistillationDataset Distillation - 1IPC | CodeCode Available | 2 |
| Physics-inform attention temporal convolutional network for EEG-based motor imagery classification | Aug 1, 2022 | Brain Computer InterfaceEEG | CodeCode Available | 2 |
| Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning | Aug 24, 2021 | CPUGPU | CodeCode Available | 2 |
| LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models | Oct 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning | Feb 7, 2025 | | CodeCode Available | 2 |