| Skeleton-Aware Networks for Deep Motion Retargeting | May 12, 2020 | motion retargetingMotion Synthesis | CodeCode Available | 2 |
| Fixed Point Diffusion Models | Jan 16, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement | Apr 29, 2022 | Image EnhancementPhoto Retouching | CodeCode Available | 2 |
| VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning | Apr 10, 2025 | MathMultimodal Reasoning | CodeCode Available | 2 |
| Socially-Aware Self-Supervised Tri-Training for Recommendation | Jun 7, 2021 | Contrastive LearningRecommendation Systems | CodeCode Available | 2 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Enhancing LLM Reasoning with Reward-guided Tree Search | Nov 18, 2024 | Mathematical Reasoning | CodeCode Available | 2 |
| SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | May 8, 2025 | 3DGSData Augmentation | CodeCode Available | 2 |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Sep 1, 2023 | modelScene Generation | CodeCode Available | 2 |
| Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic | Apr 10, 2024 | GPU | CodeCode Available | 2 |
| Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2 | May 24, 2024 | Data AugmentationDiversity | CodeCode Available | 2 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image ClassificationDocument Layout Analysis | CodeCode Available | 2 |
| Twelve years of SAMtools and BCFtools | Dec 18, 2020 | | CodeCode Available | 2 |
| EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics | Oct 1, 2024 | | CodeCode Available | 2 |
| Deep Long-Tailed Learning: A Survey | Oct 9, 2021 | Survey | CodeCode Available | 2 |
| Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Oct 21, 2019 | Data AugmentationNatural Language Understanding | CodeCode Available | 2 |
| ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing | Dec 19, 2024 | Mixture-of-Experts | CodeCode Available | 2 |
| HumanMM: Global Human Motion Recovery from Multi-shot Videos | Mar 10, 2025 | Camera Pose EstimationMotion Generation | CodeCode Available | 2 |
| TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation | Oct 12, 2020 | Sign Language RecognitionSign Language Translation | CodeCode Available | 2 |
| An Overview of Deep Semi-Supervised Learning | Jun 9, 2020 | Deep Learningimage-classification | CodeCode Available | 2 |
| Monster Mash: A Single-View Approach to Casual 3D Modeling and Animation | Dec 1, 2020 | Image Generation | CodeCode Available | 2 |
| Deep Portfolio Theory | May 23, 2016 | | CodeCode Available | 2 |
| A System for Real-Time Interactive Analysis of Deep Learning Training | Jan 5, 2020 | 3D Action RecognitionDiagnostic | CodeCode Available | 2 |
| Context Encoding for Semantic Segmentation | Mar 23, 2018 | image-classificationImage Classification | CodeCode Available | 2 |
| ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation | Jun 1, 2023 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence | Apr 8, 2020 | Sentence EmbeddingsTopic Models | CodeCode Available | 2 |
| SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search | Nov 5, 2021 | | CodeCode Available | 2 |
| Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly | Jan 1, 2024 | Anomaly Detection | CodeCode Available | 2 |
| PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jul 8, 2024 | Autonomous DrivingImage Generation | CodeCode Available | 2 |
| Towards Unifying Feature Attribution and Counterfactual Explanations: Different Means to the Same End | Nov 10, 2020 | Causal Inferencecounterfactual | CodeCode Available | 2 |
| Neural Texture Extraction and Distribution for Controllable Person Image Synthesis | Apr 13, 2022 | Image Generation | CodeCode Available | 2 |
| AMC: AutoML for Model Compression and Acceleration on Mobile Devices | Feb 10, 2018 | AutoMLGPU | CodeCode Available | 2 |
| What do we learn from inverting CLIP models? | Mar 5, 2024 | | CodeCode Available | 2 |
| EasyTPP: Towards Open Benchmarking Temporal Point Processes | Jul 16, 2023 | BenchmarkingPoint Processes | CodeCode Available | 2 |
| Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents | Feb 3, 2023 | MinecraftTask Planning | CodeCode Available | 2 |
| Technique Inference Engine: A Recommender Model to Support Cyber Threat Hunting | Mar 4, 2025 | | CodeCode Available | 2 |
| Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Nov 29, 2023 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| Barbershop: GAN-based Image Compositing using Segmentation Masks | Jun 2, 2021 | | CodeCode Available | 2 |
| On the limits of cross-domain generalization in automated X-ray prediction | Feb 6, 2020 | DiagnosticDomain Generalization | CodeCode Available | 2 |
| jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models | Mar 4, 2020 | Transfer Learning | CodeCode Available | 2 |
| Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation | Sep 22, 2021 | Image-to-Image TranslationTalking Face Generation | CodeCode Available | 2 |
| SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy | Jan 1, 2023 | | CodeCode Available | 2 |
| Predicting COVID-19 Pneumonia Severity on Chest X-ray with Deep Learning | May 24, 2020 | Management | CodeCode Available | 2 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Dec 6, 2024 | Objectobject-detection | CodeCode Available | 2 |
| PermutoSDF: Fast Multi-View Reconstruction with Implicit Surfaces using Permutohedral Lattices | Nov 22, 2022 | | CodeCode Available | 2 |
| A Critical Evaluation of AI Feedback for Aligning Large Language Models | Feb 19, 2024 | Instruction Followingreinforcement-learning | CodeCode Available | 2 |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Feb 16, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Russian Paraphrasers: Paraphrase with Transformers | Apr 1, 2021 | | CodeCode Available | 2 |
| Diffusion Models for Reinforcement Learning: A Survey | Nov 2, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Online Deep Clustering for Unsupervised Representation Learning | Jun 18, 2020 | ClusteringDeep Clustering | CodeCode Available | 2 |