| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | Code Available | 2 |
| Revisiting BPR: A Replicability Study of a Common Recommender System Baseline | Sep 21, 2024 | Collaborative Filtering, Recommendation Systems | Code Available | 2 |
| Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | May 1, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing | Jul 13, 2020 | Dimensionality Reduction, Protein Secondary Structure Prediction | Code Available | 2 |
| Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning | Jun 17, 2022 | Few-Shot Learning, Offline RL | Code Available | 2 |
| One Fits All: Power General Time Series Analysis by Pretrained LM | Feb 23, 2023 | Anomaly Detection, Few-Shot Learning | Code Available | 2 |
| DataMap: A Portable Application for Visualizing High-Dimensional Data | Apr 11, 2025 | | Code Available | 2 |
| What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams | Sep 28, 2020 | MedQA, Multiple-choice | Code Available | 2 |
| BERN2: an advanced neural biomedical named entity recognition and normalization tool | Jan 6, 2022 | graph construction, named-entity-recognition | Code Available | 2 |
| Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Jun 7, 2024 | Audio Synthesis | Code Available | 2 |
| Deep learning for time series classification | Oct 1, 2020 | Activity Recognition, Classification | Code Available | 2 |
| Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective | Feb 5, 2024 | Anomaly Detection, Time Series | Code Available | 2 |
| When Do Program-of-Thoughts Work for Reasoning? | Aug 29, 2023 | Code Generation, Mathematical Reasoning | Code Available | 2 |
| EDM: Efficient Deep Feature Matching | Mar 7, 2025 | | Code Available | 2 |
| StructGPT: A General Framework for Large Language Model to Reason over Structured Data | May 16, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| A Tensor Compiler for Unified Machine Learning Prediction Serving | Oct 9, 2020 | BIG-bench Machine Learning, CPU | Code Available | 2 |
| End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames | Nov 28, 2023 | Action Detection, Temporal Action Localization | Code Available | 2 |
| Large Language Models are Zero-Shot Reasoners | May 24, 2022 | Arithmetic Reasoning, Common Sense Reasoning | Code Available | 2 |
| Guided Saliency Feature Learning for Person Re-identification in Crowded Scenes | Aug 1, 2020 | Person Re-Identification | Code Available | 2 |
| GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music | Oct 11, 2020 | Information Retrieval, Music Information Retrieval | Code Available | 2 |
| RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning | May 25, 2022 | reinforcement-learning, Reinforcement Learning | Code Available | 2 |
| SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering | Mar 15, 2025 | Scene Generation, Video Generation | Code Available | 2 |
| LLM Attributor: Interactive Visual Attribution for LLM Generation | Apr 1, 2024 | Articles, Attribute | Code Available | 2 |
| A Survey on RGB-D Datasets | Jan 15, 2022 | Depth Estimation, Monocular Depth Estimation | Code Available | 2 |
| Robust Self-Supervised Audio-Visual Speech Recognition | Jan 5, 2022 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 2 |
| Audio-Visual Segmentation | Jul 11, 2022 | Segmentation | Code Available | 2 |
| Intriguing Properties of Contrastive Losses | Nov 5, 2020 | Contrastive Learning, Data Augmentation | Code Available | 2 |
| Learning to summarize with human feedback | Dec 1, 2020 | Articles | Code Available | 2 |
| GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis | Apr 9, 2024 | Image Generation, Zero-shot Generalization | Code Available | 2 |
| Learning an Animatable Detailed 3D Face Model from In-The-Wild Images | Dec 7, 2020 | 3D Face Alignment, 3D Face Animation | Code Available | 2 |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial Optimization, Decision Making | Code Available | 2 |
| Improved StyleGAN Embedding: Where are the Good Latents? | Dec 13, 2020 | Diversity | Code Available | 2 |
| Few-Shot Text Generation with Pattern-Exploiting Training | Dec 22, 2020 | Headline Generation, text-classification | Code Available | 2 |
| CodeT: Code Generation with Generated Tests | Jul 21, 2022 | Code Generation, HumanEval | Code Available | 2 |
| VinVL: Revisiting Visual Representations in Vision-Language Models | Jan 2, 2021 | Image Captioning, Image-text matching | Code Available | 2 |
| MolScribe: Robust Molecular Structure Recognition with Image-To-Graph Generation | May 28, 2022 | Data Augmentation, Graph Generation | Code Available | 2 |
| Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting | Jan 28, 2021 | Multivariate Time Series Forecasting, Probabilistic Time Series Forecasting | Code Available | 2 |
| Differentially Private Synthetic Data via Foundation Model APIs 2: Text | Mar 4, 2024 | Privacy Preserving | Code Available | 2 |
| Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue | Aug 7, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| CodeR: Issue Resolving with Multi-Agent and Task Graphs | Jun 3, 2024 | Bug fixing | Code Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classification, Image Classification | Code Available | 2 |
| RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup | Oct 20, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control | Mar 3, 2021 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 2 |
| OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association | Mar 3, 2021 | Car Pose Estimation, Keypoint Detection | Code Available | 2 |
| MedViT: A Robust Vision Transformer for Generalized Medical Image Classification | Feb 19, 2023 | image-classification, Image Classification | Code Available | 2 |
| FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores | Nov 10, 2023 | | Code Available | 2 |
| H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Sep 21, 2023 | | Code Available | 2 |
| Training-free CryoET Tomogram Segmentation | Jul 8, 2024 | Contrastive Learning, Cryogenic Electron Tomography | Code Available | 2 |
| Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Jul 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation | Mar 18, 2021 | Object | Code Available | 2 |