| DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory | Aug 16, 2023 | Trajectory ModelingVideo Generation | CodeCode Available | 2 |
| MOMENT: A Family of Open Time-series Foundation Models | Feb 6, 2024 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding | May 18, 2023 | 3D Classification3D Shape Representation | CodeCode Available | 2 |
| ConFIG: Towards Conflict-free Training of Physics Informed Neural Networks | Aug 20, 2024 | | CodeCode Available | 2 |
| Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | Feb 21, 2025 | 3D Anomaly Detection3D Anomaly Detection and Segmentation | CodeCode Available | 2 |
| CODA: Repurposing Continuous VAEs for Discrete Tokenization | Mar 22, 2025 | | CodeCode Available | 2 |
| SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity | Mar 26, 2025 | Test-time Adaptation | CodeCode Available | 2 |
| Optimisation & Generalisation in Networks of Neurons | Oct 18, 2022 | | CodeCode Available | 2 |
| Leveraging medical Twitter to build a visual–language foundation model for pathology AI | Apr 1, 2023 | Transfer Learning | CodeCode Available | 2 |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Mar 15, 2024 | EgoSchemaForm | CodeCode Available | 2 |
| Instant Volumetric Head Avatars | Nov 22, 2022 | Face ModelGPU | CodeCode Available | 2 |
| Efficient and Effective SPARQL Autocompletion on Very Large Knowledge Graphs | Oct 17, 2022 | Knowledge Graphs | CodeCode Available | 2 |
| Proximal Policy Optimization Algorithms | Jul 20, 2017 | Continuous ControlDota 2 | CodeCode Available | 2 |
| CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Mar 24, 2024 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy | Mar 25, 2025 | DenoisingRobot Manipulation | CodeCode Available | 2 |
| Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis | Apr 26, 2023 | Speech Synthesistext-to-speech | CodeCode Available | 2 |
| Directly Fine-Tuning Diffusion Models on Differentiable Rewards | Sep 29, 2023 | | CodeCode Available | 2 |
| A Dataset and Explorer for 3D Signed Distance Functions | Apr 27, 2022 | GPU | CodeCode Available | 2 |
| U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Jan 9, 2024 | Cell SegmentationImage Segmentation | CodeCode Available | 2 |
| Semantic Photo Manipulation with a Generative Image Prior | May 15, 2020 | | CodeCode Available | 2 |
| VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition | Sep 9, 2020 | CPUspeech-recognition | CodeCode Available | 2 |
| Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | May 2, 2024 | Decoder | CodeCode Available | 2 |
| FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence | Jan 21, 2020 | Image ClassificationPseudo Label | CodeCode Available | 2 |
| The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks | Feb 12, 2025 | | CodeCode Available | 2 |
| Freeing Hybrid Distributed AI Training Configuration | Aug 20, 2021 | | CodeCode Available | 2 |
| Omnipose: a high-precision, morphology-independent solution for bacterial cell segmentation | Nov 5, 2021 | Cell SegmentationVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Jul 26, 2024 | Knowledge DistillationQuestion Answering | CodeCode Available | 2 |
| CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge | Feb 12, 2024 | General KnowledgeMultiple-choice | CodeCode Available | 2 |
| Enhancing Video Super-Resolution via Implicit Resampling-based Alignment | Apr 29, 2023 | Super-ResolutionVideo Super-Resolution | CodeCode Available | 2 |
| Algorithm Evolution Using Large Language Model | Nov 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Efficient Neural Network Analysis with Sum-of-Infeasibilities | Mar 19, 2022 | Adversarial AttackEfficient Neural Network | CodeCode Available | 2 |
| SyncTweedies: A General Generative Framework Based on Synchronized Diffusions | Mar 21, 2024 | Denoising | CodeCode Available | 2 |
| Visual Programming: Compositional visual reasoning without training | Nov 18, 2022 | In-Context LearningQuestion Answering | CodeCode Available | 2 |
| CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | Dec 2, 2022 | 3D Object TrackingAutonomous Vehicles | CodeCode Available | 2 |
| Interactive Differentiable Simulation | May 26, 2019 | Model Predictive Controlparameter estimation | CodeCode Available | 2 |
| Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour | Jun 8, 2017 | Stochastic Optimization | CodeCode Available | 2 |
| Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images | May 24, 2022 | 3D geometryDepth Estimation | CodeCode Available | 2 |
| Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness | Jun 1, 2022 | CPUdocument understanding | CodeCode Available | 2 |
| SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries | Apr 10, 2023 | Dense Video CaptioningVideo Captioning | CodeCode Available | 2 |
| T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation | Jul 12, 2023 | AttributeImage Generation | CodeCode Available | 2 |
| Controlling Text-to-Image Diffusion by Orthogonal Finetuning | Jun 12, 2023 | | CodeCode Available | 2 |
| DreamColour: Controllable Video Colour Editing without Training | Dec 6, 2024 | Instance SegmentationSemantic Segmentation | CodeCode Available | 2 |
| PowerSimulationsDynamics.jl -- An Open Source Modeling Package for Modern Power Systems with Inverter-Based Resources | Aug 5, 2023 | | CodeCode Available | 2 |
| LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Aug 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | ArticlesHallucination | CodeCode Available | 2 |
| Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs | Jun 22, 2024 | HallucinationUncertainty Quantification | CodeCode Available | 2 |
| Distillation Enhanced Generative Retrieval | Feb 16, 2024 | RetrievalText Retrieval | CodeCode Available | 2 |
| Any-point Trajectory Modeling for Policy Learning | Dec 28, 2023 | Trajectory ModelingTransfer Learning | CodeCode Available | 2 |
| GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer | Nov 14, 2023 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 2 |
| Teeth3DS+: An Extended Benchmark for Intraoral 3D Scans Analysis | Oct 12, 2022 | 3D Part SegmentationSegmentation | CodeCode Available | 2 |