| Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction | Mar 7, 2021 | Depth EstimationDepth Prediction | CodeCode Available | 2 |
| TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials | Apr 17, 2025 | Articles | CodeCode Available | 2 |
| Unified Structure Generation for Universal Information Extraction | Mar 23, 2022 | Aspect-Based Sentiment Analysis (ABSA)UIE | CodeCode Available | 2 |
| Customization Assistant for Text-to-image Generation | Dec 5, 2023 | DescriptiveImage Generation | CodeCode Available | 2 |
| IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis | May 31, 2022 | 3D-Aware Image SynthesisImage Generation | CodeCode Available | 2 |
| PiEEG-16 to Measure 16 EEG Channels with Raspberry Pi for Brain-Computer Interfaces and EEG devices | Sep 13, 2024 | Brain Computer InterfaceEEG | CodeCode Available | 2 |
| GotenNet: Rethinking Efficient 3D Equivariant Graph Neural Networks | Apr 24, 2025 | Atomic ForcesComputational Efficiency | CodeCode Available | 2 |
| Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer | Jan 23, 2017 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Voice Separation with an Unknown Number of Multiple Speakers | Feb 29, 2020 | Speech Separation | CodeCode Available | 2 |
| Differentiable Augmentation for Data-Efficient GAN Training | Jun 18, 2020 | Image Generation | CodeCode Available | 2 |
| MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models | Sep 26, 2024 | Large Language ModelModel Compression | CodeCode Available | 2 |
| ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback | May 23, 2025 | | CodeCode Available | 2 |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Apr 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Task-Customized Mixture of Adapters for General Image Fusion | Mar 19, 2024 | Mixture-of-Experts | CodeCode Available | 2 |
| Adaptive Multi-Scale Decomposition Framework for Time Series Forecasting | Jun 6, 2024 | Computational EfficiencyData Integration | CodeCode Available | 2 |
| PyGAD: An Intuitive Genetic Algorithm Python Library | Jun 11, 2021 | | CodeCode Available | 2 |
| Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning | May 23, 2023 | Image GenerationOptical Flow Estimation | CodeCode Available | 2 |
| Modular Primitives for High-Performance Differentiable Rendering | Nov 6, 2020 | AttributeInverse Rendering | CodeCode Available | 2 |
| Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review | Jul 21, 2020 | Deep Learning | CodeCode Available | 2 |
| LidarDM: Generative LiDAR Simulation in a Generated World | Apr 3, 2024 | Autonomous DrivingPoint Cloud Generation | CodeCode Available | 2 |
| ClipCap: CLIP Prefix for Image Captioning | Nov 18, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 2 |
| End to End Learning for Self-Driving Cars | Apr 25, 2016 | Lane DetectionSelf-Driving Cars | CodeCode Available | 2 |
| Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking | Sep 8, 2021 | BenchmarkingDiversity | CodeCode Available | 2 |
| L4acados: Learning-based models for acados, applied to Gaussian process-based predictive control | Nov 28, 2024 | Computational EfficiencyGaussian Processes | CodeCode Available | 2 |
| MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Oct 10, 2024 | 3D ReconstructionDynamic Reconstruction | CodeCode Available | 2 |
| Virgo: A Preliminary Exploration on Reproducing o1-like MLLM | Jan 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Neural Speech Synthesis with Transformer Network | Sep 19, 2018 | DecoderMachine Translation | CodeCode Available | 2 |
| End-To-End Memory Networks | Mar 31, 2015 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| How Well Do Sparse Imagenet Models Transfer? | Nov 26, 2021 | Transfer Learning | CodeCode Available | 2 |
| LitLLM: A Toolkit for Scientific Literature Review | Feb 2, 2024 | RAGRetrieval | CodeCode Available | 2 |
| Adversarial Latent Autoencoders | Apr 9, 2020 | DisentanglementImage Generation | CodeCode Available | 2 |
| Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations | Dec 18, 2020 | 3D Object Detection3D Object Tracking | CodeCode Available | 2 |
| RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network | Feb 25, 2020 | | CodeCode Available | 2 |
| OCNet: Object Context Network for Scene Parsing | Sep 4, 2018 | ObjectRelation | CodeCode Available | 2 |
| BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation | Jan 1, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Recurrent Transition Networks for Character Locomotion | Oct 4, 2018 | Super-Resolution | CodeCode Available | 2 |
| Graduated Non-Convexity for Robust Spatial Perception: From Non-Minimal Solvers to Global Outlier Rejection | Sep 18, 2019 | Pose Estimation | CodeCode Available | 2 |
| ChatCell: Facilitating Single-Cell Analysis with Natural Language | Feb 13, 2024 | | CodeCode Available | 2 |
| WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes | Mar 17, 2025 | 3D Reconstruction4D reconstruction | CodeCode Available | 2 |
| SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond | May 26, 2025 | Logical ReasoningReinforcement Learning (RL) | CodeCode Available | 2 |
| OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion | Feb 27, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 2 |
| Scale Normalized Image Pyramids with AutoFocus for Object Detection | Feb 10, 2021 | Objectobject-detection | CodeCode Available | 2 |
| Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging | May 28, 2021 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Co-Occurrent Features in Semantic Segmentation | Jun 1, 2019 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections | Aug 5, 2020 | NeRF | CodeCode Available | 2 |
| MSGNet: Learning Multi-Scale Inter-Series Correlations for Multivariate Time Series Forecasting | Dec 31, 2023 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 2 |
| DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation | Jul 1, 2020 | Conversational Response GenerationResponse Generation | CodeCode Available | 2 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometryAction Generation | CodeCode Available | 2 |
| MLOmics: Cancer Multi-Omics Database for Machine Learning | Sep 2, 2024 | | CodeCode Available | 2 |
| The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks | May 24, 2017 | Image SegmentationSegmentation | CodeCode Available | 2 |