| A Neural Symbolic Model for Space Physics | Mar 11, 2025 | Large Language Modelmodel | CodeCode Available | 2 |
| OpenPrompt: An Open-source Framework for Prompt-learning | Nov 3, 2021 | Prompt Learning | CodeCode Available | 2 |
| Conditional Image Synthesis with Diffusion Models: A Survey | Sep 28, 2024 | DenoisingDiversity | CodeCode Available | 2 |
| BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature | Jan 13, 2025 | ArticlesImage-text Retrieval | CodeCode Available | 2 |
| PyMilo: A Python Library for ML I/O | Dec 31, 2024 | | CodeCode Available | 2 |
| Pixel-Wise Recognition for Holistic Surgical Scene Understanding | Jan 20, 2024 | Scene UnderstandingSegmentation | CodeCode Available | 2 |
| Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration | Aug 14, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Feb 29, 2024 | DiversityGPU | CodeCode Available | 2 |
| Faster Than Lies: Real-time Deepfake Detection using Binary Neural Networks | Jun 7, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 2 |
| Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Sep 4, 2024 | Lifelike 3D Human Generation | CodeCode Available | 2 |
| DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics | Dec 11, 2017 | Deep Learning | CodeCode Available | 2 |
| Comparing Differentiable and Dynamic Ray Tracing: Introducing the Multipath Lifetime Map | Oct 18, 2024 | | CodeCode Available | 2 |
| Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension | Mar 6, 2024 | Point Cloud Registration | CodeCode Available | 2 |
| GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing | Jan 23, 2025 | 4k | CodeCode Available | 2 |
| RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Dec 7, 2023 | AttributeVideo Editing | CodeCode Available | 2 |
| Real-time Neural Radiance Caching for Path Tracing | Jun 23, 2021 | Neural Radiance Caching | CodeCode Available | 2 |
| Evaluating Mathematical Reasoning Beyond Accuracy | Apr 8, 2024 | MathMathematical Reasoning | CodeCode Available | 2 |
| Temporally Consistent Stereo Matching | Jul 16, 2024 | Depth EstimationStereo Matching | CodeCode Available | 2 |
| LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Statistical Machine Learning for Astronomy -- A Textbook | Jun 13, 2025 | AstronomyBayesian Inference | CodeCode Available | 2 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Sustainability of Data Center Digital Twins with Reinforcement Learning | Apr 16, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec | Sep 14, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 2 |
| SUTrack: Towards Simple and Unified Single Object Tracking | Dec 26, 2024 | Object TrackingRgb-T Tracking | CodeCode Available | 2 |
| Adaptive Super Resolution For One-Shot Talking-Head Generation | Mar 23, 2024 | DecoderSuper-Resolution | CodeCode Available | 2 |
| A Survey on Multimodal Recommender Systems: Recent Advances and Future Directions | Jan 22, 2025 | Recommendation Systems | CodeCode Available | 2 |
| TextDescriptives: A Python package for calculating a large variety of metrics from text | Jan 5, 2023 | | CodeCode Available | 2 |
| ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Oct 29, 2024 | Active 3D ReconstructionDecision Making | CodeCode Available | 2 |
| MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | Nov 10, 2022 | Multimodal Intent RecognitionRetrieval | CodeCode Available | 2 |
| PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training | Jul 19, 2024 | Point Cloud Registration | CodeCode Available | 2 |
| Vision-and-Language Navigation via Causal Learning | Apr 16, 2024 | Causal InferenceContrastive Learning | CodeCode Available | 2 |
| Multi-view Surface Reconstruction Using Normal and Reflectance Cues | Jun 4, 2025 | Surface Reconstruction | CodeCode Available | 2 |
| CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning | May 19, 2025 | Text to SQLText-To-SQL | CodeCode Available | 2 |
| Partial Large Kernel CNNs for Efficient Super-Resolution | Apr 18, 2024 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Flora: Low-Rank Adapters Are Secretly Gradient Compressors | Feb 5, 2024 | | CodeCode Available | 2 |
| ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation | Dec 7, 2022 | Semantic Segmentationzero-shot-classification | CodeCode Available | 2 |
| Transformer based Pluralistic Image Completion with Reduced Information Loss | Mar 31, 2024 | DecoderImage Inpainting | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation | Mar 24, 2023 | Text to 3D | CodeCode Available | 2 |
| LiSA: LiDAR Localization with Semantic Awareness | Jan 1, 2024 | Knowledge DistillationSemantic Segmentation | CodeCode Available | 2 |
| TensorFlow Quantum: A Software Framework for Quantum Machine Learning | Mar 6, 2020 | BIG-bench Machine LearningMeta-Learning | CodeCode Available | 2 |
| GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Mar 17, 2025 | Autonomous DrivingImage Generation | CodeCode Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | HallucinationMachine Translation | CodeCode Available | 2 |
| Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks | Feb 29, 2024 | BenchmarkingDisentanglement | CodeCode Available | 2 |
| LITA: Language Instructed Temporal-Localization Assistant | Mar 27, 2024 | Instruction FollowingTemporal Localization | CodeCode Available | 2 |
| JaxLife: An Open-Ended Agentic Simulator | Sep 1, 2024 | Artificial Life | CodeCode Available | 2 |
| SeD: Semantic-Aware Discriminator for Image Super-Resolution | Feb 29, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Let Go of Your Labels with Unsupervised Transfer | Jun 11, 2024 | Image ClusteringUnsupervised Image Classification | CodeCode Available | 2 |
| Iterated Denoising Energy Matching for Sampling from Boltzmann Densities | Feb 9, 2024 | DenoisingEfficient Exploration | CodeCode Available | 2 |