| 3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh | Jan 13, 2025 | 3DGS, Surface Reconstruction | Code Available | 4 |
| Qlib: An AI-oriented Quantitative Investment Platform | Sep 22, 2020 | Portfolio Optimization, Stock Market Prediction | Code Available | 4 |
| Exploiting Diffusion Prior for Real-World Image Super-Resolution | May 11, 2023 | Blind Super-Resolution, Image Super-Resolution | Code Available | 4 |
| MegActor-Σ: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer | Aug 27, 2024 | Portrait Animation | Code Available | 4 |
| OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Jul 10, 2024 | | Code Available | 4 |
| Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation | Nov 28, 2023 | | Code Available | 4 |
| The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning | Mar 5, 2024 | Multiple-choice | Code Available | 4 |
| Aequitas Flow: Streamlining Fair ML Experimentation | May 9, 2024 | Benchmarking, Fairness | Code Available | 4 |
| Efficient Part-level 3D Object Generation via Dual Volume Packing | Jun 11, 2025 | Diversity, Object | Code Available | 4 |
| Character Region Awareness for Text Detection | Apr 3, 2019 | Scene Text Detection, Text Detection | Code Available | 4 |
| Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization | Feb 5, 2024 | Science Question Answering, Text-to-Video Generation | Code Available | 4 |
| AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators | Mar 29, 2023 | Information Retrieval, Retrieval | Code Available | 4 |
| Advancing Parsimonious Deep Learning Weather Prediction using the HEALPix Mesh | Sep 11, 2023 | Deep Learning | Code Available | 4 |
| SurveyX: Academic Survey Automation via Large Language Models | Feb 20, 2025 | Survey | Code Available | 4 |
| Cameras as Rays: Pose Estimation via Ray Diffusion | Feb 22, 2024 | 3D Reconstruction, Camera Pose Estimation | Code Available | 4 |
| Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | May 23, 2025 | Decoder, Image Captioning | Code Available | 4 |
| Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Jul 9, 2024 | Retrieval-augmented Generation | Code Available | 4 |
| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Jan 1, 2024 | Visual Grounding, World Knowledge | Code Available | 4 |
| MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Jul 8, 2024 | Video Alignment, Video Generation | Code Available | 4 |
| Conformalized Physics-Informed Neural Networks | May 13, 2024 | Conformal Prediction | Code Available | 4 |
| RETSim: Resilient and Efficient Text Similarity | Nov 28, 2023 | Adversarial Text, Clustering | Code Available | 4 |
| RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark | Jan 8, 2025 | object-detection, Object Detection | Code Available | 4 |
| Data-Prep-Kit: getting your data ready for LLM application development | Sep 26, 2024 | CPU, Language Modeling | Code Available | 4 |
| NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | May 2, 2024 | model, parameter-efficient fine-tuning | Code Available | 4 |
| Retrieval-Augmented Generation for Large Language Models: A Survey | Dec 18, 2023 | Hallucination, RAG | Code Available | 4 |
| Learning the Beauty in Songs: Neural Singing Voice Beautifier | Feb 27, 2022 | Dynamic Time Warping | Code Available | 4 |
| HVI: A New color space for Low-light Image Enhancement | Feb 27, 2025 | Image Enhancement, Low-Light Image Enhancement | Code Available | 4 |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets | Jan 6, 2022 | Memorization | Code Available | 4 |
| Graspness Discovery in Clutters for Fast and Accurate Grasp Detection | Jun 17, 2024 | | Code Available | 4 |
| OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models | Mar 16, 2024 | Denoising, Image Generation | Code Available | 4 |
| Guiding Instruction-based Image Editing via Multimodal Large Language Models | Sep 29, 2023 | Image Manipulation, Response Generation | Code Available | 4 |
| Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Dec 3, 2024 | Image Generation, Image Reconstruction | Code Available | 4 |
| SocialED: A Python Library for Social Event Detection | Dec 18, 2024 | CPU, Event Detection | Code Available | 4 |
| OLMoE: Open Mixture-of-Experts Language Models | Sep 3, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | Aug 28, 2024 | Optical Character Recognition | Code Available | 4 |
| On the limits of agency in agent-based models | Sep 14, 2024 | Computational Efficiency, counterfactual | Code Available | 4 |
| Visual Attention Network | Feb 20, 2022 | image-classification, Image Classification | Code Available | 4 |
| Large Language Models for Time Series: A Survey | Feb 2, 2024 | Quantization, Survey | Code Available | 4 |
| LLMMapReduce: Simplified Long-Sequence Processing using Large Language Models | Oct 12, 2024 | document understanding | Code Available | 4 |
| OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination | Mar 22, 2025 | | Code Available | 4 |
| TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning | Sep 26, 2023 | | Code Available | 4 |
| Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster | Apr 6, 2023 | | Code Available | 4 |
| Large Language Model-Based Agents for Software Engineering: A Survey | Sep 4, 2024 | AI Agent, Language Modeling | Code Available | 4 |
| R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Feb 24, 2025 | Language Modeling, Language Modelling | Code Available | 4 |
| Training Sparse Mixture Of Experts Text Embedding Models | Feb 11, 2025 | Mixture-of-Experts, RAG | Code Available | 4 |
| PyTorch Adapt | Nov 28, 2022 | Domain Adaptation | Code Available | 4 |
| Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality | May 5, 2025 | Retrieval | Code Available | 4 |
| RecurrentGemma: Moving Past Transformers for Efficient Open Language Models | Apr 11, 2024 | Language Modelling | Code Available | 4 |
| MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning | Dec 12, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 4 |