| Open-Source Skull Reconstruction with MONAI | Nov 25, 2022 | C++ code, Deep Learning | Code Available | 3 | 5 |
| MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Jul 2, 2024 | | Code Available | 3 | 5 |
| DiarizationLM: Speaker Diarization Post-Processing with Large Language Models | Jan 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 | 5 |
| RelBench: A Benchmark for Deep Learning on Relational Databases | Jul 29, 2024 | Deep Learning, Feature Engineering | Code Available | 3 | 5 |
| A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions | Jun 9, 2024 | 3D visual grounding, Survey | Code Available | 3 | 5 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement Learning, MuJoCo | Code Available | 3 | 5 |
| Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Jul 31, 2024 | GSM8K, Math | Code Available | 3 | 5 |
| ECG-FM: An Open Electrocardiogram Foundation Model | Aug 9, 2024 | Contrastive Learning, Diagnostic | Code Available | 3 | 5 |
| Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Aug 9, 2024 | object-detection, Object Detection | Code Available | 3 | 5 |
| SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning | Jan 26, 2023 | imbalanced classification | Code Available | 3 | 5 |
| SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Sep 13, 2024 | Deep Attention, Representation Learning | Code Available | 3 | 5 |
| CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Dec 18, 2024 | CAD Reconstruction, Decoder | Code Available | 3 | 5 |
| EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge | May 29, 2025 | text-to-speech, Text to Speech | Code Available | 3 | 5 |
| DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection | Jul 4, 2023 | DeepFake Detection, Face Swapping | Code Available | 3 | 5 |
| FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction | Dec 14, 2024 | Blind Docking, Drug Discovery | Code Available | 3 | 5 |
| LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models | Apr 18, 2025 | Feature Upsampling | Code Available | 3 | 5 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image Generation, Image Reconstruction | Code Available | 3 | 5 |
| ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation | Feb 6, 2024 | Image to Video Generation, Video Generation | Code Available | 3 | 5 |
| Simple linear attention language models balance the recall-throughput tradeoff | Feb 28, 2024 | Language Modelling, Mamba | Code Available | 3 | 5 |
| MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System | Mar 12, 2025 | Chunking, Computational Efficiency | Code Available | 3 | 5 |
| The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features | Jan 6, 2025 | Feature Engineering, Time Series | Code Available | 3 | 5 |
| Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow | Sep 7, 2022 | Domain Adaptation, Image Generation | Code Available | 3 | 5 |
| LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer | Dec 18, 2024 | Attribute, Text Generation | Code Available | 3 | 5 |
| IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization | Jun 15, 2024 | GPU, Image Manipulation | Code Available | 3 | 5 |
| IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |