| Open-Source Skull Reconstruction with MONAI | Nov 25, 2022 | C++ code, Deep Learning | Code Available | 3 |
| MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Jul 2, 2024 | | Code Available | 3 |
| DiarizationLM: Speaker Diarization Post-Processing with Large Language Models | Jan 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| RelBench: A Benchmark for Deep Learning on Relational Databases | Jul 29, 2024 | Deep Learning, Feature Engineering | Code Available | 3 |
| A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions | Jun 9, 2024 | 3D visual grounding, Survey | Code Available | 3 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement Learning, MuJoCo | Code Available | 3 |
| Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Jul 31, 2024 | GSM8K, Math | Code Available | 3 |
| ECG-FM: An Open Electrocardiogram Foundation Model | Aug 9, 2024 | Contrastive Learning, Diagnostic | Code Available | 3 |
| Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Aug 9, 2024 | object-detection, Object Detection | Code Available | 3 |
| SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning | Jan 26, 2023 | imbalanced classification | Code Available | 3 |
| SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Sep 13, 2024 | Deep Attention, Representation Learning | Code Available | 3 |
| CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Dec 18, 2024 | CAD Reconstruction, Decoder | Code Available | 3 |
| EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge | May 29, 2025 | text-to-speech, Text to Speech | Code Available | 3 |
| DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection | Jul 4, 2023 | DeepFake Detection, Face Swapping | Code Available | 3 |
| FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction | Dec 14, 2024 | Blind Docking, Drug Discovery | Code Available | 3 |
| LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models | Apr 18, 2025 | Feature Upsampling | Code Available | 3 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image Generation, Image Reconstruction | Code Available | 3 |
| ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation | Feb 6, 2024 | Image to Video Generation, Video Generation | Code Available | 3 |
| Simple linear attention language models balance the recall-throughput tradeoff | Feb 28, 2024 | Language Modelling, Mamba | Code Available | 3 |
| MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System | Mar 12, 2025 | Chunking, Computational Efficiency | Code Available | 3 |
| The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features | Jan 6, 2025 | Feature Engineering, Time Series | Code Available | 3 |
| Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow | Sep 7, 2022 | Domain Adaptation, Image Generation | Code Available | 3 |
| LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer | Dec 18, 2024 | Attribute, Text Generation | Code Available | 3 |
| IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization | Jun 15, 2024 | GPU, Image Manipulation | Code Available | 3 |
| IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language Modeling, Language Modelling | Code Available | 3 |
| Strassen Multisystolic Array Hardware Architectures | Feb 14, 2025 | | Code Available | 3 |
| CoverM: Read alignment statistics for metagenomics | Jan 20, 2025 | Computational Efficiency | Code Available | 3 |
| OneForecast: A Universal Framework for Global and Regional Weather Forecasting | Feb 1, 2025 | Weather Forecasting | Code Available | 3 |
| Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward | May 18, 2025 | GPU, Graph Matching | Code Available | 3 |
| Improved 3D Point-Line Mapping Regression for Camera Relocalization | Feb 28, 2025 | Camera Relocalization, regression | Code Available | 3 |
| 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Jul 24, 2024 | 3DGS, Survey | Code Available | 3 |
| Delay-penalized transducer for low-latency streaming ASR | Oct 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective | May 22, 2024 | Data Integration, Sensitivity | Code Available | 3 |
| BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | Mar 10, 2022 | Gesture Generation, Gesture Recognition | Code Available | 3 |
| TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation | May 8, 2025 | Quantization | Code Available | 3 |
| Deep Learning for Protein-Ligand Docking: Are We There Yet? | May 23, 2024 | Deep Learning, Drug Discovery | Code Available | 3 |
| Autoregressive Image Generation using Residual Quantization | Mar 3, 2022 | Conditional Image Generation, Image Generation | Code Available | 3 |
| SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Mar 16, 2025 | Language Modeling, Language Modelling | Code Available | 3 |
| AutoSurvey: Large Language Models Can Automatically Write Surveys | Jun 10, 2024 | Retrieval, Survey | Code Available | 3 |
| OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion | Jul 10, 2024 | Object Detection, Zero-Shot Object Detection | Code Available | 3 |
| A Survey on Large Language Model Acceleration based on KV Cache Management | Dec 27, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Feb 27, 2025 | Language Modeling, Language Modelling | Code Available | 3 |
| Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Nov 15, 2021 | Compressive Sensing, Image Reconstruction | Code Available | 3 |
| CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction | Jan 2, 2025 | Mamba, State Space Models | Code Available | 3 |
| Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Sep 11, 2024 | | Code Available | 3 |
| Learning with 3D rotations, a hitchhiker's guide to SO(3) | Apr 17, 2024 | | Code Available | 3 |
| SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Nov 7, 2024 | Code Generation, Language Modeling | Code Available | 3 |
| Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names | Aug 1, 2024 | | Code Available | 3 |
| 4D Panoptic Scene Graph Generation | May 16, 2024 | 4D Panoptic Segmentation, Graph Generation | Code Available | 3 |