| CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning | Jun 30, 2023 | Causal InferenceMedical Report Generation | CodeCode Available | 3 | 5 |
| PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models | Apr 3, 2024 | GSM8KQuantization | CodeCode Available | 3 | 5 |
| MLZero: A Multi-Agent System for End-to-end Machine Learning Automation | May 20, 2025 | AutoMLCode Generation | CodeCode Available | 3 | 5 |
| Deformable DETR: Deformable Transformers for End-to-End Object Detection | Oct 8, 2020 | 2D Object DetectionObject Detection | CodeCode Available | 3 | 5 |
| VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Sep 6, 2024 | Image Generation | CodeCode Available | 3 | 5 |
| Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't | Mar 20, 2025 | Mathematical ReasoningReinforcement Learning (RL) | CodeCode Available | 3 | 5 |
| Vine Copulas as Differentiable Computational Graphs | Jun 16, 2025 | GPUScheduling | CodeCode Available | 3 | 5 |
| Safe RLHF: Safe Reinforcement Learning from Human Feedback | Oct 19, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 3 | 5 |
| Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Oct 14, 2024 | Bayesian OptimizationExperimental Design | CodeCode Available | 3 | 5 |
| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| A Survey of Camouflaged Object Detection and Beyond | Aug 26, 2024 | Instance SegmentationObject | CodeCode Available | 3 | 5 |
| MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Sep 23, 2024 | 3D Multi-Object TrackingAutonomous Driving | CodeCode Available | 3 | 5 |
| Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents | Mar 4, 2024 | Contrastive Learning | CodeCode Available | 3 | 5 |
| PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition | Jul 15, 2024 | Automated Theorem Proving | CodeCode Available | 3 | 5 |
| A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond | Mar 21, 2024 | Survey | CodeCode Available | 3 | 5 |
| MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo | Jan 22, 2024 | 3D ReconstructionDepth Estimation | CodeCode Available | 3 | 5 |
| Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video | Apr 28, 2025 | | CodeCode Available | 3 | 5 |
| MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control | May 26, 2022 | continuous-controlContinuous Control | CodeCode Available | 3 | 5 |
| Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storage | Nov 8, 2024 | Computational Efficiency | CodeCode Available | 3 | 5 |
| Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey | Jun 11, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 3 | 5 |
| Towards Kinetic Manipulation of the Latent Space | Sep 15, 2024 | | CodeCode Available | 3 | 5 |
| Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation | Apr 25, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 3 | 5 |
| AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP | Mar 9, 2025 | Anomaly DetectionAnomaly Localization | CodeCode Available | 3 | 5 |
| xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Jul 1, 2024 | 3D Medical Imaging Segmentationimage-classification | CodeCode Available | 3 | 5 |
| Open-Source Skull Reconstruction with MONAI | Nov 25, 2022 | C++ codeDeep Learning | CodeCode Available | 3 | 5 |
| MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Jul 2, 2024 | | CodeCode Available | 3 | 5 |
| DiarizationLM: Speaker Diarization Post-Processing with Large Language Models | Jan 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 | 5 |
| RelBench: A Benchmark for Deep Learning on Relational Databases | Jul 29, 2024 | Deep LearningFeature Engineering | CodeCode Available | 3 | 5 |
| A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions | Jun 9, 2024 | 3D visual groundingSurvey | CodeCode Available | 3 | 5 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 3 | 5 |
| Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Jul 31, 2024 | GSM8KMath | CodeCode Available | 3 | 5 |
| ECG-FM: An Open Electrocardiogram Foundation Model | Aug 9, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 3 | 5 |
| Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Aug 9, 2024 | object-detectionObject Detection | CodeCode Available | 3 | 5 |
| SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning | Jan 26, 2023 | imbalanced classification | CodeCode Available | 3 | 5 |
| SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Sep 13, 2024 | Deep AttentionRepresentation Learning | CodeCode Available | 3 | 5 |
| CAD-Recode: Reverse Engineering CAD Code from Point Clouds | Dec 18, 2024 | CAD ReconstructionDecoder | CodeCode Available | 3 | 5 |
| EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge | May 29, 2025 | text-to-speechText to Speech | CodeCode Available | 3 | 5 |
| DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection | Jul 4, 2023 | DeepFake DetectionFace Swapping | CodeCode Available | 3 | 5 |
| FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction | Dec 14, 2024 | Blind DockingDrug Discovery | CodeCode Available | 3 | 5 |
| LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models | Apr 18, 2025 | Feature Upsampling | CodeCode Available | 3 | 5 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 3 | 5 |
| ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation | Feb 6, 2024 | Image to Video GenerationVideo Generation | CodeCode Available | 3 | 5 |
| Simple linear attention language models balance the recall-throughput tradeoff | Feb 28, 2024 | Language ModellingMamba | CodeCode Available | 3 | 5 |
| MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System | Mar 12, 2025 | ChunkingComputational Efficiency | CodeCode Available | 3 | 5 |
| The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features | Jan 6, 2025 | Feature EngineeringTime Series | CodeCode Available | 3 | 5 |
| Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow | Sep 7, 2022 | Domain AdaptationImage Generation | CodeCode Available | 3 | 5 |
| LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer | Dec 18, 2024 | AttributeText Generation | CodeCode Available | 3 | 5 |
| IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization | Jun 15, 2024 | GPUImage Manipulation | CodeCode Available | 3 | 5 |
| IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |