| Vision-Language Pre-training: Basics, Recent Advances, and Future Trends | Oct 17, 2022 | Few-Shot Learning, Image Captioning | Code Available | 3 | 5 |
| CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning | Jun 30, 2023 | Causal Inference, Medical Report Generation | Code Available | 3 | 5 |
| PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models | Apr 3, 2024 | GSM8K, Quantization | Code Available | 3 | 5 |
| MLZero: A Multi-Agent System for End-to-end Machine Learning Automation | May 20, 2025 | AutoML, Code Generation | Code Available | 3 | 5 |
| Deformable DETR: Deformable Transformers for End-to-End Object Detection | Oct 8, 2020 | 2D Object Detection, Object Detection | Code Available | 3 | 5 |
| VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Sep 6, 2024 | Image Generation | Code Available | 3 | 5 |
| Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't | Mar 20, 2025 | Mathematical Reasoning, Reinforcement Learning (RL) | Code Available | 3 | 5 |
| Vine Copulas as Differentiable Computational Graphs | Jun 16, 2025 | GPU, Scheduling | Code Available | 3 | 5 |
| Safe RLHF: Safe Reinforcement Learning from Human Feedback | Oct 19, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 3 | 5 |
| Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Oct 14, 2024 | Bayesian Optimization, Experimental Design | Code Available | 3 | 5 |
| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| A Survey of Camouflaged Object Detection and Beyond | Aug 26, 2024 | Instance Segmentation, Object | Code Available | 3 | 5 |
| MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Sep 23, 2024 | 3D Multi-Object Tracking, Autonomous Driving | Code Available | 3 | 5 |
| Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents | Mar 4, 2024 | Contrastive Learning | Code Available | 3 | 5 |
| PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition | Jul 15, 2024 | Automated Theorem Proving | Code Available | 3 | 5 |
| A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond | Mar 21, 2024 | Survey | Code Available | 3 | 5 |
| MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo | Jan 22, 2024 | 3D Reconstruction, Depth Estimation | Code Available | 3 | 5 |
| Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video | Apr 28, 2025 | | Code Available | 3 | 5 |
| MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control | May 26, 2022 | continuous-control, Continuous Control | Code Available | 3 | 5 |
| Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storage | Nov 8, 2024 | Computational Efficiency | Code Available | 3 | 5 |
| Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey | Jun 11, 2024 | DeepFake Detection, Face Swapping | Code Available | 3 | 5 |
| Towards Kinetic Manipulation of the Latent Space | Sep 15, 2024 | | Code Available | 3 | 5 |
| Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation | Apr 25, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 3 | 5 |
| AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP | Mar 9, 2025 | Anomaly Detection, Anomaly Localization | Code Available | 3 | 5 |
| xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart | Jul 1, 2024 | 3D Medical Imaging Segmentation, image-classification | Code Available | 3 | 5 |