| Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2 | Aug 3, 2024 | DiversitySegmentation | CodeCode Available | 3 | 5 |
| Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Oct 13, 2024 | Contrastive LearningMedical Image Analysis | CodeCode Available | 3 | 5 |
| ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Jul 29, 2019 | Chinese Named Entity RecognitionChinese Reading Comprehension | CodeCode Available | 3 | 5 |
| PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection | Apr 8, 2024 | Anomaly DetectionLanguage Modeling | CodeCode Available | 3 | 5 |
| SINERGYM -- A virtual testbed for building energy optimization with Reinforcement Learning | Dec 11, 2024 | continuous-controlContinuous Control | CodeCode Available | 3 | 5 |
| ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | May 18, 2023 | 1 Image, 2*2 StitchiAction Classification | CodeCode Available | 3 | 5 |
| Video ReCap: Recursive Captioning of Hour-Long Videos | Feb 20, 2024 | EgoSchemaVideo Captioning | CodeCode Available | 3 | 5 |
| Magnitude-aware Probabilistic Speaker Embeddings | Feb 28, 2022 | Out-of-Distribution DetectionSpeaker Verification | CodeCode Available | 3 | 5 |
| RobustSAM: Segment Anything Robustly on Degraded Images | Jun 13, 2024 | DeblurringImage Dehazing | CodeCode Available | 3 | 5 |
| Centaur: a foundation model of human cognition | Oct 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Feb 9, 2024 | | CodeCode Available | 3 | 5 |
| GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Apr 3, 2025 | Image GenerationWorld Knowledge | CodeCode Available | 3 | 5 |
| DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors | Jun 3, 2024 | | CodeCode Available | 3 | 5 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 | 5 |
| Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation | Jun 11, 2024 | DecoderKnowledge Distillation | CodeCode Available | 3 | 5 |
| Model-based Asynchronous Hyperparameter and Neural Architecture Search | Mar 24, 2020 | AutoMLBayesian Optimization | CodeCode Available | 3 | 5 |
| ContextCite: Attributing Model Generation to Context | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science | May 23, 2023 | | CodeCode Available | 3 | 5 |
| Language Model Inversion | Nov 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 3 | 5 |
| DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Mar 20, 2024 | Optical Flow EstimationSensor Fusion | CodeCode Available | 3 | 5 |
| GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Aug 22, 2024 | Computational Efficiency | CodeCode Available | 3 | 5 |
| Improved motif-scaffolding with SE(3) flow matching | Jan 8, 2024 | Data AugmentationDiversity | CodeCode Available | 3 | 5 |
| GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Apr 1, 2024 | GPUMultiobjective Optimization | CodeCode Available | 3 | 5 |
| SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression | Mar 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| OmniPred: Language Models as Universal Regressors | Feb 22, 2024 | Experimental Designregression | CodeCode Available | 3 | 5 |
| Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification | Feb 23, 2023 | Multi-Object TrackingObject | CodeCode Available | 3 | 5 |
| ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Feb 2, 2024 | Combinatorial OptimizationEvolutionary Algorithms | CodeCode Available | 3 | 5 |
| ADBench: Anomaly Detection Benchmark | Jun 19, 2022 | Anomaly DetectionOutlier Detection | CodeCode Available | 3 | 5 |
| 94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Mar 30, 2024 | GPU | CodeCode Available | 3 | 5 |
| MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction | Aug 10, 2023 | Autonomous DrivingOnline Vectorized HD Map Construction | CodeCode Available | 3 | 5 |
| On the Trajectory Regularity of ODE-based Diffusion Sampling | May 18, 2024 | DenoisingImage Generation | CodeCode Available | 3 | 5 |
| Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting | Jan 28, 2025 | SpecificityTime Series | CodeCode Available | 3 | 5 |
| IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models | May 22, 2025 | BenchmarkingInstruction Following | CodeCode Available | 3 | 5 |
| SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Nov 29, 2023 | NeRFTalking Face Generation | CodeCode Available | 3 | 5 |
| Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey | Jul 11, 2024 | Deep LearningImage Restoration | CodeCode Available | 3 | 5 |
| VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Jan 21, 2025 | Image GenerationInstruction Following | CodeCode Available | 3 | 5 |
| Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning | Oct 10, 2024 | 3D Parameter-Efficient Fine-Tuning for Classification3D Point Cloud Classification | CodeCode Available | 3 | 5 |
| GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Nov 24, 2023 | NeRF | CodeCode Available | 3 | 5 |
| GraphStorm: all-in-one graph machine learning framework for industry applications | Jun 10, 2024 | Allgraph construction | CodeCode Available | 3 | 5 |
| TokenPacker: Efficient Visual Projector for Multimodal LLM | Jul 2, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 | 5 |
| WeatherMesh-3: Fast and accurate operational global weather forecasting | Mar 28, 2025 | Computational EfficiencyGPU | CodeCode Available | 3 | 5 |
| NdLinear Is All You Need for Representation Learning | Mar 21, 2025 | AllRepresentation Learning | CodeCode Available | 3 | 5 |
| Bake off redux: a review and experimental evaluation of recent time series classification algorithms | Apr 25, 2023 | Dynamic Time WarpingTime Series | CodeCode Available | 3 | 5 |
| TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation | Apr 5, 2025 | | CodeCode Available | 3 | 5 |
| CameraHMR: Aligning People with Perspective | Nov 12, 2024 | 3D human pose and shape estimation | CodeCode Available | 3 | 5 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Jul 6, 2025 | Image GenerationMultimodal Reasoning | CodeCode Available | 3 | 5 |
| DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Jan 16, 2025 | Depth EstimationDisparity Estimation | CodeCode Available | 3 | 5 |
| Rainbow: Combining Improvements in Deep Reinforcement Learning | Oct 6, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 | 5 |
| Mambular: A Sequential Model for Tabular Deep Learning | Aug 12, 2024 | Deep LearningMamba | CodeCode Available | 3 | 5 |