| RobustSAM: Segment Anything Robustly on Degraded Images | Jun 13, 2024 | DeblurringImage Dehazing | CodeCode Available | 3 |
| Centaur: a foundation model of human cognition | Oct 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Feb 9, 2024 | | CodeCode Available | 3 |
| GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Apr 3, 2025 | Image GenerationWorld Knowledge | CodeCode Available | 3 |
| DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors | Jun 3, 2024 | | CodeCode Available | 3 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation | Jun 11, 2024 | DecoderKnowledge Distillation | CodeCode Available | 3 |
| Model-based Asynchronous Hyperparameter and Neural Architecture Search | Mar 24, 2020 | AutoMLBayesian Optimization | CodeCode Available | 3 |
| ContextCite: Attributing Model Generation to Context | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science | May 23, 2023 | | CodeCode Available | 3 |
| Language Model Inversion | Nov 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 3 |
| DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Mar 20, 2024 | Optical Flow EstimationSensor Fusion | CodeCode Available | 3 |
| GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Aug 22, 2024 | Computational Efficiency | CodeCode Available | 3 |
| Improved motif-scaffolding with SE(3) flow matching | Jan 8, 2024 | Data AugmentationDiversity | CodeCode Available | 3 |
| GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Apr 1, 2024 | GPUMultiobjective Optimization | CodeCode Available | 3 |
| SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression | Mar 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| OmniPred: Language Models as Universal Regressors | Feb 22, 2024 | Experimental Designregression | CodeCode Available | 3 |
| Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification | Feb 23, 2023 | Multi-Object TrackingObject | CodeCode Available | 3 |
| ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Feb 2, 2024 | Combinatorial OptimizationEvolutionary Algorithms | CodeCode Available | 3 |
| ADBench: Anomaly Detection Benchmark | Jun 19, 2022 | Anomaly DetectionOutlier Detection | CodeCode Available | 3 |
| 94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Mar 30, 2024 | GPU | CodeCode Available | 3 |
| MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction | Aug 10, 2023 | Autonomous DrivingOnline Vectorized HD Map Construction | CodeCode Available | 3 |
| On the Trajectory Regularity of ODE-based Diffusion Sampling | May 18, 2024 | DenoisingImage Generation | CodeCode Available | 3 |
| Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting | Jan 28, 2025 | SpecificityTime Series | CodeCode Available | 3 |
| IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models | May 22, 2025 | BenchmarkingInstruction Following | CodeCode Available | 3 |
| SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Nov 29, 2023 | NeRFTalking Face Generation | CodeCode Available | 3 |
| Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey | Jul 11, 2024 | Deep LearningImage Restoration | CodeCode Available | 3 |
| VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Jan 21, 2025 | Image GenerationInstruction Following | CodeCode Available | 3 |
| Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning | Oct 10, 2024 | 3D Parameter-Efficient Fine-Tuning for Classification3D Point Cloud Classification | CodeCode Available | 3 |
| GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Nov 24, 2023 | NeRF | CodeCode Available | 3 |
| GraphStorm: all-in-one graph machine learning framework for industry applications | Jun 10, 2024 | Allgraph construction | CodeCode Available | 3 |
| TokenPacker: Efficient Visual Projector for Multimodal LLM | Jul 2, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| WeatherMesh-3: Fast and accurate operational global weather forecasting | Mar 28, 2025 | Computational EfficiencyGPU | CodeCode Available | 3 |
| NdLinear Is All You Need for Representation Learning | Mar 21, 2025 | AllRepresentation Learning | CodeCode Available | 3 |
| Bake off redux: a review and experimental evaluation of recent time series classification algorithms | Apr 25, 2023 | Dynamic Time WarpingTime Series | CodeCode Available | 3 |
| TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation | Apr 5, 2025 | | CodeCode Available | 3 |
| CameraHMR: Aligning People with Perspective | Nov 12, 2024 | 3D human pose and shape estimation | CodeCode Available | 3 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Jul 6, 2025 | Image GenerationMultimodal Reasoning | CodeCode Available | 3 |
| DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Jan 16, 2025 | Depth EstimationDisparity Estimation | CodeCode Available | 3 |
| Rainbow: Combining Improvements in Deep Reinforcement Learning | Oct 6, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| Mambular: A Sequential Model for Tabular Deep Learning | Aug 12, 2024 | Deep LearningMamba | CodeCode Available | 3 |
| Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization | Jun 6, 2024 | DenoisingImage Generation | CodeCode Available | 3 |
| WHAC: World-grounded Humans and Cameras | Mar 19, 2024 | Camera Pose EstimationPose Estimation | CodeCode Available | 3 |
| GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations | Feb 19, 2024 | Card GamesLogical Reasoning | CodeCode Available | 3 |
| Generative AI Act II: Test Time Scaling Drives Cognition Engineering | Apr 18, 2025 | Prompt Engineering | CodeCode Available | 3 |
| ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models | Oct 25, 2024 | | CodeCode Available | 3 |
| Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning | Feb 12, 2025 | RAGText to SQL | CodeCode Available | 3 |
| Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI | Jan 25, 2024 | | CodeCode Available | 3 |
| Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Oct 7, 2024 | Natural Language Visual GroundingNavigate | CodeCode Available | 3 |