| BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments | May 27, 2024 | AI AgentBayesian Optimization | CodeCode Available | 2 | 5 |
| 3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation | Sep 12, 2022 | 3D Face AnimationDisentanglement | CodeCode Available | 2 | 5 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 | 5 |
| SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words | Jun 19, 2024 | Dialogue Understanding | CodeCode Available | 2 | 5 |
| MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Aug 22, 2024 | | CodeCode Available | 2 | 5 |
| Retrieval-Augmented Diffusion Models for Time Series Forecasting | Oct 24, 2024 | DenoisingRetrieval | CodeCode Available | 2 | 5 |
| Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Nov 26, 2024 | GPUImage Generation | CodeCode Available | 2 | 5 |
| Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection | Oct 5, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Jun 24, 2021 | MuJoCoOpenAI Gym | CodeCode Available | 2 | 5 |
| SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery | Jun 26, 2024 | Domain AdaptationEarth Observation | CodeCode Available | 2 | 5 |
| Machine learning interatomic potential can infer electrical response | Apr 7, 2025 | | CodeCode Available | 2 | 5 |
| HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Jun 9, 2025 | Combinatorial OptimizationMemorization | CodeCode Available | 2 | 5 |
| Fully Sparse 3D Occupancy Prediction | Dec 28, 2023 | Autonomous DrivingPrediction | CodeCode Available | 2 | 5 |
| SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition | Oct 14, 2024 | Activity RecognitionDescriptive | CodeCode Available | 2 | 5 |
| MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration | Jan 25, 2024 | Computed Tomography (CT)Image Registration | CodeCode Available | 2 | 5 |
| Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction | Sep 1, 2021 | 3D ReconstructionNeural Rendering | CodeCode Available | 2 | 5 |
| Human Pose as Compositional Tokens | Mar 21, 2023 | DecoderPose Estimation | CodeCode Available | 2 | 5 |
| Dense Distinct Query for End-to-End Object Detection | Mar 22, 2023 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Deduplicating Training Data Makes Language Models Better | Jul 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search | May 5, 2022 | | CodeCode Available | 2 | 5 |
| Autonomous GIS: the next-generation AI-powered GIS | May 10, 2023 | Code GenerationInformation Retrieval | CodeCode Available | 2 | 5 |
| The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning | Jun 2, 2025 | MathMathematical Reasoning | CodeCode Available | 2 | 5 |
| DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Jun 6, 2024 | 3D GenerationText to 3D | CodeCode Available | 2 | 5 |
| TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation | Sep 19, 2024 | Vision-Language-Action | CodeCode Available | 2 | 5 |
| Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response | Nov 10, 2024 | Decision MakingGraph Neural Network | CodeCode Available | 2 | 5 |
| UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis | Mar 20, 2025 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 | 5 |
| LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching | Jun 20, 2023 | Brain Tumor ClassificationContrastive Learning | CodeCode Available | 2 | 5 |
| ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Jul 15, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 2 | 5 |
| PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Mar 30, 2023 | 3D Human Pose EstimationClassification | CodeCode Available | 2 | 5 |
| SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model | Dec 5, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 | 5 |
| Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition | Apr 21, 2024 | Image Restoration | CodeCode Available | 2 | 5 |
| LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation | Dec 28, 2023 | Answer GenerationChatbot | CodeCode Available | 2 | 5 |
| Overview of the PromptCBLUE Shared Task in CHIP2023 | Dec 29, 2023 | In-Context Learning | CodeCode Available | 2 | 5 |
| DebugBench: Evaluating Debugging Capability of Large Language Models | Jan 9, 2024 | Code Generation | CodeCode Available | 2 | 5 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 | 5 |
| Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs | Apr 22, 2024 | Misinformation | CodeCode Available | 2 | 5 |
| PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Jan 15, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion | Feb 8, 2024 | Computational EfficiencyMultimodal Reasoning | CodeCode Available | 2 | 5 |
| VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation | Dec 6, 2023 | Language ModellingNavigate | CodeCode Available | 2 | 5 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 2 | 5 |
| An Efficient and Mixed Heterogeneous Model for Image Restoration | Apr 15, 2025 | Image RestorationMamba | CodeCode Available | 2 | 5 |
| Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Jun 11, 2024 | Multiple-choiceSelection bias | CodeCode Available | 2 | 5 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 | 5 |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Mar 29, 2024 | Continual LearningContinual Panoptic Segmentation | CodeCode Available | 2 | 5 |
| Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development | Feb 18, 2021 | BIG-bench Machine LearningDrug Discovery | CodeCode Available | 2 | 5 |
| Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras | Apr 29, 2024 | Multi-Task LearningPrognosis | CodeCode Available | 2 | 5 |
| 2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection | Jun 15, 2023 | Anomaly DetectionAnomaly Localization | CodeCode Available | 2 | 5 |
| TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Aug 16, 2023 | DescriptiveQuestion Answering | CodeCode Available | 2 | 5 |
| BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models | Jun 17, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 2 | 5 |
| LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Jan 14, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |