| Recipe for a General, Powerful, Scalable Graph Transformer | May 25, 2022 | Graph ClassificationGraph Property Prediction | CodeCode Available | 2 | 5 |
| WATT: Weight Average Test-Time Adaptation of CLIP | Jun 19, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Jul 17, 2024 | Autonomous Driving | CodeCode Available | 2 | 5 |
| CoverNet: Multimodal Behavior Prediction using Trajectory Sets | Nov 23, 2019 | Predictionregression | CodeCode Available | 2 | 5 |
| VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling | Aug 2, 2024 | Image Generation | CodeCode Available | 2 | 5 |
| MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming | May 29, 2025 | DiversityEfficient Exploration | CodeCode Available | 2 | 5 |
| Fused Gromov-Wasserstein distance for structured objects: theoretical foundations and mathematical properties | Nov 7, 2018 | BIG-bench Machine Learning | CodeCode Available | 2 | 5 |
| RSL-SQL: Robust Schema Linking in Text-to-SQL Generation | Oct 31, 2024 | Text to SQLText-To-SQL | CodeCode Available | 2 | 5 |
| The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 | 5 |
| Multi-modal Time Series Analysis: A Tutorial and Survey | Mar 17, 2025 | SurveyTime Series | CodeCode Available | 2 | 5 |
| Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner | May 16, 2025 | Cross-Modal RetrievalDiagnostic | CodeCode Available | 2 | 5 |
| Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models | Apr 17, 2023 | Bokeh Effect RenderingDenoising | CodeCode Available | 2 | 5 |
| Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Apr 15, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 | 5 |
| Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation | Dec 12, 2024 | Image AugmentationImage Generation | CodeCode Available | 2 | 5 |
| Revisiting Unreasonable Effectiveness of Data in Deep Learning Era | Jul 10, 2017 | Deep Learningimage-classification | CodeCode Available | 2 | 5 |
| Generating Long Videos of Dynamic Scenes | Jun 7, 2022 | MORPHVideo Generation | CodeCode Available | 2 | 5 |
| Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Dec 27, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Dec 5, 2024 | AttributeQuantization | CodeCode Available | 2 | 5 |
| Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Dec 24, 2024 | Question AnsweringVideo Question Answering | CodeCode Available | 2 | 5 |
| TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On | Apr 1, 2024 | Virtual Try-on | CodeCode Available | 2 | 5 |
| Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges | Apr 27, 2021 | Deep LearningProtein Folding | CodeCode Available | 2 | 5 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| An end-to-end attention-based approach for learning on graphs | Feb 16, 2024 | Graph ClassificationGraph Regression | CodeCode Available | 2 | 5 |
| TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement | Feb 26, 2024 | Machine TranslationTranslation | CodeCode Available | 2 | 5 |
| CLAIMED, a visual and scalable component library for Trusted AI | Mar 4, 2021 | Adversarial RobustnessFairness | CodeCode Available | 2 | 5 |
| ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery | Oct 7, 2024 | scientific discovery | CodeCode Available | 2 | 5 |
| FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Jun 5, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 | 5 |
| A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Jun 9, 2024 | Contrastive LearningDenoising | CodeCode Available | 2 | 5 |
| Mapping the Increasing Use of LLMs in Scientific Papers | Apr 1, 2024 | | CodeCode Available | 2 | 5 |
| Efficient Training of Deep Equilibrium Models | Apr 23, 2023 | | CodeCode Available | 2 | 5 |
| BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training | Mar 24, 2022 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Frugal Optimization for Cost-related Hyperparameters | May 4, 2020 | AutoMLBIG-bench Machine Learning | CodeCode Available | 2 | 5 |
| Point-SLAM: Dense Neural Point Cloud-based SLAM | Apr 9, 2023 | Simultaneous Localization and Mapping | CodeCode Available | 2 | 5 |
| V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians | Sep 20, 2024 | 3DGS | CodeCode Available | 2 | 5 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data AugmentationDeep Learning | CodeCode Available | 2 | 5 |
| Secure & Private Federated Neuroimaging | May 11, 2022 | Federated Learning | CodeCode Available | 2 | 5 |
| SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation | Jul 28, 2022 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 2 | 5 |
| LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Mar 22, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 | 5 |
| Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning | Oct 24, 2019 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 2 | 5 |
| Fast inference of deep neural networks in FPGAs for particle physics | Apr 16, 2018 | BIG-bench Machine LearningHigh-Level Synthesis | CodeCode Available | 2 | 5 |
| Riemannian Adaptive Optimization Methods | Oct 1, 2018 | Riemannian optimizationStochastic Optimization | CodeCode Available | 2 | 5 |
| Training Strategies for Improved Lip-reading | Sep 3, 2022 | Data AugmentationLipreading | CodeCode Available | 2 | 5 |
| Content-Based Search for Deep Generative Models | Oct 6, 2022 | Contrastive LearningImage and Sketch based Model Retrieval | CodeCode Available | 2 | 5 |
| Introducing v0.5 of the AI Safety Benchmark from MLCommons | Apr 18, 2024 | | CodeCode Available | 2 | 5 |
| Improving Long-Text Alignment for Text-to-Image Diffusion Models | Oct 15, 2024 | | CodeCode Available | 2 | 5 |
| Boosting Latent Diffusion with Flow Matching | Dec 12, 2023 | DecoderDiversity | CodeCode Available | 2 | 5 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | CodeCode Available | 2 | 5 |
| Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks | Apr 13, 2016 | Depth Estimation | CodeCode Available | 2 | 5 |
| Benchmarks and leaderboards for sound demixing tasks | May 12, 2023 | | CodeCode Available | 2 | 5 |
| AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection | Oct 29, 2023 | Anomaly DetectionPrompt Learning | CodeCode Available | 2 | 5 |