| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Jul 10, 2024 | ArticlesReranking | CodeCode Available | 2 |
| xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition | Dec 23, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 2 |
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Nov 3, 2022 | GPU | CodeCode Available | 2 |
| Auto-Encoded Supervision for Perceptual Image Super-Resolution | Nov 28, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Aug 21, 2024 | Video AlignmentVideo Editing | CodeCode Available | 2 |
| Learning Spatio-Temporal Dynamics for Trajectory Recovery via Time-Aware Transformer | May 20, 2025 | Trajectory Recovery | CodeCode Available | 2 |
| JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Feb 19, 2025 | Change DetectionEarth Observation | CodeCode Available | 2 |
| Squeezed Attention: Accelerating Long Context Length LLM Inference | Nov 14, 2024 | Code GenerationLarge Language Model | CodeCode Available | 2 |
| FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information | May 21, 2024 | Speech Recognition | CodeCode Available | 2 |
| Adaptive Dual-domain Learning for Underwater Image Enhancement | Apr 27, 2025 | Image EnhancementUIE | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models | Oct 16, 2023 | Federated Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHA | Mar 7, 2025 | AllDecoder | CodeCode Available | 2 |
| Monocular Lane Detection Based on Deep Learning: A Survey | Nov 25, 2024 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 |
| Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Apr 4, 2025 | Domain GeneralizationMamba | CodeCode Available | 2 |
| PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting | Aug 20, 2024 | Multivariate Time Series ForecastingTemporal Sequences | CodeCode Available | 2 |
| Diffusion Model Quantization: A Review | May 8, 2025 | modelQuantization | CodeCode Available | 2 |
| CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Mar 22, 2022 | DecoderImage Inpainting | CodeCode Available | 2 |
| A Self-Supervised Descriptor for Image Copy Detection | Feb 21, 2022 | Contrastive LearningCopy Detection | CodeCode Available | 2 |
| CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics | Sep 13, 2023 | | CodeCode Available | 2 |
| MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models | Oct 23, 2024 | | CodeCode Available | 2 |
| ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond | Feb 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Neural Discrete Representation Learning | Nov 2, 2017 | DecoderRepresentation Learning | CodeCode Available | 2 |
| Diffusion-based Image Translation using Disentangled Style and Content Representation | Sep 30, 2022 | Style TransferTranslation | CodeCode Available | 2 |
| ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation | Jun 3, 2024 | GPUVideo Generation | CodeCode Available | 2 |
| One Fits All: Power General Time Series Analysis by Pretrained LM | Sep 21, 2023 | | CodeCode Available | 2 |
| SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow | Jul 10, 2022 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection | Jun 20, 2024 | Computational EfficiencyObject | CodeCode Available | 2 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Drivingcounterfactual | CodeCode Available | 2 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration | Jan 1, 2024 | Image RestorationRaindrop Removal | CodeCode Available | 2 |
| Generalizable Human Gaussians from Single-View Image | Jun 10, 2024 | Novel View SynthesisSSIM | CodeCode Available | 2 |
| Deep Geometrized Cartoon Line Inbetweening | Sep 28, 2023 | | CodeCode Available | 2 |
| DistPred: A Distribution-Free Probabilistic Inference Method for Regression and Forecasting | Jun 17, 2024 | Bayesian InferenceComputational Efficiency | CodeCode Available | 2 |
| GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | Dec 20, 2021 | DiversityImage Generation | CodeCode Available | 2 |
| Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models | Oct 24, 2023 | Audio ClassificationAudio Tagging | CodeCode Available | 2 |
| Denoising Diffusion Restoration Models | Jan 27, 2022 | ColorizationDeblurring | CodeCode Available | 2 |
| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Oct 15, 2020 | image-classificationImage Classification | CodeCode Available | 2 |
| AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Jul 5, 2024 | Action RecognitionFew-Shot Image Classification | CodeCode Available | 2 |
| Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis | Jul 9, 2024 | | CodeCode Available | 2 |
| The Power of Noise: Redefining Retrieval for RAG Systems | Jan 26, 2024 | Information RetrievalRAG | CodeCode Available | 2 |
| BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Mar 5, 2025 | Autonomous DrivingMotion Planning | CodeCode Available | 2 |
| CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition | Mar 20, 2023 | RetrievalScene Understanding | CodeCode Available | 2 |
| Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion | Jul 19, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 |
| One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization | Feb 15, 2022 | AllBayesian Optimization | CodeCode Available | 2 |
| NeuRAD: Neural Rendering for Autonomous Driving | Nov 26, 2023 | Autonomous DrivingData Augmentation | CodeCode Available | 2 |
| ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond | May 26, 2023 | Text-to-Video EditingVideo Editing | CodeCode Available | 2 |
| MDETR - Modulated Detection for End-to-End Multi-Modal Understanding | Jan 1, 2021 | Phrase GroundingQuestion Answering | CodeCode Available | 2 |