| Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark | Jun 21, 2024 | Anomaly Detection, Out-of-Distribution Detection | Code Available | 2 | 5 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 | 5 |
| UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models | Jun 27, 2024 | Attribute, Benchmarking | Code Available | 2 | 5 |
| SEAL: Steerable Reasoning Calibration of Large Language Models for Free | Apr 7, 2025 | GSM8K | Code Available | 2 | 5 |
| LightGNN: Simple Graph Neural Network for Recommendation | Jan 6, 2025 | Computational Efficiency, Graph Neural Network | Code Available | 2 | 5 |
| Edicho: Consistent Image Editing in the Wild | Dec 30, 2024 | Denoising | Code Available | 2 | 5 |
| SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Jan 28, 2025 | Benchmarking, Language Modeling | Code Available | 2 | 5 |
| Real-Time Fitness Exercise Classification and Counting from Video Frames | Nov 18, 2024 | | Code Available | 2 | 5 |
| What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning | Dec 25, 2023 | | Code Available | 2 | 5 |
| Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization | Dec 23, 2024 | Position | Code Available | 2 | 5 |
| RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Feb 12, 2023 | Decoder, Language Modeling | Code Available | 2 | 5 |
| FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Apr 24, 2025 | Answer Selection, Information Retrieval | Code Available | 2 | 5 |
| Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process Models | Oct 18, 2023 | | Code Available | 2 | 5 |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Jul 10, 2024 | Articles, Reranking | Code Available | 2 | 5 |
| xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition | Dec 23, 2024 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 | 5 |
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Nov 3, 2022 | GPU | Code Available | 2 | 5 |
| Auto-Encoded Supervision for Perceptual Image Super-Resolution | Nov 28, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Aug 21, 2024 | Video Alignment, Video Editing | Code Available | 2 | 5 |
| Learning Spatio-Temporal Dynamics for Trajectory Recovery via Time-Aware Transformer | May 20, 2025 | Trajectory Recovery | Code Available | 2 | 5 |
| JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Feb 19, 2025 | Change Detection, Earth Observation | Code Available | 2 | 5 |
| Squeezed Attention: Accelerating Long Context Length LLM Inference | Nov 14, 2024 | Code Generation, Large Language Model | Code Available | 2 | 5 |
| FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information | May 21, 2024 | Speech Recognition | Code Available | 2 | 5 |
| Adaptive Dual-domain Learning for Underwater Image Enhancement | Apr 27, 2025 | Image Enhancement, UIE | Code Available | 2 | 5 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 | 5 |
| FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models | Oct 16, 2023 | Federated Learning, parameter-efficient fine-tuning | Code Available | 2 | 5 |
| Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHA | Mar 7, 2025 | All, Decoder | Code Available | 2 | 5 |
| Monocular Lane Detection Based on Deep Learning: A Survey | Nov 25, 2024 | 3D Lane Detection, Autonomous Driving | Code Available | 2 | 5 |
| Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Apr 4, 2025 | Domain Generalization, Mamba | Code Available | 2 | 5 |
| PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting | Aug 20, 2024 | Multivariate Time Series Forecasting, Temporal Sequences | Code Available | 2 | 5 |
| Diffusion Model Quantization: A Review | May 8, 2025 | model, Quantization | Code Available | 2 | 5 |
| CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Mar 22, 2022 | Decoder, Image Inpainting | Code Available | 2 | 5 |
| A Self-Supervised Descriptor for Image Copy Detection | Feb 21, 2022 | Contrastive Learning, Copy Detection | Code Available | 2 | 5 |
| CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics | Sep 13, 2023 | | Code Available | 2 | 5 |
| MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models | Oct 23, 2024 | | Code Available | 2 | 5 |
| ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond | Feb 21, 2022 | Image Classification, Inductive Bias | Code Available | 2 | 5 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Neural Discrete Representation Learning | Nov 2, 2017 | Decoder, Representation Learning | Code Available | 2 | 5 |
| Diffusion-based Image Translation using Disentangled Style and Content Representation | Sep 30, 2022 | Style Transfer, Translation | Code Available | 2 | 5 |
| ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation | Jun 3, 2024 | GPU, Video Generation | Code Available | 2 | 5 |
| One Fits All: Power General Time Series Analysis by Pretrained LM | Sep 21, 2023 | | Code Available | 2 | 5 |
| SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow | Jul 10, 2022 | Real-Time Semantic Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection | Jun 20, 2024 | Computational Efficiency, Object | Code Available | 2 | 5 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Driving, counterfactual | Code Available | 2 | 5 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference Resolution, Cross-Lingual Transfer | Code Available | 2 | 5 |
| Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration | Jan 1, 2024 | Image Restoration, Raindrop Removal | Code Available | 2 | 5 |
| Generalizable Human Gaussians from Single-View Image | Jun 10, 2024 | Novel View Synthesis, SSIM | Code Available | 2 | 5 |
| Deep Geometrized Cartoon Line Inbetweening | Sep 28, 2023 | | Code Available | 2 | 5 |
| DistPred: A Distribution-Free Probabilistic Inference Method for Regression and Forecasting | Jun 17, 2024 | Bayesian Inference, Computational Efficiency | Code Available | 2 | 5 |
| GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | Dec 20, 2021 | Diversity, Image Generation | Code Available | 2 | 5 |