| Heterogeneous Multi-Robot Reinforcement Learning | Jan 17, 2023 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 2 |
| FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference | Feb 28, 2025 | | Code Available | 2 |
| TRADES: Generating Realistic Market Simulations with Diffusion Models | Jan 31, 2025 | Denoising | Code Available | 2 |
| Learning to Compress Prompts with Gist Tokens | Apr 17, 2023 | Decoder | Code Available | 2 |
| Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Oct 6, 2023 | Code Generation, Decision Making | Code Available | 2 |
| Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation | Aug 24, 2023 | Image-to-Image Translation | Code Available | 2 |
| Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey | Feb 8, 2025 | Fairness, RAG | Code Available | 2 |
| LongReward: Improving Long-context Large Language Models with AI Feedback | Oct 28, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 2 |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Mar 4, 2022 | Decoder, GPU | Code Available | 2 |
| Conformal Symplectic Optimization for Stable Reinforcement Learning | Dec 3, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 2 |
| SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs | Jun 5, 2025 | | Code Available | 2 |
| A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Oct 2, 2024 | Image Generation, Quantization | Code Available | 2 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image Retrieval, Object | Code Available | 2 |
| Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark | Jun 21, 2024 | Anomaly Detection, Out-of-Distribution Detection | Code Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 |
| UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models | Jun 27, 2024 | Attribute, Benchmarking | Code Available | 2 |
| SEAL: Steerable Reasoning Calibration of Large Language Models for Free | Apr 7, 2025 | GSM8K | Code Available | 2 |
| LightGNN: Simple Graph Neural Network for Recommendation | Jan 6, 2025 | Computational Efficiency, Graph Neural Network | Code Available | 2 |
| Edicho: Consistent Image Editing in the Wild | Dec 30, 2024 | Denoising | Code Available | 2 |
| SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Jan 28, 2025 | Benchmarking, Language Modeling | Code Available | 2 |
| Real-Time Fitness Exercise Classification and Counting from Video Frames | Nov 18, 2024 | | Code Available | 2 |
| What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning | Dec 25, 2023 | | Code Available | 2 |
| Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization | Dec 23, 2024 | Position | Code Available | 2 |
| RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Feb 12, 2023 | Decoder, Language Modeling | Code Available | 2 |
| FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Apr 24, 2025 | Answer Selection, Information Retrieval | Code Available | 2 |
| Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process Models | Oct 18, 2023 | | Code Available | 2 |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Jul 10, 2024 | Articles, Reranking | Code Available | 2 |
| xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition | Dec 23, 2024 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 |
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Nov 3, 2022 | GPU | Code Available | 2 |
| Auto-Encoded Supervision for Perceptual Image Super-Resolution | Nov 28, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Aug 21, 2024 | Video Alignment, Video Editing | Code Available | 2 |
| Learning Spatio-Temporal Dynamics for Trajectory Recovery via Time-Aware Transformer | May 20, 2025 | Trajectory Recovery | Code Available | 2 |
| JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Feb 19, 2025 | Change Detection, Earth Observation | Code Available | 2 |
| Squeezed Attention: Accelerating Long Context Length LLM Inference | Nov 14, 2024 | Code Generation, Large Language Model | Code Available | 2 |
| FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information | May 21, 2024 | Speech Recognition | Code Available | 2 |
| Adaptive Dual-domain Learning for Underwater Image Enhancement | Apr 27, 2025 | Image Enhancement, UIE | Code Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models | Oct 16, 2023 | Federated Learning, parameter-efficient fine-tuning | Code Available | 2 |
| Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHA | Mar 7, 2025 | All, Decoder | Code Available | 2 |
| Monocular Lane Detection Based on Deep Learning: A Survey | Nov 25, 2024 | 3D Lane Detection, Autonomous Driving | Code Available | 2 |
| Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Apr 4, 2025 | Domain Generalization, Mamba | Code Available | 2 |
| PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting | Aug 20, 2024 | Multivariate Time Series Forecasting, Temporal Sequences | Code Available | 2 |
| Diffusion Model Quantization: A Review | May 8, 2025 | model, Quantization | Code Available | 2 |
| CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Mar 22, 2022 | Decoder, Image Inpainting | Code Available | 2 |
| A Self-Supervised Descriptor for Image Copy Detection | Feb 21, 2022 | Contrastive Learning, Copy Detection | Code Available | 2 |
| CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics | Sep 13, 2023 | | Code Available | 2 |
| MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models | Oct 23, 2024 | | Code Available | 2 |
| ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond | Feb 21, 2022 | Image Classification, Inductive Bias | Code Available | 2 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Neural Discrete Representation Learning | Nov 2, 2017 | Decoder, Representation Learning | Code Available | 2 |