| Rethinking LayerNorm in Image Restoration Transformers | Apr 9, 2025 | Image Restoration | Code Available | 2 |
| On Aliased Resizing and Surprising Subtleties in GAN Evaluation | Apr 22, 2021 | Image Generation | Code Available | 2 |
| DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism | May 6, 2021 | Generative Adversarial Network, Singing Voice Synthesis | Code Available | 2 |
| High-performance symbolic-numerics via multiple dispatch | May 9, 2021 | CPU, Vocal Bursts Intensity Prediction | Code Available | 2 |
| Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | May 22, 2024 | Informativeness, Language Modeling | Code Available | 2 |
| Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction | May 20, 2021 | Relation, Relation Extraction | Code Available | 2 |
| CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks | May 25, 2021 | BIG-bench Machine Learning, Code Classification | Code Available | 2 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| YOLO5Face: Why Reinventing a Face Detector | May 27, 2021 | Face Detection, object-detection | Code Available | 2 |
| Efficient and Accurate Gradients for Neural SDEs | May 27, 2021 | | Code Available | 2 |
| Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Jun 4, 2021 | counterfactual, Deep Reinforcement Learning | Code Available | 2 |
| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | Object, Segmentation | Code Available | 2 |
| Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Jul 29, 2024 | Contrastive Learning, DeepFake Detection | Code Available | 2 |
| Graph Transformer Networks: Learning Meta-path Graphs to Improve GNNs | Jun 11, 2021 | Node Classification | Code Available | 2 |
| Graph Neural Networks for Natural Language Processing: A Survey | Jun 10, 2021 | Decoder, graph construction | Code Available | 2 |
| LumiGauss: Relightable Gaussian Splatting in the Wild | Aug 6, 2024 | 3D Reconstruction, NeRF | Code Available | 2 |
| A Supervised Learning Approach to Rankability | Mar 14, 2022 | | Code Available | 2 |
| Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images | Jun 23, 2021 | Autonomous Driving, Decision Making | Code Available | 2 |
| Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting | Jun 24, 2021 | Time Series, Time Series Analysis | Code Available | 2 |
| FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Aug 20, 2024 | Navigate, Vision and Language Navigation | Code Available | 2 |
| Task-Specific Directions: Definition, Exploration, and Utilization in Parameter Efficient Fine-Tuning | Sep 2, 2024 | parameter-efficient fine-tuning | Code Available | 2 |
| EduNLP: Towards a Unified and Modularized Library for Educational Resources | Jun 3, 2024 | | Code Available | 2 |
| Conditional GANs with Auxiliary Discriminative Classifier | Jul 21, 2021 | Conditional Image Generation, Diversity | Code Available | 2 |
| QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits | Jul 22, 2021 | | Code Available | 2 |
| TorchXRayVision: A library of chest X-ray datasets and models | Oct 31, 2021 | Image Classification, Medical Image Retrieval | Code Available | 2 |
| Open-World Entity Segmentation | Jul 29, 2021 | Image Manipulation, Image Segmentation | Code Available | 2 |
| Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers | Aug 16, 2021 | Decoder, Medical Image Segmentation | Code Available | 2 |
| Fast and Flexible Human Pose Estimation with HyperPose | Aug 26, 2021 | Pose Estimation | Code Available | 2 |
| CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow | Nov 18, 2022 | Optical Flow Estimation, Position | Code Available | 2 |
| Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification | Sep 21, 2021 | BIG-bench Machine Learning, Uncertainty Quantification | Code Available | 2 |
| dattri: A Library for Efficient Data Attribution | Oct 6, 2024 | Benchmarking | Code Available | 2 |
| Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer | Oct 20, 2021 | Formality Style Transfer, Style Transfer | Code Available | 2 |
| NeRF-Supervised Deep Stereo | Mar 30, 2023 | NeRF, Neural Rendering | Code Available | 2 |
| Let Your Graph Do the Talking: Encoding Structured Data for LLMs | Feb 8, 2024 | | Code Available | 2 |
| Self-Reflection in LLM Agents: Effects on Problem-Solving Performance | May 5, 2024 | Multiple-choice | Code Available | 2 |
| Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI | Jul 31, 2022 | | Code Available | 2 |
| EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models | Oct 30, 2024 | Deblurring, Ensemble Learning | Code Available | 2 |
| ThermoNeRF: Joint RGB and Thermal Novel View Synthesis for Building Facades using Multimodal Neural Radiance Fields | Mar 18, 2024 | 3D geometry, Image Generation | Code Available | 2 |
| ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | May 29, 2025 | Spatial Reasoning | Code Available | 2 |
| SSAST: Self-Supervised Audio Spectrogram Transformer | Oct 19, 2021 | Audio Classification, Classification | Code Available | 2 |
| QuantumNAT: Quantum Noise-Aware Training with Noise Injection, Quantization and Normalization | Oct 21, 2021 | Denoising, Quantization | Code Available | 2 |
| Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs | Jun 12, 2025 | Philosophy, Prompt Engineering | Code Available | 2 |
| Efficient Modulation for Vision Networks | Mar 29, 2024 | GPU | Code Available | 2 |
| Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models | May 24, 2023 | Chatbot, Natural Language Understanding | Code Available | 2 |
| TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| SE(3)-Transformers: 3D Roto-Translation Equivariant Attention Networks | Jun 18, 2020 | Translation | Code Available | 2 |
| gCastle: A Python Toolbox for Causal Discovery | Nov 30, 2021 | Causal Discovery, GPU | Code Available | 2 |
| ICON: Implicit Clothed humans Obtained from Normals | Dec 16, 2021 | 3D Human Pose Estimation, 3D Human Reconstruction | Code Available | 2 |
| Memory-assisted prompt editing to improve GPT-3 after deployment | Jan 16, 2022 | | Code Available | 2 |
| Watanabe's expansion: A Solution for the convexity conundrum | Apr 1, 2024 | | Code Available | 2 |