| TextBox: A Unified, Modularized, and Extensible Framework for Text Generation | Jan 6, 2021 | Text Generation | Code Available | 2 | 5 |
| Generative Image as Action Models | Jul 10, 2024 | Image Generation, Robot Manipulation | Code Available | 2 | 5 |
| DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion | Oct 6, 2024 | DeepFake Detection, Domain Generalization | Code Available | 2 | 5 |
| Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering | Sep 29, 2023 | Image to text, Passage Retrieval | Code Available | 2 | 5 |
| DataComp: In search of the next generation of multimodal datasets | Apr 27, 2023 | | Code Available | 2 | 5 |
| Do We Need Domain-Specific Embedding Models? An Empirical Investigation | Sep 27, 2024 | | Code Available | 2 | 5 |
| MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation | May 31, 2022 | BIG-bench Machine Learning, counterfactual | Code Available | 2 | 5 |
| EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training | Mar 17, 2022 | Chatbot | Code Available | 2 | 5 |
| EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Nov 21, 2024 | 3D Reconstruction, Object | Code Available | 2 | 5 |
| P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks | Oct 14, 2021 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| MAIRA-2: Grounded Radiology Report Generation | Jun 6, 2024 | Text Generation | Code Available | 2 | 5 |
| ADAPT: Action-aware Driving Caption Transformer | Feb 1, 2023 | Autonomous Driving, Decision Making | Code Available | 2 | 5 |
| PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning | Nov 21, 2022 | 3D Classification, 3D Object Detection | Code Available | 2 | 5 |
| Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators | Nov 27, 2024 | GPU | Code Available | 2 | 5 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Contrastive language and vision learning of general fashion concepts | Apr 8, 2022 | Contrastive Learning, Retrieval | Code Available | 2 | 5 |
| Macro Graph Neural Networks for Online Billion-Scale Recommender Systems | Jan 26, 2024 | Recommendation Systems | Code Available | 2 | 5 |
| Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Jun 17, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation | Jun 20, 2023 | Autonomous Driving, Computational Efficiency | Code Available | 2 | 5 |
| Programming Refusal with Conditional Activation Steering | Sep 6, 2024 | | Code Available | 2 | 5 |
| MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices | Jan 1, 2023 | Efficient Neural Network, Image Inpainting | Code Available | 2 | 5 |
| A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency | May 3, 2025 | | Code Available | 2 | 5 |
| Granite Guardian | Dec 10, 2024 | Hallucination, Language Modeling | Code Available | 2 | 5 |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Jun 29, 2023 | Synthetic Data Generation | Code Available | 2 | 5 |
| High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model | Feb 27, 2025 | Portrait Animation | Code Available | 2 | 5 |
| Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster | Nov 14, 2023 | GPU, Position | Code Available | 2 | 5 |
| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Jan 30, 2024 | Data Compression, Language Modelling | Code Available | 2 | 5 |
| Multi-Task Dense Prediction via Mixture of Low-Rank Experts | Mar 26, 2024 | Decoder, Mixture-of-Experts | Code Available | 2 | 5 |
| Three scenarios for continual learning | Apr 15, 2019 | class-incremental learning, Class Incremental Learning | Code Available | 2 | 5 |
| Distributional Gradient Boosting Machines | Apr 2, 2022 | Prediction Intervals, regression | Code Available | 2 | 5 |
| Practical tradeoffs between memory, compute, and performance in learned optimizers | Mar 22, 2022 | | Code Available | 2 | 5 |
| AnalogCoder: Analog Circuit Design via Training-Free Code Generation | May 23, 2024 | Code Generation | Code Available | 2 | 5 |
| RouteFinder: Towards Foundation Models for Vehicle Routing Problems | Jun 21, 2024 | Attribute, Multi-Task Learning | Code Available | 2 | 5 |
| MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization | Jan 28, 2023 | Hallucination, Multiple-choice | Code Available | 2 | 5 |
| Higher Layers Need More LoRA Experts | Feb 13, 2024 | Mixture-of-Experts | Code Available | 2 | 5 |
| FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing | Sep 13, 2024 | | Code Available | 2 | 5 |
| PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs | Jun 15, 2023 | Benchmarking | Code Available | 2 | 5 |
| Dataset Quantization | Aug 21, 2023 | Dataset Distillation, object-detection | Code Available | 2 | 5 |
| Frouros: A Python library for drift detection in machine learning systems | Aug 14, 2022 | Drift Detection | Code Available | 2 | 5 |
| Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets | Oct 6, 2023 | | Code Available | 2 | 5 |
| Training Language Models to Reason Efficiently | Feb 6, 2025 | Reinforcement Learning (RL) | Code Available | 2 | 5 |
| Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs | Apr 20, 2023 | NeRF, Novel View Synthesis | Code Available | 2 | 5 |
| DOT: A Distillation-Oriented Trainer | Jul 17, 2023 | Knowledge Distillation | Code Available | 2 | 5 |
| Image Super-Resolution Using Very Deep Residual Channel Attention Networks | Jul 8, 2018 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPU, Language Modeling | Code Available | 2 | 5 |
| FABLES: Evaluating faithfulness and content selection in book-length summarization | Apr 1, 2024 | Long-Context Understanding | Code Available | 2 | 5 |
| CMax-SLAM: Event-based Rotational-Motion Bundle Adjustment and SLAM System using Contrast Maximization | Mar 12, 2024 | Motion Estimation | Code Available | 2 | 5 |
| BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection | Jun 13, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers | Mar 18, 2022 | Camera Pose Estimation, NeRF | Code Available | 2 | 5 |
| OpenFE: Automated Feature Generation with Expert-level Performance | Nov 22, 2022 | Feature Importance | Code Available | 2 | 5 |