| Deep-TEMPEST: Using Deep Learning to Eavesdrop on HDMI from its Unintended Electromagnetic Emanations | Jul 12, 2024 | | CodeCode Available | 4 | 5 |
| Adversarial Diffusion Compression for Real-World Image Super-Resolution | Nov 20, 2024 | DecoderDenoising | CodeCode Available | 4 | 5 |
| Learning Multiple Stock Trading Patterns with Temporal Routing Adaptor and Optimal Transport | Jun 24, 2021 | Stock Prediction | CodeCode Available | 4 | 5 |
| Mathematical Supplement for the gsplat Library | Dec 4, 2023 | | CodeCode Available | 4 | 5 |
| Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures | Mar 4, 2024 | image-classificationImage Classification | CodeCode Available | 4 | 5 |
| OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models | Aug 2, 2023 | Visual Question AnsweringVisual Question Answering (VQA) | CodeCode Available | 4 | 5 |
| Desiderata for next generation of ML model serving | Oct 26, 2022 | modelPosition | CodeCode Available | 4 | 5 |
| Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Mar 26, 2024 | NeRFNeural Rendering | CodeCode Available | 4 | 5 |
| MARS: Unleashing the Power of Variance Reduction for Training Large Models | Nov 15, 2024 | Stochastic Optimization | CodeCode Available | 4 | 5 |
| Trackastra: Transformer-based cell tracking for live-cell microscopy | May 24, 2024 | Cell TrackingMultiple Object Tracking | CodeCode Available | 4 | 5 |
| The Whole Is Greater than the Sum of Its Parts: Improving Music Source Separation by Bridging Network | May 13, 2023 | Music Source Separation | CodeCode Available | 4 | 5 |
| Hallucination of Multimodal Large Language Models: A Survey | Apr 29, 2024 | HallucinationSurvey | CodeCode Available | 4 | 5 |
| Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI | May 29, 2024 | EEGElectroencephalogram (EEG) | CodeCode Available | 4 | 5 |
| Pseudo-Simulation for Autonomous Driving | Jun 4, 2025 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 4 | 5 |
| UniK3D: Universal Camera Monocular 3D Estimation | Mar 20, 2025 | 3D ReconstructionDisentanglement | CodeCode Available | 4 | 5 |
| TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks | Jun 27, 2024 | Feature EngineeringModel Selection | CodeCode Available | 4 | 5 |
| InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | Nov 10, 2022 | 2D Object DetectionClassification | CodeCode Available | 4 | 5 |
| DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Oct 14, 2024 | GPUQuantization | CodeCode Available | 4 | 5 |
| State Space Model for New-Generation Network Alternative to Transformers: A Survey | Apr 15, 2024 | | CodeCode Available | 4 | 5 |
| Predicting Subjective Features of Questions of QA Websites using BERT | Feb 24, 2020 | Community Question AnsweringQuestion Answering | CodeCode Available | 4 | 5 |
| Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Jun 13, 2023 | Information RetrievalRepresentation Learning | CodeCode Available | 4 | 5 |
| FuseChat: Knowledge Fusion of Chat Models | Aug 15, 2024 | Instruction Following | CodeCode Available | 4 | 5 |
| QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | May 7, 2024 | GPULanguage Modelling | CodeCode Available | 4 | 5 |
| Vidur: A Large-Scale Simulation Framework For LLM Inference | May 8, 2024 | CPUGPU | CodeCode Available | 4 | 5 |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models | Jan 20, 2025 | | CodeCode Available | 4 | 5 |
| VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Dec 31, 2024 | Memorization | CodeCode Available | 4 | 5 |
| Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe | Sep 12, 2022 | Autonomous Driving | CodeCode Available | 4 | 5 |
| RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Jun 29, 2023 | Combinatorial OptimizationComputational Efficiency | CodeCode Available | 4 | 5 |
| GLIPv2: Unifying Localization and Vision-Language Understanding | Jun 12, 2022 | 2D Object DetectionContrastive Learning | CodeCode Available | 4 | 5 |
| Cube: A Roblox View of 3D Intelligence | Mar 19, 2025 | Scene GenerationText Generation | CodeCode Available | 4 | 5 |
| Open-Set Image Tagging with Multi-Grained Text Supervision | Oct 23, 2023 | Human-Object Interaction DetectionOpen Set Learning | CodeCode Available | 4 | 5 |
| DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation | Oct 14, 2022 | Natural Language UnderstandingText Generation | CodeCode Available | 4 | 5 |
| VILA: On Pre-training for Visual Language Models | Dec 12, 2023 | In-Context LearningLanguage Modelling | CodeCode Available | 4 | 5 |
| ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing | Sep 17, 2023 | Model SelectionPrompt Engineering | CodeCode Available | 4 | 5 |
| Streaming 4D Visual Geometry Transformer | Jul 15, 2025 | 4D reconstructionPhilosophy | CodeCode Available | 4 | 5 |
| Skywork Open Reasoner 1 Technical Report | May 28, 2025 | MathReinforcement Learning (RL) | CodeCode Available | 4 | 5 |
| Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Apr 15, 2024 | Image GenerationImage Restoration | CodeCode Available | 4 | 5 |
| Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning | Jun 17, 2024 | Emotion RecognitionMultimodal Emotion Recognition | CodeCode Available | 4 | 5 |
| Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation | Feb 7, 2024 | Cardiac SegmentationComputational Efficiency | CodeCode Available | 4 | 5 |
| MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Oct 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | Oct 10, 2023 | | CodeCode Available | 4 | 5 |
| XGBoost: Scalable GPU Accelerated Learning | Jun 29, 2018 | Cloud ComputingData Compression | CodeCode Available | 4 | 5 |
| DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Feb 29, 2024 | GPU | CodeCode Available | 4 | 5 |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 | 5 |
| RTMDet: An Empirical Study of Designing Real-Time Object Detectors | Dec 14, 2022 | GPUInstance Segmentation | CodeCode Available | 4 | 5 |
| 3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers | Oct 11, 2023 | DecoderImage Segmentation | CodeCode Available | 4 | 5 |
| DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing | Mar 26, 2024 | 3D ReconstructionDepth Estimation | CodeCode Available | 4 | 5 |
| DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior | Aug 29, 2023 | Blind Face RestorationDenoising | CodeCode Available | 4 | 5 |
| VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation | Mar 15, 2023 | Code GenerationDenoising | CodeCode Available | 4 | 5 |