| Improving Dictionary Learning with Gated Sparse Autoencoders | Apr 24, 2024 | Dictionary Learning | CodeCode Available | 3 |
| Open3D: A Modern Library for 3D Data Processing | Jan 30, 2018 | Point Cloud Registration | CodeCode Available | 3 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Dec 12, 2024 | AttributeLarge Language Model | CodeCode Available | 3 |
| N-BEATS: Neural basis expansion analysis for interpretable time series forecasting | May 24, 2019 | Time SeriesTime Series Analysis | CodeCode Available | 3 |
| Mip-Splatting: Alias-free 3D Gaussian Splatting | Nov 27, 2023 | Novel View Synthesis | CodeCode Available | 3 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive LearningImage Reconstruction | CodeCode Available | 3 |
| A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation | Jan 22, 2024 | BenchmarkingDiagnostic | CodeCode Available | 3 |
| Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Mar 5, 2024 | Image Generation | CodeCode Available | 3 |
| MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | Dec 19, 2024 | Image RetrievalRetrieval | CodeCode Available | 3 |
| BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond | Dec 3, 2020 | Super-ResolutionVideo deraining | CodeCode Available | 3 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Aug 23, 2023 | counterfactualQuestion Answering | CodeCode Available | 3 |
| WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Dec 12, 2023 | 3D Human Pose Estimation | CodeCode Available | 3 |
| Block-NeRF: Scalable Large Scene Neural View Synthesis | Feb 10, 2022 | NeRF | CodeCode Available | 3 |
| SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound | Apr 30, 2024 | DecoderLanguage Modelling | CodeCode Available | 3 |
| Vision Transformers for Dense Prediction | Mar 24, 2021 | DecoderDepth Estimation | CodeCode Available | 3 |
| RepViT: Revisiting Mobile CNN From ViT Perspective | Jul 18, 2023 | | CodeCode Available | 3 |
| MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model | Nov 27, 2023 | Image Animation | CodeCode Available | 3 |
| CRAG -- Comprehensive RAG Benchmark | Jun 7, 2024 | HallucinationLanguage Modelling | CodeCode Available | 3 |
| Major TOM: Expandable Datasets for Earth Observation | Feb 19, 2024 | Earth Observation | CodeCode Available | 3 |
| Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction | Apr 24, 2023 | Drug DiscoveryModel Selection | CodeCode Available | 3 |
| Optimal Variable Speed Limit Control Strategy on Freeway Segments under Fog Conditions | Jul 30, 2021 | | CodeCode Available | 3 |
| Towards General-purpose Infrastructure for Protecting Scientific Data Under Study | Oct 4, 2021 | | CodeCode Available | 3 |
| L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Mar 6, 2025 | | CodeCode Available | 3 |
| Genie: Generative Interactive Environments | Feb 23, 2024 | | CodeCode Available | 3 |
| Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 3 |
| Efficiently Serving LLM Reasoning Programs with Certaindex | Dec 30, 2024 | Code GenerationMathematical Problem-Solving | CodeCode Available | 3 |
| SPO: Sequential Monte Carlo Policy Optimisation | Feb 12, 2024 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 3 |
| AgentStudio: A Toolkit for Building General Virtual Agents | Mar 26, 2024 | Visual Grounding | CodeCode Available | 3 |
| Is Value Learning Really the Main Bottleneck in Offline RL? | Jun 13, 2024 | Imitation LearningOffline RL | CodeCode Available | 3 |
| DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy | Sep 27, 2024 | Financial Analysis | CodeCode Available | 3 |
| Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Aug 7, 2024 | 3DGSModel Compression | CodeCode Available | 3 |
| MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Nov 25, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 3 |
| Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 | Aug 9, 2024 | All | CodeCode Available | 3 |
| DPLM-2: A Multimodal Diffusion Protein Language Model | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms | Mar 13, 2022 | Evolutionary Algorithms | CodeCode Available | 3 |
| The False Promise of Imitating Proprietary LLMs | May 25, 2023 | Language Modelling | CodeCode Available | 3 |
| Visual Geometry Grounded Deep Structure From Motion | Dec 7, 2023 | Point Tracking | CodeCode Available | 3 |
| A Foundation Model for the Earth System | May 20, 2024 | Computational EfficiencyDeep Learning | CodeCode Available | 3 |
| DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning | Jun 14, 2024 | Offline RL | CodeCode Available | 3 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI AgentLanguage Modeling | CodeCode Available | 3 |
| Improving Text Embeddings with Large Language Models | Dec 31, 2023 | DecoderDiversity | CodeCode Available | 3 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Aug 29, 2017 | BIG-bench Machine LearningCPU | CodeCode Available | 3 |
| Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Oct 3, 2024 | | CodeCode Available | 3 |
| RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control | May 27, 2024 | | CodeCode Available | 3 |
| Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Dec 18, 2024 | Representation LearningRobot Manipulation | CodeCode Available | 3 |
| RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Mar 8, 2024 | Code GenerationHallucination | CodeCode Available | 3 |
| Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders | Jul 19, 2024 | | CodeCode Available | 3 |
| DataDecide: How to Predict Best Pretraining Data with Small Experiments | Apr 15, 2025 | ARCHellaSwag | CodeCode Available | 3 |
| The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry | Feb 6, 2024 | | CodeCode Available | 3 |
| Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection | Jun 8, 2020 | Dense Object DetectionGeneral Classification | CodeCode Available | 3 |