| Is Value Learning Really the Main Bottleneck in Offline RL? | Jun 13, 2024 | Imitation LearningOffline RL | CodeCode Available | 3 |
| DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy | Sep 27, 2024 | Financial Analysis | CodeCode Available | 3 |
| Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Aug 7, 2024 | 3DGSModel Compression | CodeCode Available | 3 |
| MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Nov 25, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 3 |
| Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 | Aug 9, 2024 | All | CodeCode Available | 3 |
| DPLM-2: A Multimodal Diffusion Protein Language Model | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms | Mar 13, 2022 | Evolutionary Algorithms | CodeCode Available | 3 |
| The False Promise of Imitating Proprietary LLMs | May 25, 2023 | Language Modelling | CodeCode Available | 3 |
| Visual Geometry Grounded Deep Structure From Motion | Dec 7, 2023 | Point Tracking | CodeCode Available | 3 |
| A Foundation Model for the Earth System | May 20, 2024 | Computational EfficiencyDeep Learning | CodeCode Available | 3 |
| DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning | Jun 14, 2024 | Offline RL | CodeCode Available | 3 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI AgentLanguage Modeling | CodeCode Available | 3 |
| Improving Text Embeddings with Large Language Models | Dec 31, 2023 | DecoderDiversity | CodeCode Available | 3 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Aug 29, 2017 | BIG-bench Machine LearningCPU | CodeCode Available | 3 |
| Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Oct 3, 2024 | | CodeCode Available | 3 |
| RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control | May 27, 2024 | | CodeCode Available | 3 |
| Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Dec 18, 2024 | Representation LearningRobot Manipulation | CodeCode Available | 3 |
| RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Mar 8, 2024 | Code GenerationHallucination | CodeCode Available | 3 |
| Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders | Jul 19, 2024 | | CodeCode Available | 3 |
| DataDecide: How to Predict Best Pretraining Data with Small Experiments | Apr 15, 2025 | ARCHellaSwag | CodeCode Available | 3 |
| The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry | Feb 6, 2024 | | CodeCode Available | 3 |
| UCF: Uncovering Common Features for Generalizable Deepfake Detection | Apr 27, 2023 | Binary ClassificationDecoder | CodeCode Available | 3 |
| Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection | Mar 19, 2024 | Anomaly DetectionBenchmarking | CodeCode Available | 3 |
| REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers | Apr 15, 2025 | Image Generation | CodeCode Available | 3 |
| C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction Sets | Oct 12, 2024 | Conformal PredictionPrediction | CodeCode Available | 3 |
| Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | May 16, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification | Mar 13, 2022 | Audio ClassificationKnowledge Distillation | CodeCode Available | 3 |
| Modular Duality in Deep Learning | Oct 28, 2024 | Deep LearningGPU | CodeCode Available | 3 |
| Distributed Prioritized Experience Replay | Mar 2, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| PromptHMR: Promptable Human Mesh Recovery | Apr 8, 2025 | 3D Human Pose EstimationHuman Mesh Recovery | CodeCode Available | 3 |
| Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Nov 26, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| U-Net: Convolutional Networks for Biomedical Image Segmentation | May 18, 2015 | Cell SegmentationCell Tracking | CodeCode Available | 3 |
| History-Guided Video Diffusion | Feb 10, 2025 | Video Generation | CodeCode Available | 3 |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Apr 25, 2024 | GPU | CodeCode Available | 3 |
| Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval | Feb 17, 2025 | Information RetrievalRetrieval | CodeCode Available | 3 |
| Probabilistic Volumetric Fusion for Dense Monocular SLAM | Oct 3, 2022 | | CodeCode Available | 3 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine TranslationSegmentation | CodeCode Available | 3 |
| Discovered Policy Optimisation | Oct 11, 2022 | IngenuityMeta-Learning | CodeCode Available | 3 |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | May 13, 2024 | Data AugmentationGSM8K | CodeCode Available | 3 |
| On Distillation of Guided Diffusion Models | Oct 6, 2022 | DenoisingImage Generation | CodeCode Available | 3 |
| SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Aug 26, 2024 | | CodeCode Available | 3 |
| SoundStream: An End-to-End Neural Audio Codec | Jul 7, 2021 | CPUDecoder | CodeCode Available | 3 |
| Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Feb 2, 2025 | Multi-Task Learning | CodeCode Available | 3 |
| On the Content Bias in Fréchet Video Distance | Apr 18, 2024 | Video Generation | CodeCode Available | 3 |
| Flow Matching for Generative Modeling | Oct 6, 2022 | Density EstimationImage Generation | CodeCode Available | 3 |
| W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training | Aug 7, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 3 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 16, 2024 | DenoisingRobot Manipulation | CodeCode Available | 3 |
| Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Jun 6, 2024 | 3D Generation | CodeCode Available | 3 |
| SkyMath: Technical Report | Oct 25, 2023 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters | May 19, 2023 | | CodeCode Available | 3 |