| TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane Networks | Mar 19, 2025 | 3D geometry | Code Available | 3 | 5 |
| CountGD: Multi-Modal Open-World Counting | Jul 5, 2024 | Object Counting, Open-vocabulary object counting | Code Available | 3 | 5 |
| AudioSR: Versatile Audio Super-resolution at Scale | Sep 13, 2023 | Audio Super-Resolution, Super-Resolution | Code Available | 3 | 5 |
| UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining | Apr 18, 2023 | | Code Available | 3 | 5 |
| Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes | Dec 2, 2024 | In-Context Learning, Video Segmentation | Code Available | 3 | 5 |
| CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains | May 23, 2023 | Text Generation | Code Available | 3 | 5 |
| GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents | Apr 14, 2025 | Vision-Language-Action | Code Available | 3 | 5 |
| Hierarchical Text-Conditional Image Generation with CLIP Latents | Apr 13, 2022 | Conditional Image Generation, Decoder | Code Available | 3 | 5 |
| Self-QA: Unsupervised Knowledge Guided Language Model Alignment | May 19, 2023 | Diversity, Language Modeling | Code Available | 3 | 5 |
| Self-Discover: Large Language Models Self-Compose Reasoning Structures | Feb 6, 2024 | Math | Code Available | 3 | 5 |
| Common Sense Reasoning for Deepfake Detection | Jan 31, 2024 | Binary Classification, Common Sense Reasoning | Code Available | 3 | 5 |
| Mosaic: An Architecture for Scalable & Interoperable Data Views | Oct 26, 2023 | | Code Available | 3 | 5 |
| The Unreasonable Ineffectiveness of the Deeper Layers | Mar 26, 2024 | GPU, Quantization | Code Available | 3 | 5 |
| White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? | Nov 22, 2023 | All, Data Compression | Code Available | 3 | 5 |
| Difference-in-Differences Estimation with Spatial Spillovers | May 8, 2021 | counterfactual | Code Available | 3 | 5 |
| Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code Generation, Language Modeling | Code Available | 3 | 5 |
| Scaling Instruction-Finetuned Language Models | Oct 20, 2022 | Coreference Resolution, Cross-Lingual Question Answering | Code Available | 3 | 5 |
| MagicDrive: Street View Generation with Diverse 3D Geometry Control | Oct 4, 2023 | 3D geometry, 3D Object Detection | Code Available | 3 | 5 |
| SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Mar 18, 2024 | 3D Generation, 3D Reconstruction | Code Available | 3 | 5 |
| A Survey on Causal Discovery Methods for I.I.D. and Time Series Data | Mar 27, 2023 | Causal Discovery, Time Series | Code Available | 3 | 5 |
| FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting | Jan 28, 2024 | Weather Forecasting | Code Available | 3 | 5 |
| The Forward-Forward Algorithm: Some Preliminary Investigations | Dec 27, 2022 | | Code Available | 3 | 5 |
| Benchmarking Automatic Machine Learning Frameworks | Aug 17, 2018 | Automated Feature Engineering, AutoML | Code Available | 3 | 5 |
| OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 18, 2024 | Denoising, Robot Manipulation | Code Available | 3 | 5 |
| INP-Former++: Advancing Universal Anomaly Detection via Intrinsic Normal Prototypes and Residual Learning | Jun 4, 2025 | Anomaly Detection, Medical Diagnosis | Code Available | 3 | 5 |
| Koopman-Based Surrogate Modelling of Turbulent Rayleigh-Bénard Convection | May 10, 2024 | | Code Available | 3 | 5 |
| Local motion phases for learning multi-contact character movements | Jun 1, 2020 | | Code Available | 3 | 5 |
| Traj-LIO: A Resilient Multi-LiDAR Multi-IMU State Estimator Through Sparse Gaussian Process | Feb 14, 2024 | | Code Available | 3 | 5 |
| On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy | Feb 1, 2024 | Neural Rendering | Code Available | 3 | 5 |
| T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Oct 8, 2024 | Video Alignment, Video Generation | Code Available | 3 | 5 |
| Transolver++: An Accurate Neural Solver for PDEs on Million-Scale Geometries | Feb 4, 2025 | GPU | Code Available | 3 | 5 |
| Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels | May 27, 2024 | 4D reconstruction | Code Available | 3 | 5 |
| Rectified Flow: A Marginal Preserving Approach to Optimal Transport | Sep 29, 2022 | valid | Code Available | 3 | 5 |
| TabArena: A Living Benchmark for Machine Learning on Tabular Data | Jun 20, 2025 | Benchmarking | Code Available | 3 | 5 |
| SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting | Jun 28, 2024 | 3DGS, 3D Reconstruction | Code Available | 3 | 5 |
| Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection | Apr 22, 2025 | Contrastive Learning, Fraud Detection | Code Available | 3 | 5 |
| ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization | Feb 4, 2025 | Quantization | Code Available | 3 | 5 |
| Large Language Model based Long-tail Query Rewriting in Taobao Search | Nov 7, 2023 | Contrastive Learning, Language Modeling | Code Available | 3 | 5 |
| SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models | Jun 1, 2025 | | Code Available | 3 | 5 |
| MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost | Dec 2, 2024 | Image Generation | Code Available | 3 | 5 |
| View Selection for 3D Captioning via Diffusion Ranking | Apr 11, 2024 | 3D Object Captioning, Hallucination | Code Available | 3 | 5 |
| Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates | Sep 27, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 | 5 |
| ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation | Feb 25, 2025 | Image Generation | Code Available | 3 | 5 |
| Lossless and Near-Lossless Compression for Foundation Models | Apr 5, 2024 | | Code Available | 3 | 5 |
| StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist | Dec 9, 2024 | | Code Available | 3 | 5 |
| CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification | Aug 30, 2024 | Brain Computer Interface, EEG | Code Available | 3 | 5 |
| Affordable AI Assistants with Knowledge Graph of Thoughts | Apr 3, 2025 | Knowledge Graphs, LLM real-life tasks | Code Available | 3 | 5 |
| The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark | Sep 26, 2024 | Anomaly Detection, Benchmarking | Code Available | 3 | 5 |
| DDT: Decoupled Diffusion Transformer | Apr 8, 2025 | Denoising, Image Generation | Code Available | 3 | 5 |