| PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos | Mar 23, 2025 | 4D reconstructionDeformable Object Manipulation | CodeCode Available | 3 | 5 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Apr 16, 2024 | Feature EngineeringLanguage Modeling | CodeCode Available | 3 | 5 |
| Detecting hallucinations in large language models using semantic entropy | Jun 19, 2024 | Large Language ModelQuestion Answering | CodeCode Available | 3 | 5 |
| The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio | Oct 16, 2024 | Hallucination | CodeCode Available | 3 | 5 |
| BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking | Apr 12, 2024 | Motion CompensationMulti-Object Tracking | CodeCode Available | 3 | 5 |
| SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling | Dec 23, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 3 | 5 |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360^ | Mar 23, 2023 | Image GenerationImage Segmentation | CodeCode Available | 3 | 5 |
| VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning | Apr 9, 2025 | MVBenchObject Tracking | CodeCode Available | 3 | 5 |
| Improving Dictionary Learning with Gated Sparse Autoencoders | Apr 24, 2024 | Dictionary Learning | CodeCode Available | 3 | 5 |
| Open3D: A Modern Library for 3D Data Processing | Jan 30, 2018 | Point Cloud Registration | CodeCode Available | 3 | 5 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Dec 12, 2024 | AttributeLarge Language Model | CodeCode Available | 3 | 5 |
| N-BEATS: Neural basis expansion analysis for interpretable time series forecasting | May 24, 2019 | Time SeriesTime Series Analysis | CodeCode Available | 3 | 5 |
| Mip-Splatting: Alias-free 3D Gaussian Splatting | Nov 27, 2023 | Novel View Synthesis | CodeCode Available | 3 | 5 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive LearningImage Reconstruction | CodeCode Available | 3 | 5 |
| A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation | Jan 22, 2024 | BenchmarkingDiagnostic | CodeCode Available | 3 | 5 |
| Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Mar 5, 2024 | Image Generation | CodeCode Available | 3 | 5 |
| MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | Dec 19, 2024 | Image RetrievalRetrieval | CodeCode Available | 3 | 5 |
| BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond | Dec 3, 2020 | Super-ResolutionVideo deraining | CodeCode Available | 3 | 5 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Aug 23, 2023 | counterfactualQuestion Answering | CodeCode Available | 3 | 5 |
| WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Dec 12, 2023 | 3D Human Pose Estimation | CodeCode Available | 3 | 5 |
| Block-NeRF: Scalable Large Scene Neural View Synthesis | Feb 10, 2022 | NeRF | CodeCode Available | 3 | 5 |
| SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound | Apr 30, 2024 | DecoderLanguage Modelling | CodeCode Available | 3 | 5 |
| Vision Transformers for Dense Prediction | Mar 24, 2021 | DecoderDepth Estimation | CodeCode Available | 3 | 5 |
| RepViT: Revisiting Mobile CNN From ViT Perspective | Jul 18, 2023 | | CodeCode Available | 3 | 5 |
| MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model | Nov 27, 2023 | Image Animation | CodeCode Available | 3 | 5 |
| CRAG -- Comprehensive RAG Benchmark | Jun 7, 2024 | HallucinationLanguage Modelling | CodeCode Available | 3 | 5 |
| Major TOM: Expandable Datasets for Earth Observation | Feb 19, 2024 | Earth Observation | CodeCode Available | 3 | 5 |
| Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction | Apr 24, 2023 | Drug DiscoveryModel Selection | CodeCode Available | 3 | 5 |
| Optimal Variable Speed Limit Control Strategy on Freeway Segments under Fog Conditions | Jul 30, 2021 | | CodeCode Available | 3 | 5 |
| Towards General-purpose Infrastructure for Protecting Scientific Data Under Study | Oct 4, 2021 | | CodeCode Available | 3 | 5 |
| L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Mar 6, 2025 | | CodeCode Available | 3 | 5 |
| Genie: Generative Interactive Environments | Feb 23, 2024 | | CodeCode Available | 3 | 5 |
| Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 3 | 5 |
| Efficiently Serving LLM Reasoning Programs with Certaindex | Dec 30, 2024 | Code GenerationMathematical Problem-Solving | CodeCode Available | 3 | 5 |
| SPO: Sequential Monte Carlo Policy Optimisation | Feb 12, 2024 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 3 | 5 |
| AgentStudio: A Toolkit for Building General Virtual Agents | Mar 26, 2024 | Visual Grounding | CodeCode Available | 3 | 5 |
| Is Value Learning Really the Main Bottleneck in Offline RL? | Jun 13, 2024 | Imitation LearningOffline RL | CodeCode Available | 3 | 5 |
| DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy | Sep 27, 2024 | Financial Analysis | CodeCode Available | 3 | 5 |
| Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Aug 7, 2024 | 3DGSModel Compression | CodeCode Available | 3 | 5 |
| MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Nov 25, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 3 | 5 |
| Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 | Aug 9, 2024 | All | CodeCode Available | 3 | 5 |
| DPLM-2: A Multimodal Diffusion Protein Language Model | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms | Mar 13, 2022 | Evolutionary Algorithms | CodeCode Available | 3 | 5 |
| The False Promise of Imitating Proprietary LLMs | May 25, 2023 | Language Modelling | CodeCode Available | 3 | 5 |
| Visual Geometry Grounded Deep Structure From Motion | Dec 7, 2023 | Point Tracking | CodeCode Available | 3 | 5 |
| A Foundation Model for the Earth System | May 20, 2024 | Computational EfficiencyDeep Learning | CodeCode Available | 3 | 5 |
| DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning | Jun 14, 2024 | Offline RL | CodeCode Available | 3 | 5 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI AgentLanguage Modeling | CodeCode Available | 3 | 5 |
| Improving Text Embeddings with Large Language Models | Dec 31, 2023 | DecoderDiversity | CodeCode Available | 3 | 5 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Aug 29, 2017 | BIG-bench Machine LearningCPU | CodeCode Available | 3 | 5 |