| MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | May 30, 2024 | Image AnimationVideo Generation | CodeCode Available | 4 | 5 |
| Generalizable Humanoid Manipulation with 3D Diffusion Policies | Oct 14, 2024 | Camera CalibrationPoint Cloud Segmentation | CodeCode Available | 4 | 5 |
| LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Sep 4, 2024 | Question AnsweringSentence | CodeCode Available | 4 | 5 |
| No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | Oct 31, 2024 | 3D ReconstructionGeneralizable Novel View Synthesis | CodeCode Available | 4 | 5 |
| Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models | Jul 30, 2023 | HallucinationPrompt Engineering | CodeCode Available | 4 | 5 |
| Multimodal Chain-of-Thought Reasoning in Language Models | Feb 2, 2023 | HallucinationLanguage Modelling | CodeCode Available | 4 | 5 |
| Efficient Automated Deep Learning for Time Series Forecasting | May 11, 2022 | AutoMLBayesian Optimization | CodeCode Available | 4 | 5 |
| SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM | Dec 4, 2023 | Camera Pose EstimationNovel View Synthesis | CodeCode Available | 4 | 5 |
| Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Feb 23, 2023 | Code CompletionComputer Security | CodeCode Available | 4 | 5 |
| Lean Workbook: A large-scale Lean problem set formalized from natural language math problems | Jun 6, 2024 | Automated Theorem ProvingMath | CodeCode Available | 4 | 5 |
| GeoCalib: Learning Single-image Calibration with Geometric Optimization | Sep 10, 2024 | 3D geometryVisual Localization | CodeCode Available | 4 | 5 |
| ManimML: Communicating Machine Learning Architectures with Animation | Jun 29, 2023 | | CodeCode Available | 4 | 5 |
| Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Jun 6, 2024 | Autonomous DrivingBench2Drive | CodeCode Available | 4 | 5 |
| TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Dec 30, 2024 | Audio GenerationGPU | CodeCode Available | 4 | 5 |
| SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models | Feb 13, 2025 | Question AnsweringRAG | CodeCode Available | 4 | 5 |
| Reasoning with Language Model is Planning with World Model | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Sep 17, 2024 | Conditional Image GenerationDepth Estimation | CodeCode Available | 4 | 5 |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | May 7, 2024 | BinarizationDeblurring | CodeCode Available | 4 | 5 |
| PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis | Sep 30, 2023 | GPU | CodeCode Available | 4 | 5 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot LearningGenerative Visual Question Answering | CodeCode Available | 4 | 5 |
| Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Apr 9, 2024 | | CodeCode Available | 4 | 5 |
| Prompt2Model: Generating Deployable Models from Natural Language Instructions | Aug 23, 2023 | Data-free Knowledge DistillationDataset Generation | CodeCode Available | 4 | 5 |
| Sequential Models in the Synthetic Data Vault | Jul 28, 2022 | Generative Adversarial Network | CodeCode Available | 4 | 5 |
| UniTS: A Unified Multi-Task Time Series Model | Feb 29, 2024 | Anomaly DetectionImputation | CodeCode Available | 4 | 5 |
| YuLan: An Open-source Large Language Model | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |