| Long-Context Autoregressive Video Modeling with Next-Frame Prediction | Mar 25, 2025 | Text Generation, Video Generation | Code Available | 3 |
| ID-Animator: Zero-Shot Identity-Preserving Human Video Generation | Apr 23, 2024 | Attribute, Video Generation | Code Available | 3 |
| Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion | Mar 20, 2024 | 3DGS, Novel View Synthesis | Code Available | 3 |
| Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow | Jun 12, 2023 | | Code Available | 3 |
| Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer | Jul 2, 2019 | Depth Estimation, Monocular Depth Estimation | Code Available | 3 |
| Consistency Models Made Easy | Jun 20, 2024 | Computational Efficiency, GPU | Code Available | 3 |
| Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Sep 6, 2024 | Experimental Design, scientific discovery | Code Available | 3 |
| UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Mar 22, 2024 | Diversity, Prediction | Code Available | 3 |
| Scalable Optimization in the Modular Norm | May 23, 2024 | | Code Available | 3 |
| SupeRANSAC: One RANSAC to Rule Them All | Jun 5, 2025 | All, Pose Estimation | Code Available | 3 |
| Wordflow: Social Prompt Engineering for Large Language Models | Jan 25, 2024 | Prompt Engineering | Code Available | 3 |
| HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Dec 2, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Jun 20, 2024 | Diversity, object-detection | Code Available | 3 |
| Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models | Jun 19, 2024 | Instruction Following | Code Available | 3 |
| Face Anonymization Made Simple | Nov 1, 2024 | Attribute, Face Anonymization | Code Available | 3 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactual, Model Editing | Code Available | 3 |
| OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network | Sep 10, 2022 | Continual Learning, Object | Code Available | 3 |
| DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | May 3, 2024 | Depth Estimation, Depth Prediction | Code Available | 3 |
| ImageInWords: Unlocking Hyper-Detailed Image Descriptions | May 5, 2024 | Image Generation, Specificity | Code Available | 3 |
| Flow Q-Learning | Feb 4, 2025 | Action Generation, D4RL | Code Available | 3 |
| MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs | Apr 1, 2025 | Knowledge Graphs, Mathematical Reasoning | Code Available | 3 |
| CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility | Mar 18, 2024 | Image Inpainting, Video Alignment | Code Available | 3 |
| Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition | Dec 12, 2024 | EgoSchema | Code Available | 3 |
| The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report | Apr 16, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 3 |
| LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding | Oct 22, 2024 | Token Reduction, Video Question Answering | Code Available | 3 |
| Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Jun 2, 2025 | 4k, Descriptive | Code Available | 3 |
| PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Feb 13, 2024 | Question Answering, Retrieval | Code Available | 3 |
| FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning | May 15, 2022 | Fairness, Semi-Supervised Image Classification | Code Available | 3 |
| Unlimited-Size Diffusion Restoration | Mar 1, 2023 | Image Generation, Image Restoration | Code Available | 3 |
| TorchSparse: Efficient Point Cloud Inference Engine | Apr 21, 2022 | Autonomous Driving | Code Available | 3 |
| Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Jul 18, 2023 | NeRF | Code Available | 3 |
| AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Jul 22, 2024 | Anomaly Detection, Language Modeling | Code Available | 3 |
| From Matching to Generation: A Survey on Generative Information Retrieval | Apr 23, 2024 | Incremental Learning, Information Retrieval | Code Available | 3 |
| SAM-Med2D | Aug 30, 2023 | Decoder, Image Segmentation | Code Available | 3 |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Feb 8, 2024 | Autonomous Driving, Language Modeling | Code Available | 3 |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Apr 8, 2024 | Image Generation, Image-to-Image Translation | Code Available | 3 |
| DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Mar 11, 2024 | Disentanglement | Code Available | 3 |
| GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Jun 10, 2024 | 3D Generation, NeRF | Code Available | 3 |
| Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details | Jun 19, 2025 | Texture Synthesis | Code Available | 3 |
| ResearchTown: Simulator of Human Research Community | Dec 23, 2024 | | Code Available | 3 |
| From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision | Dec 15, 2024 | Active Learning | Code Available | 3 |
| How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs | Jan 12, 2024 | | Code Available | 3 |
| LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion | Nov 4, 2023 | Benchmarking, Imitation Learning | Code Available | 3 |
| TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery | Feb 16, 2022 | BIG-bench Machine Learning, Drug Discovery | Code Available | 3 |
| MathArena: Evaluating LLMs on Uncontaminated Math Competitions | May 29, 2025 | Math, Mathematical Reasoning | Code Available | 3 |
| Frequency-aware Feature Fusion for Dense Image Prediction | Aug 23, 2024 | Prediction | Code Available | 3 |
| VoiceBench: Benchmarking LLM-Based Voice Assistants | Oct 22, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Mar 18, 2024 | 3D Generation, 3D Reconstruction | Code Available | 3 |
| MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents | Jan 24, 2025 | Benchmarking | Code Available | 3 |
| GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction | Mar 13, 2025 | Autonomous Driving, Surface Reconstruction | Code Available | 3 |