| Unicorn: Text-Only Data Synthesis for Vision Language Model Training | Mar 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Tactile-Augmented Radiance Fields | May 7, 2024 | | CodeCode Available | 2 | 5 |
| DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition | Mar 25, 2024 | Action RecognitionSkeleton Based Action Recognition | CodeCode Available | 2 | 5 |
| Polis: Scaling Deliberation by Mapping High Dimensional Opinion Spaces | Jul 22, 2021 | Data VisualizationDecision Making | CodeCode Available | 2 | 5 |
| Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward | Apr 1, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 | 5 |
| SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding | Nov 28, 2022 | Time SeriesTime Series Analysis | CodeCode Available | 2 | 5 |
| Dense Text Retrieval based on Pretrained Language Models: A Survey | Nov 27, 2022 | RetrievalSurvey | CodeCode Available | 2 | 5 |
| DynIBaR: Neural Dynamic Image-Based Rendering | Nov 20, 2022 | Dynamic Reconstruction | CodeCode Available | 2 | 5 |
| V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer | Jan 9, 2025 | | CodeCode Available | 2 | 5 |
| Iterative Geometry Encoding Volume for Stereo Matching | Mar 12, 2023 | Omnnidirectional Stereo Depth EstimationStereo Matching | CodeCode Available | 2 | 5 |
| FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition | May 22, 2024 | Image Generation | CodeCode Available | 2 | 5 |
| Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline | Jan 29, 2023 | Data AugmentationLightweight Deployment | CodeCode Available | 2 | 5 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | AttributeFace Swapping | CodeCode Available | 2 | 5 |
| Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jul 3, 2024 | 3DGS3D Reconstruction | CodeCode Available | 2 | 5 |
| From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation | Apr 23, 2024 | Image Generation | CodeCode Available | 2 | 5 |
| E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL | Sep 25, 2024 | Natural Language QueriesText to SQL | CodeCode Available | 2 | 5 |
| CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Mar 28, 2025 | GPUGSM8K | CodeCode Available | 2 | 5 |
| Diffusion Predictive Control with Constraints | Dec 12, 2024 | Denoising | CodeCode Available | 2 | 5 |
| HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation | Nov 30, 2020 | 3D human pose and shape estimation3D Human Pose Estimation | CodeCode Available | 2 | 5 |
| OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Dec 2, 2024 | Audio SynthesisImage Generation | CodeCode Available | 2 | 5 |
| DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection | Dec 11, 2023 | Anomaly DetectionDenoising | CodeCode Available | 2 | 5 |
| AI-Generated Video Detection via Spatio-Temporal Anomaly Learning | Mar 25, 2024 | Optical Flow Estimation | CodeCode Available | 2 | 5 |
| DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? | Sep 12, 2024 | | CodeCode Available | 2 | 5 |
| BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis | Feb 28, 2023 | Novel View Synthesis | CodeCode Available | 2 | 5 |
| Hyperbolic Vision Transformers: Combining Improvements in Metric Learning | Mar 21, 2022 | Metric Learning | CodeCode Available | 2 | 5 |
| Sequential Multivariate Change Detection with Calibrated and Memoryless False Detection Rates | Aug 2, 2021 | Change Detection | CodeCode Available | 2 | 5 |
| TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks | Sep 9, 2024 | ClassificationLanguage Modeling | CodeCode Available | 2 | 5 |
| Efficient Long-Range Attention Network for Image Super-resolution | Mar 13, 2022 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey | Aug 18, 2023 | DeblurringImage Restoration | CodeCode Available | 2 | 5 |
| Honegumi: An Interface for Accelerating the Adoption of Bayesian Optimization in the Experimental Sciences | Feb 4, 2025 | Bayesian OptimizationExperimental Design | CodeCode Available | 2 | 5 |
| PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization | Dec 18, 2019 | Abstractive Text SummarizationDecoder | CodeCode Available | 2 | 5 |
| Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era | Mar 13, 2024 | | CodeCode Available | 2 | 5 |
| TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Nov 28, 2023 | ClassificationDomain Generalization | CodeCode Available | 2 | 5 |
| UniDrive: Towards Universal Driving Perception Across Camera Configurations | Oct 17, 2024 | Autonomous Driving | CodeCode Available | 2 | 5 |
| PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization | Oct 25, 2023 | Navigate | CodeCode Available | 2 | 5 |
| Continual Learning on Graphs: Challenges, Solutions, and Opportunities | Feb 18, 2024 | Continual LearningGraph Learning | CodeCode Available | 2 | 5 |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Mar 22, 2022 | Object TrackingRelation | CodeCode Available | 2 | 5 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual LearningGeneral Knowledge | CodeCode Available | 2 | 5 |
| DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design | Feb 26, 2024 | AvgDrug Design | CodeCode Available | 2 | 5 |
| NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality | May 9, 2022 | SentenceSpeech Synthesis | CodeCode Available | 2 | 5 |
| General Detection-based Text Line Recognition | Sep 25, 2024 | HTROptical Character Recognition (OCR) | CodeCode Available | 2 | 5 |
| Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models | Apr 14, 2025 | Action GenerationDenoising | CodeCode Available | 2 | 5 |
| FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling | Jun 9, 2025 | Density Estimation | CodeCode Available | 2 | 5 |
| Denoising Diffusion Models for Plug-and-Play Image Restoration | May 15, 2023 | DeblurringDenoising | CodeCode Available | 2 | 5 |
| OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds | Apr 6, 2024 | 3D Reconstruction | CodeCode Available | 2 | 5 |
| RemDet: Rethinking Efficient Model Design for UAV Object Detection | Dec 13, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Pose2Sim: An open-source Python package for multiview markerless kinematics | Sep 14, 2022 | | CodeCode Available | 2 | 5 |
| Scaling Laws of Synthetic Images for Model Training ... for Now | Dec 7, 2023 | | CodeCode Available | 2 | 5 |
| DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency | Mar 10, 2024 | PredictionPrognosis | CodeCode Available | 2 | 5 |
| FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation | Mar 29, 2024 | Blind DockingDrug Discovery | CodeCode Available | 2 | 5 |