| Tensor-Var: Variational Data Assimilation in Tensor Product Feature Space | Jan 23, 2025 | | CodeCode Available | 2 |
| CleanDIFT: Diffusion Features without Noise | Dec 4, 2024 | Semantic correspondence | CodeCode Available | 2 |
| ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness | Mar 13, 2025 | 3D Human Pose Estimation3D Human Shape Estimation | CodeCode Available | 2 |
| CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series | Oct 3, 2024 | Causal DiscoveryTime Series | CodeCode Available | 2 |
| GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Jan 22, 2025 | Autonomous DrivingNeRF | CodeCode Available | 2 |
| Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Apr 29, 2024 | Image CompressionImage Reconstruction | CodeCode Available | 2 |
| SRFormerV2: Taking a Closer Look at Permuted Self-Attention for Image Super-Resolution | Mar 17, 2023 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases | Jun 8, 2023 | | CodeCode Available | 2 |
| Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models | Feb 19, 2025 | GPUQuantization | CodeCode Available | 2 |
| Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting | Oct 7, 2024 | 3DGS | CodeCode Available | 2 |
| Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration | Apr 2, 2024 | AllDecoder | CodeCode Available | 2 |
| Towards Training-free Anomaly Detection with Vision and Language Foundation Models | Mar 24, 2025 | Anomaly Detection | CodeCode Available | 2 |
| Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer | Apr 7, 2022 | Video Generation | CodeCode Available | 2 |
| LLM As DBA | Aug 10, 2023 | | CodeCode Available | 2 |
| 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations | Apr 29, 2024 | Human Parsing | CodeCode Available | 2 |
| DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis | Dec 16, 2024 | DisentanglementMultimodal Sentiment Analysis | CodeCode Available | 2 |
| VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo Images | Jun 6, 2022 | 3D Single Object TrackingObject | CodeCode Available | 2 |
| Image Matching Filtering and Refinement by Planes and Beyond | Nov 14, 2024 | Deep LearningTemplate Matching | CodeCode Available | 2 |
| scDiffusion: conditional generation of high-quality single-cell data using diffusion model | Jan 8, 2024 | | CodeCode Available | 2 |
| TFPred: Learning Discriminative Representations from Unlabeled Data for Few-Label Rotating Machinery Fault Diagnosis | May 1, 2024 | Fault DetectionFault Diagnosis | CodeCode Available | 2 |
| RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs | Jan 17, 2022 | Blind Face RestorationFace Reconstruction | CodeCode Available | 2 |
| ZipIt! Merging Models from Different Tasks without Training | May 4, 2023 | | CodeCode Available | 2 |
| ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Feb 12, 2025 | DecoderDescriptive | CodeCode Available | 2 |
| A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 | Mar 13, 2025 | | CodeCode Available | 2 |
| SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation | Apr 7, 2022 | Autonomous DrivingDepth Estimation | CodeCode Available | 2 |
| Generalized Category Discovery | Jan 7, 2022 | Fine-Grained Visual RecognitionOpen-World Semi-Supervised Learning | CodeCode Available | 2 |
| Personalized Representation from Personalized Generation | Dec 20, 2024 | Contrastive LearningImage Generation | CodeCode Available | 2 |
| genomepy: genes and genomes at your fingertips | Sep 2, 2022 | | CodeCode Available | 2 |
| A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges | Mar 15, 2024 | | CodeCode Available | 2 |
| Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Jun 26, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Dec 9, 2024 | 3D GenerationImage to 3D | CodeCode Available | 2 |
| SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation | Apr 19, 2025 | ERPVideo Generation | CodeCode Available | 2 |
| Open3DBench: Open-Source Benchmark for 3D-IC Backend Implementation and PPA Evaluation | Mar 17, 2025 | | CodeCode Available | 2 |
| Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models | Apr 10, 2025 | Emotion InterpretationEmotion Recognition | CodeCode Available | 2 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K | Feb 6, 2024 | 16kBenchmarking | CodeCode Available | 2 |
| textless-lib: a Library for Textless Spoken Language Processing | Feb 15, 2022 | Resynthesis | CodeCode Available | 2 |
| MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Mar 29, 2024 | Multi-Task Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction | Jun 18, 2023 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra | Feb 13, 2025 | DecoderDe novo molecule generation from MS/MS spectrum (bonus chemical formulae) | CodeCode Available | 2 |
| LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration | Oct 20, 2024 | AllComputational Efficiency | CodeCode Available | 2 |
| AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Nov 27, 2024 | Navigate | CodeCode Available | 2 |
| FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning | Mar 30, 2025 | 2kGPU | CodeCode Available | 2 |
| Alfie: Democratising RGBA Image Generation With No $ | Aug 27, 2024 | Image GenerationImage Matting | CodeCode Available | 2 |
| Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models | Mar 19, 2024 | Instruction Followingvisual instruction following | CodeCode Available | 2 |
| Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment | May 28, 2024 | | CodeCode Available | 2 |
| Realistic Test-Time Adaptation of Vision-Language Models | Jan 7, 2025 | Test-time Adaptation | CodeCode Available | 2 |
| Compact 3D Gaussian Representation for Radiance Field | Nov 22, 2023 | 3DGSModel Compression | CodeCode Available | 2 |
| Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning | Jun 3, 2025 | | CodeCode Available | 2 |
| GREC: Generalized Referring Expression Comprehension | Aug 30, 2023 | Generalized Referring Expression ComprehensionReferring Expression | CodeCode Available | 2 |