| RevColV2: Exploring Disentangled Representations in Masked Image Modeling | Sep 2, 2023 | Decoder, image-classification | Code Available | 2 | 5 |
| Audio Deepfake Detection with Self-Supervised XLS-R and SLS Classifier | Oct 28, 2024 | Audio Deepfake Detection, Audio Generation | Code Available | 2 | 5 |
| UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild | May 18, 2023 | Image Generation | Code Available | 2 | 5 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model Optimization, Novel Concepts | Code Available | 2 | 5 |
| Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey | Dec 5, 2023 | | Code Available | 2 | 5 |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising | Nov 23, 2022 | Denoising, Vector Graphics | Code Available | 2 | 5 |
| Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Dec 7, 2023 | Domain Generalization | Code Available | 2 | 5 |
| DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data | Jun 15, 2023 | | Code Available | 2 | 5 |
| Training-Free Text-Guided Image Editing with Visual Autoregressive Model | Mar 31, 2025 | text-guided-image-editing | Code Available | 2 | 5 |
| SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining | Mar 25, 2025 | Autonomous Driving, Computational Efficiency | Code Available | 2 | 5 |
| Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models | Dec 21, 2023 | 2k | Code Available | 2 | 5 |
| HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Dec 21, 2023 | 2k, Image Inpainting | Code Available | 2 | 5 |
| Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Dec 29, 2023 | 3D geometry, Autonomous Driving | Code Available | 2 | 5 |
| PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation | Sep 10, 2024 | | Code Available | 2 | 5 |
| Malla: Demystifying Real-world Large Language Model Integrated Malicious Services | Jan 6, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry | Jan 7, 2024 | Data Augmentation, Drug Discovery | Code Available | 2 | 5 |
| SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems | Jan 8, 2024 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach | Jan 11, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment | Jan 16, 2024 | Disentanglement, Self-Supervised Learning | Code Available | 2 | 5 |
| EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | Jan 13, 2024 | Code Generation, Few-Shot Learning | Code Available | 2 | 5 |
| Improved Implicit Neural Representation with Fourier Reparameterized Training | Jan 15, 2024 | | Code Available | 2 | 5 |
| Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification | Jan 12, 2024 | Clustering, Person Re-Identification | Code Available | 2 | 5 |
| Extending LLMs' Context Window with 100 Samples | Jan 13, 2024 | Position | Code Available | 2 | 5 |
| Recovering the Pre-Fine-Tuning Weights of Generative Models | Feb 15, 2024 | Pre-Fine-Tuning Weight Recovery | Code Available | 2 | 5 |
| Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Jan 19, 2024 | Offline RL, reinforcement-learning | Code Available | 2 | 5 |
| CloSe: A 3D Clothing Segmentation Dataset and Model | Jan 22, 2024 | Continual Learning, model | Code Available | 2 | 5 |
| Coverage Axis++: Efficient Inner Point Selection for 3D Shape Skeletonization | Jan 23, 2024 | | Code Available | 2 | 5 |
| Learning Universal Predictors | Jan 26, 2024 | Meta-Learning | Code Available | 2 | 5 |
| LYT-NET: Lightweight YUV Transformer-based Network for Low-light Image Enhancement | Jan 26, 2024 | Color Image Denoising, Image Enhancement | Code Available | 2 | 5 |
| Grokking at the Edge of Numerical Stability | Jan 8, 2025 | | Code Available | 2 | 5 |
| Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance | Feb 3, 2024 | Denoising | Code Available | 2 | 5 |
| Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding | Feb 7, 2024 | | Code Available | 2 | 5 |
| Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming | Feb 7, 2024 | | Code Available | 2 | 5 |
| Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation | Feb 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 | 5 |
| TimeSeriesBench: An Industrial-Grade Benchmark for Time Series Anomaly Detection Models | Feb 16, 2024 | Anomaly Detection, Time Series | Code Available | 2 | 5 |
| OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models | Feb 16, 2024 | Common Sense Reasoning, Navigate | Code Available | 2 | 5 |
| Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Feb 19, 2024 | Adversarial Defense, Multimodal Deep Learning | Code Available | 2 | 5 |
| JAXbind: Bind any function to JAX | Mar 13, 2024 | | Code Available | 2 | 5 |
| XoFTR: Cross-modal Feature Matching Transformer | Apr 15, 2024 | Image Augmentation | Code Available | 2 | 5 |
| Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning | Feb 21, 2024 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| Aligning Diffusion Models by Optimizing Human Utility | Apr 6, 2024 | | Code Available | 2 | 5 |
| ToMBench: Benchmarking Theory of Mind in Large Language Models | Feb 23, 2024 | Benchmarking, Multiple-choice | Code Available | 2 | 5 |
| HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Feb 24, 2024 | Denoising, Image Restoration | Code Available | 2 | 5 |
| Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Feb 26, 2024 | Quantization, Speech Enhancement | Code Available | 2 | 5 |
| Morphological Symmetries in Robotics | Feb 23, 2024 | Data Augmentation | Code Available | 2 | 5 |
| Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | Feb 27, 2024 | Prompt Engineering | Code Available | 2 | 5 |
| CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision | Feb 26, 2024 | Representation Learning, Transfer Learning | Code Available | 2 | 5 |
| PromptMM: Multi-Modal Knowledge Distillation for Recommendation with Prompt-Tuning | Feb 27, 2024 | Knowledge Distillation, Model Compression | Code Available | 2 | 5 |
| SURE: SUrvey REcipes for building reliable and robust deep networks | Mar 1, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects | Sep 21, 2024 | | Code Available | 2 | 5 |