| Ensured: Explanations for Decreasing the Epistemic Uncertainty in Predictions | Oct 7, 2024 | | Code Available | 2 | 5 |
| HAIR: Hypernetworks-based All-in-One Image Restoration | Aug 15, 2024 | 5-Degradation Blind All-in-One Image Restoration, All | Code Available | 2 | 5 |
| Unifying Pairwise Interactions in Complex Dynamics | Jan 28, 2022 | Causal Inference, Time Series | Code Available | 2 | 5 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data Augmentation, LIDAR Semantic Segmentation | Code Available | 2 | 5 |
| Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Mar 26, 2025 | Binary Classification, DeepFake Detection | Code Available | 2 | 5 |
| Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts | Oct 10, 2024 | Mixture-of-Experts | Code Available | 2 | 5 |
| GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation | Nov 2, 2024 | Imitation Learning | Code Available | 2 | 5 |
| TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods | Jul 31, 2024 | | Code Available | 2 | 5 |
| MTGS: Multi-Traversal Gaussian Splatting | Mar 16, 2025 | Navigate, Novel View Synthesis | Code Available | 2 | 5 |
| What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Mar 14, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception | Mar 17, 2025 | Future prediction, Scene Generation | Code Available | 2 | 5 |
| Discovery of 2D materials using Transformer Network based Generative Design | Jan 14, 2023 | Formation Energy, Self-Learning | Code Available | 2 | 5 |
| A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions | Feb 9, 2023 | Multimodal Recommendation, Recommendation Systems | Code Available | 2 | 5 |
| PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety | Jan 22, 2024 | | Code Available | 2 | 5 |
| Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | May 5, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation | Jun 14, 2022 | Code Generation, Library-Oriented Code Generation | Code Available | 2 | 5 |
| HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion | May 10, 2023 | Motion Synthesis, Novel View Synthesis | Code Available | 2 | 5 |
| Misalignment-Robust Frequency Distribution Loss for Image Transformation | Feb 28, 2024 | Image Enhancement, Style Transfer | Code Available | 2 | 5 |
| TRESTLE: A Model of Concept Formation in Structured Domains | Oct 14, 2024 | Attribute | Code Available | 2 | 5 |
| Synthesis of discrete-continuous quantum circuits with multimodal diffusion models | Jun 2, 2025 | Denoising, Parameter Prediction | Code Available | 2 | 5 |
| SecAlign: Defending Against Prompt Injection with Preference Optimization | Oct 7, 2024 | | Code Available | 2 | 5 |
| decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points | Apr 19, 2024 | Quantization | Code Available | 2 | 5 |
| SegFace: Face Segmentation of Long-Tail Classes | Dec 11, 2024 | Face Parsing, Face Swapping | Code Available | 2 | 5 |
| Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Sep 23, 2024 | 3DGS, NeRF | Code Available | 2 | 5 |
| IMU-Aided Event-based Stereo Visual Odometry | May 7, 2024 | Pose Tracking, Visual Odometry | Code Available | 2 | 5 |
| Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs | Feb 19, 2024 | Question Answering | Code Available | 2 | 5 |
| MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension | Nov 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning | May 7, 2025 | Multiple-choice, Question Answering | Code Available | 2 | 5 |
| SE(3)-Stochastic Flow Matching for Protein Backbone Generation | Oct 3, 2023 | | Code Available | 2 | 5 |
| AnyText2: Visual Text Generation and Editing With Customizable Attributes | Nov 22, 2024 | Image Generation, Text Generation | Code Available | 2 | 5 |
| Smaller But Better: Unifying Layout Generation with Smaller Large Language Models | Feb 19, 2025 | Layout Generation | Code Available | 2 | 5 |
| Dialectal Coverage And Generalization in Arabic Speech Recognition | Nov 7, 2024 | Arabic Speech Recognition, Automatic Speech Recognition | Code Available | 2 | 5 |
| Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection | Mar 30, 2022 | 2D Object Detection, Bilevel Optimization | Code Available | 2 | 5 |
| LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning | Nov 30, 2023 | 3D dense captioning, Dense Captioning | Code Available | 2 | 5 |
| QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control | Jun 15, 2023 | CPU, Deep Reinforcement Learning | Code Available | 2 | 5 |
| FinMTEB: Finance Massive Text Embedding Benchmark | Feb 16, 2025 | Articles, Semantic Textual Similarity | Code Available | 2 | 5 |
| Online Vectorized HD Map Construction using Geometry | Dec 6, 2023 | Online Vectorized HD Map Construction | Code Available | 2 | 5 |
| Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution | Jan 27, 2025 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation | Oct 27, 2024 | Imputation, Tabular Data Generation | Code Available | 2 | 5 |
| VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Dec 30, 2024 | Denoising, Image Generation | Code Available | 2 | 5 |
| Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | May 10, 2023 | Autonomous Driving, Bench2Drive | Code Available | 2 | 5 |
| Transformer tricks: Removing weights for skipless transformers | Apr 18, 2024 | | Code Available | 2 | 5 |
| Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph | Mar 14, 2024 | 3D Generation, 3DGS | Code Available | 2 | 5 |
| Listen, Think, and Understand | May 18, 2023 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| Pre-training Differentially Private Models with Limited Public Data | Feb 28, 2024 | TAG | Code Available | 2 | 5 |
| Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling | Jul 8, 2024 | Denoising, Image Inpainting | Code Available | 2 | 5 |
| BARS: Towards Open Benchmarking for Recommender Systems | May 19, 2022 | Benchmarking, Click-Through Rate Prediction | Code Available | 2 | 5 |
| RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models | Oct 17, 2024 | Image Captioning, Question Answering | Code Available | 2 | 5 |
| Optimal Invariant Bases for Atomistic Machine Learning | Mar 30, 2025 | | Code Available | 2 | 5 |
| Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration | May 31, 2024 | Deformable Medical Image Registration, Image Registration | Code Available | 2 | 5 |