| Aligning to Thousands of Preferences via System Message Generalization | May 28, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Learning Manipulation by Predicting Interaction | Jun 1, 2024 | Representation Learning | Code Available | 2 |
| HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification | Sep 21, 2022 | Classification, image-classification | Code Available | 2 |
| Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Nov 24, 2022 | Fairness, Fraud Detection | Code Available | 2 |
| Revealing Single Frame Bias for Video-and-Language Learning | Jun 7, 2022 | Action Recognition, Fine-grained Action Recognition | Code Available | 2 |
| OctFormer: Octree-based Transformers for 3D Point Clouds | May 4, 2023 | 3D Object Detection, 3D Semantic Segmentation | Code Available | 2 |
| MobileFaceSwap: A Lightweight Framework for Video Face Swapping | Jan 11, 2022 | Face Swapping, Knowledge Distillation | Code Available | 2 |
| Vision Language Models in Autonomous Driving: A Survey and Outlook | Oct 22, 2023 | Autonomous Driving, Decision Making | Code Available | 2 |
| Architectures of Topological Deep Learning: A Survey of Message-Passing Topological Neural Networks | Apr 20, 2023 | | Code Available | 2 |
| NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Oct 25, 2024 | SSIM, Video Reconstruction | Code Available | 2 |
| Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models | Sep 30, 2024 | Benchmarking, Continual Learning | Code Available | 2 |
| DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning | Apr 18, 2022 | Active Learning, Continual Learning | Code Available | 2 |
| A Self-Attention Ansatz for Ab-initio Quantum Chemistry | Nov 24, 2022 | | Code Available | 2 |
| SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models | Dec 11, 2023 | | Code Available | 2 |
| Deep Hierarchical Semantic Segmentation | Mar 27, 2022 | Multi-Label Classification | Code Available | 2 |
| A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation | Jan 29, 2024 | Survey | Code Available | 2 |
| EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation | Mar 20, 2023 | 3D Face Animation, Decoder | Code Available | 2 |
| SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation | Jun 21, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language Models | May 12, 2025 | Large Language Model, Sociology | Code Available | 2 |
| Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs | Feb 11, 2022 | Drug Discovery, Graph Learning | Code Available | 2 |
| Learning Multi-Agent Communication from Graph Modeling Perspective | May 14, 2024 | | Code Available | 2 |
| WorldPM: Scaling Human Preference Modeling | May 15, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Turning a CLIP Model into a Scene Text Spotter | Aug 21, 2023 | Object Detection | Code Available | 2 |
| Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation | Mar 3, 2025 | Representation Learning, Retrieval | Code Available | 2 |
| Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving | Sep 24, 2024 | Autonomous Driving, Imitation Learning | Code Available | 2 |
| Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | May 28, 2024 | Instance Segmentation, Object Proposal Generation | Code Available | 2 |
| Benchmarking Representations for Speech, Music, and Acoustic Events | May 2, 2024 | Audio Classification, Benchmarking | Code Available | 2 |
| FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning | Dec 19, 2023 | Contrastive Learning, Denoising | Code Available | 2 |
| SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation | Apr 2, 2024 | 3D Pose Estimation, Pose Estimation | Code Available | 2 |
| ZeroGUI: Automating Online GUI Learning at Zero Human Cost | May 29, 2025 | | Code Available | 2 |
| Customizable Perturbation Synthesis for Robust SLAM Benchmarking | Feb 12, 2024 | Benchmarking, Simultaneous Localization and Mapping | Code Available | 2 |
| Deep Learning Recommendation Model for Personalization and Recommendation Systems | May 31, 2019 | Deep Learning, Recommendation Systems | Code Available | 2 |
| Learning Truncated Causal History Model for Video Restoration | Oct 4, 2024 | Deblurring, Denoising | Code Available | 2 |
| ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks | Feb 23, 2021 | Image Classification | Code Available | 2 |
| Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement | May 24, 2024 | Hallucination, Image Comprehension | Code Available | 2 |
| Zoology: Measuring and Improving Recall in Efficient Language Models | Dec 8, 2023 | | Code Available | 2 |
| One for All: Towards Training One Graph Model for All Classification Tasks | Sep 29, 2023 | All, Graph Classification | Code Available | 2 |
| Modern Methods in Associative Memory | Jul 8, 2025 | | Code Available | 2 |
| State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Sep 30, 2022 | Benchmarking, Blind Docking | Code Available | 2 |
| Spacing Loss for Discovering Novel Categories | Apr 22, 2022 | Novel Class Discovery | Code Available | 2 |
| The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation | Jul 27, 2023 | Depth Estimation, Image Restoration | Code Available | 2 |
| OmniVid: A Generative Framework for Universal Video Understanding | Mar 26, 2024 | Action Recognition, Decoder | Code Available | 2 |
| Swarm-SLAM: Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Jan 16, 2023 | Simultaneous Localization and Mapping | Code Available | 2 |
| Token-Budget-Aware LLM Reasoning | Dec 24, 2024 | | Code Available | 2 |
| Reference-based Video Super-Resolution Using Multi-Camera Video Triplets | Mar 28, 2022 | Reference-based Video Super-Resolution, Super-Resolution | Code Available | 2 |
| Conformal Prediction for Deep Classifier via Label Ranking | Oct 10, 2023 | Conformal Prediction, Prediction | Code Available | 2 |
| Localizing Task Information for Improved Model Merging and Compression | May 13, 2024 | Task Arithmetic | Code Available | 2 |
| Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions | Jun 13, 2024 | Philosophy | Code Available | 2 |
| FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | May 28, 2024 | Relation, Topic Models | Code Available | 2 |
| MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge -- Motion Prediction | Sep 20, 2022 | motion prediction, Prediction | Code Available | 2 |