| OncoReg: Medical Image Registration for Oncological Challenges | Mar 29, 2025 | Image Registration, Medical Image Registration | Code Available | 2 | 5 |
| DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion | Mar 13, 2023 | Denoising | Code Available | 2 | 5 |
| A 3D Generative Model for Structure-Based Drug Design | Mar 20, 2022 | Drug Design, valid | Code Available | 2 | 5 |
| TriDet: Temporal Action Detection with Relative Boundary Modeling | Mar 13, 2023 | Action Detection, Temporal Action Localization | Code Available | 2 | 5 |
| Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation | Oct 17, 2024 | | Code Available | 2 | 5 |
| Pop2Piano : Pop Audio-based Piano Cover Generation | Nov 2, 2022 | | Code Available | 2 | 5 |
| Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering | Nov 21, 2022 | Dynamic Reconstruction, Tensor Decomposition | Code Available | 2 | 5 |
| Seq vs Seq: An Open Suite of Paired Encoders and Decoders | Jul 15, 2025 | Decoder, Large Language Model | Code Available | 2 | 5 |
| QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models | Oct 25, 2023 | GPU, Mixture-of-Experts | Code Available | 2 | 5 |
| TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Apr 29, 2024 | Denoising, Image Generation | Code Available | 2 | 5 |
| Metadata Conditioning Accelerates Language Model Pre-training | Jan 3, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models | Oct 17, 2024 | Contrastive Learning, Diversity | Code Available | 2 | 5 |
| SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning | Feb 6, 2025 | Benchmarking, Data Poisoning | Code Available | 2 | 5 |
| Large-Scale Pre-training for Person Re-identification with Noisy Labels | Mar 30, 2022 | Contrastive Learning, Multi-Object Tracking | Code Available | 2 | 5 |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Aug 18, 2022 | Object, object-detection | Code Available | 2 | 5 |
| Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Aug 27, 2024 | Deblurring, Video Enhancement | Code Available | 2 | 5 |
| Data Selection for Language Models via Importance Resampling | Feb 6, 2023 | | Code Available | 2 | 5 |
| UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design | Feb 18, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining | May 17, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| SSLRec: A Self-Supervised Learning Framework for Recommendation | Aug 10, 2023 | Collaborative Filtering, Data Augmentation | Code Available | 2 | 5 |
| Do Membership Inference Attacks Work on Large Language Models? | Feb 12, 2024 | Membership Inference Attack | Code Available | 2 | 5 |
| AMLB: an AutoML Benchmark | Jul 25, 2022 | AutoML | Code Available | 2 | 5 |
| NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing | Jun 10, 2024 | Scheduling, Video Editing | Code Available | 2 | 5 |
| Large Scale Radio Frequency Wideband Signal Detection & Recognition | Nov 4, 2022 | object-detection, Object Detection | Code Available | 2 | 5 |
| Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting | Jan 2, 2023 | 3D Object Detection, Motion Forecasting | Code Available | 2 | 5 |
| Optimization of Rank Losses for Image Retrieval | Sep 15, 2023 | Image Retrieval, Retrieval | Code Available | 2 | 5 |
| VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge | Aug 5, 2024 | Clinical Knowledge, Diagnostic | Code Available | 2 | 5 |
| One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive Learning | Feb 12, 2024 | Classification, Contrastive Learning | Code Available | 2 | 5 |
| Large Language Models Meet NLP: A Survey | May 21, 2024 | Survey | Code Available | 2 | 5 |
| GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis | Feb 21, 2024 | | Code Available | 2 | 5 |
| SeamlessM4T: Massively Multilingual & Multimodal Machine Translation | Aug 22, 2023 | Automatic Speech Recognition, Machine Translation | Code Available | 2 | 5 |
| Retrieval-Augmented Score Distillation for Text-to-3D Generation | Feb 5, 2024 | 3D Generation, 3D geometry | Code Available | 2 | 5 |
| PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains | Dec 15, 2023 | Self-Supervised Learning | Code Available | 2 | 5 |
| Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks | May 5, 2021 | image-classification, Image Classification | Code Available | 2 | 5 |
| Swift Parameter-free Attention Network for Efficient Super-Resolution | Nov 21, 2023 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| Human-Centric Foundation Models: Perception, Generation and Agentic Modeling | Feb 12, 2025 | Survey | Code Available | 2 | 5 |
| Fietje: An open, efficient LLM for Dutch | Dec 19, 2024 | Linguistic Acceptability, Sentiment Analysis | Code Available | 2 | 5 |
| Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Apr 6, 2025 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 2 | 5 |
| Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings | Oct 4, 2022 | Gesture Generation, Rhythm | Code Available | 2 | 5 |
| Contrastive learning of cell state dynamics in response to perturbations | Oct 15, 2024 | Cell Tracking, Contrastive Learning | Code Available | 2 | 5 |
| KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems | Feb 22, 2022 | Conversational Recommendation, Recommendation Systems | Code Available | 2 | 5 |
| The Devil is in Temporal Token: High Quality Video Reasoning Segmentation | Jan 15, 2025 | Reasoning Segmentation, Referring Expression Segmentation | Code Available | 2 | 5 |
| ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection | May 10, 2025 | Anomaly Detection, continual anomaly detection | Code Available | 2 | 5 |
| A Simple Episodic Linear Probe Improves Visual Recognition in the Wild | Jan 1, 2022 | Fine-Grained Image Classification, Image Classification | Code Available | 2 | 5 |
| SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D | Oct 4, 2023 | 3D Generation, Text to 3D | Code Available | 2 | 5 |
| SALT: Introducing a Framework for Hierarchical Segmentations in Medical Imaging using Softmax for Arbitrary Label Trees | Jul 11, 2024 | Diagnostic | Code Available | 2 | 5 |
| Skinned Motion Retargeting with Dense Geometric Interaction Perception | Oct 28, 2024 | motion retargeting | Code Available | 2 | 5 |
| DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Dec 19, 2024 | | Code Available | 2 | 5 |
| SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis | Nov 25, 2024 | 3D Generation, 3DGS | Code Available | 2 | 5 |
| DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes | Dec 13, 2023 | Autonomous Driving | Code Available | 2 | 5 |