| Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Sep 2, 2024 | DiversityOffline RL | CodeCode Available | 2 | 5 |
| Associative Recurrent Memory Transformer | Jul 5, 2024 | Retrieval | CodeCode Available | 2 | 5 |
| UltraMedical: Building Specialized Generalists in Biomedicine | Jun 6, 2024 | | CodeCode Available | 2 | 5 |
| ResCLIP: Residual Attention for Training-free Dense Vision-language Inference | Nov 24, 2024 | AttributeSemantic Segmentation | CodeCode Available | 2 | 5 |
| STAR: Scale-wise Text-to-image generation via Auto-Regressive representations | Jun 16, 2024 | DiversityImage Generation | CodeCode Available | 2 | 5 |
| YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection | Aug 10, 2023 | Objectobject-detection | CodeCode Available | 2 | 5 |
| AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior | Mar 20, 2024 | | CodeCode Available | 2 | 5 |
| Low-Light Image Enhancement via Structure Modeling and Guidance | May 10, 2023 | Edge DetectionImage Enhancement | CodeCode Available | 2 | 5 |
| Vision-Language Pre-Training with Triple Contrastive Learning | Feb 21, 2022 | Contrastive Learningcross-modal alignment | CodeCode Available | 2 | 5 |
| LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs | Jun 17, 2025 | | CodeCode Available | 2 | 5 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Dec 25, 2023 | Image SegmentationObject | CodeCode Available | 2 | 5 |
| KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints | May 10, 2022 | 3D Face Reconstruction3D Human Reconstruction | CodeCode Available | 2 | 5 |
| RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control | Jun 6, 2023 | continuous-controlContinuous Control | CodeCode Available | 2 | 5 |
| DepthMaster: Taming Diffusion Models for Monocular Depth Estimation | Jan 5, 2025 | DenoisingDepth Estimation | CodeCode Available | 2 | 5 |
| Efficient Autoregressive Audio Modeling via Next-Scale Prediction | Aug 16, 2024 | Audio GenerationFAD | CodeCode Available | 2 | 5 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Mar 14, 2022 | Objectobject-detection | CodeCode Available | 2 | 5 |
| SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models | Mar 4, 2022 | Contrastive LearningGraph Embedding | CodeCode Available | 2 | 5 |
| Make-A-Shape: a Ten-Million-scale 3D Shape Model | Jan 20, 2024 | | CodeCode Available | 2 | 5 |
| On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection | Oct 31, 2024 | Video Forensics | CodeCode Available | 2 | 5 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Adversarial Detection and Correction by Matching Prediction Distributions | Feb 21, 2020 | Prediction | CodeCode Available | 2 | 5 |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Jul 2, 2024 | PredictionText to 3D | CodeCode Available | 2 | 5 |
| ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation | Feb 20, 2025 | 3D Molecule GenerationProtein Design | CodeCode Available | 2 | 5 |
| Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching | Mar 1, 2024 | Stereo Matching | CodeCode Available | 2 | 5 |
| Zero Shot Health Trajectory Prediction Using Transformer | Jul 30, 2024 | ICU AdmissionICU Mortality | CodeCode Available | 2 | 5 |
| Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics | Feb 19, 2025 | | CodeCode Available | 2 | 5 |
| X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios | Nov 2, 2024 | Denoising | CodeCode Available | 2 | 5 |
| MHG-GNN: Combination of Molecular Hypergraph Grammar with Graph Neural Network | Sep 28, 2023 | Graph Neural NetworkPrediction | CodeCode Available | 2 | 5 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 | 5 |
| Learning-Based Defect Recognitions for Autonomous UAV Inspections | Feb 13, 2023 | Crack SegmentationSegmentation | CodeCode Available | 2 | 5 |
| InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences | Dec 2, 2024 | | CodeCode Available | 2 | 5 |
| LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection | Mar 26, 2024 | Image Generation | CodeCode Available | 2 | 5 |
| Deep Learning-Based Point Cloud Registration: A Comprehensive Survey and Taxonomy | Apr 22, 2024 | Autonomous DrivingDeep Learning | CodeCode Available | 2 | 5 |
| Geometry Aware Operator Transformer as an Efficient and Accurate Neural Surrogate for PDEs on Arbitrary Domains | May 24, 2025 | Computational EfficiencyOperator learning | CodeCode Available | 2 | 5 |
| Poison-splat: Computation Cost Attack on 3D Gaussian Splatting | Oct 10, 2024 | 3DGS | CodeCode Available | 2 | 5 |
| Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Mar 18, 2024 | Machine TranslationTranslation | CodeCode Available | 2 | 5 |
| A Survey on Diffusion Models for Recommender Systems | Sep 8, 2024 | Data AugmentationRecommendation Systems | CodeCode Available | 2 | 5 |
| Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology | Jun 3, 2025 | Multiple Instance LearningPrognosis | CodeCode Available | 2 | 5 |
| CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations | Apr 10, 2024 | Dialogue Generationtext-to-speech | CodeCode Available | 2 | 5 |
| Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Mar 29, 2024 | NeRF | CodeCode Available | 2 | 5 |
| Panda: A pretrained forecast model for universal representation of chaotic dynamics | May 19, 2025 | Time Series | CodeCode Available | 2 | 5 |
| Embedded FPGA Developments in 130nm and 28nm CMOS for Machine Learning in Particle Detector Readout | Apr 26, 2024 | | CodeCode Available | 2 | 5 |
| InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval | Jan 4, 2023 | Information RetrievalRetrieval | CodeCode Available | 2 | 5 |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Oct 11, 2022 | 3D Point Cloud Classification3D Semantic Segmentation | CodeCode Available | 2 | 5 |
| Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks | Jan 30, 2024 | | CodeCode Available | 2 | 5 |
| Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking | Apr 12, 2024 | Contrastive LearningRetrieval | CodeCode Available | 2 | 5 |
| Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 | Dec 26, 2023 | All | CodeCode Available | 2 | 5 |
| Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Jun 1, 2023 | Image GenerationStory Visualization | CodeCode Available | 2 | 5 |
| LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts | Dec 16, 2024 | General KnowledgeInstruction Following | CodeCode Available | 2 | 5 |
| Deconstructing equivariant representations in molecular systems | Oct 10, 2024 | Property Prediction | CodeCode Available | 2 | 5 |