| Tree of Attacks: Jailbreaking Black-Box LLMs Automatically | Dec 4, 2023 | Navigate | CodeCode Available | 2 | 5 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis | Jul 4, 2023 | Image Generation | CodeCode Available | 2 | 5 |
| DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Jul 5, 2023 | Object | CodeCode Available | 2 | 5 |
| Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System | Mar 11, 2024 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models | Apr 16, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Jul 12, 2024 | Information RetrievalQuestion Answering | CodeCode Available | 2 | 5 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image SegmentationSegmentation | CodeCode Available | 2 | 5 |
| Multi-Representation Adaptation Network for Cross-domain Image Classification | Jan 4, 2022 | ClassificationDomain Adaptation | CodeCode Available | 2 | 5 |
| Anomaly Detection via Reverse Distillation from One-Class Embedding | Jan 26, 2022 | Anomaly Classification | CodeCode Available | 2 | 5 |
| Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | Sep 4, 2024 | Crack SegmentationDecoder | CodeCode Available | 2 | 5 |
| LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning | Jun 12, 2024 | text-to-speechText to Speech | CodeCode Available | 2 | 5 |
| Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Mar 15, 2024 | Motion Synthesis | CodeCode Available | 2 | 5 |
| Generalizable Human Gaussians for Sparse View Synthesis | Jul 17, 2024 | NeRFNeural Rendering | CodeCode Available | 2 | 5 |
| Protein structure generation via folding diffusion | Sep 30, 2022 | DenoisingProtein Structure Prediction | CodeCode Available | 2 | 5 |
| Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Aug 10, 2024 | geo-localizationImage Retrieval | CodeCode Available | 2 | 5 |
| Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Mar 5, 2024 | Pseudo LabelSemantic Segmentation | CodeCode Available | 2 | 5 |
| ARoFace: Alignment Robustness to Improve Low-Quality Face Recognition | Jul 20, 2024 | Data AugmentationFace Alignment | CodeCode Available | 2 | 5 |
| HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation | Apr 9, 2023 | DenoisingImage Generation | CodeCode Available | 2 | 5 |
| Liquid Structural State-Space Models | Sep 26, 2022 | Heart rate estimationLong-range modeling | CodeCode Available | 2 | 5 |
| STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting | Jun 7, 2024 | motion retargeting | CodeCode Available | 2 | 5 |
| SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description | Aug 24, 2024 | DescriptiveSpeech Synthesis | CodeCode Available | 2 | 5 |
| One Transformer Can Understand Both 2D & 3D Molecular Data | Oct 4, 2022 | Graph Regressionmolecular representation | CodeCode Available | 2 | 5 |
| SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video | Jan 30, 2022 | 3D Human ReconstructionNeural Rendering | CodeCode Available | 2 | 5 |
| LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Nov 14, 2024 | Earth ObservationInstruction Following | CodeCode Available | 2 | 5 |