| AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Jul 17, 2024 | Autonomous Driving, Backdoor Attack | Code Available | 3 | 5 |
| Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation | Jun 10, 2024 | Chunking, Speech Separation | Code Available | 3 | 5 |
| Safety Assessment of Chinese Large Language Models | Apr 20, 2023 | | Code Available | 3 | 5 |
| Stable Flow: Vital Layers for Training-Free Image Editing | Nov 21, 2024 | Text-based Image Editing | Code Available | 3 | 5 |
| UltraFeedback: Boosting Language Models with Scaled AI Feedback | Oct 2, 2023 | Language Modelling | Code Available | 3 | 5 |
| S-LoRA: Serving Thousands of Concurrent LoRA Adapters | Nov 6, 2023 | GPU, parameter-efficient fine-tuning | Code Available | 3 | 5 |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Dec 18, 2024 | Computational Efficiency, Distributed Computing | Code Available | 3 | 5 |
| Pre-Training with Whole Word Masking for Chinese BERT | Jun 19, 2019 | Document Classification, General Classification | Code Available | 3 | 5 |
| Generic 3D Diffusion Adapter Using Controlled Multi-View Editing | Mar 18, 2024 | 3D Generation, Image Generation | Code Available | 3 | 5 |
| AnyTop: Character Animation Diffusion with Any Topology | Feb 24, 2025 | Denoising | Code Available | 3 | 5 |
| EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | Mar 22, 2023 | 3D Object Detection, 6D Pose Estimation using RGB | Code Available | 3 | 5 |
| U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers | May 4, 2024 | Image Generation, Inductive Bias | Code Available | 3 | 5 |
| XCiT: Cross-Covariance Image Transformers | Jun 17, 2021 | image-classification, Image Classification | Code Available | 3 | 5 |
| 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Oct 24, 2024 | 3D Generation, 3D geometry | Code Available | 3 | 5 |
| ERNIE: Enhanced Representation through Knowledge Integration | Apr 19, 2019 | Chinese Named Entity Recognition, Chinese Sentence Pair Classification | Code Available | 3 | 5 |
| PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational Paths | Feb 18, 2025 | RAG, Retrieval | Code Available | 3 | 5 |
| LoLCATs: On Low-Rank Linearizing of Large Language Models | Oct 14, 2024 | MMLU | Code Available | 3 | 5 |
| Stitch it in Time: GAN-Based Facial Editing of Real Videos | Jan 20, 2022 | Facial Editing | Code Available | 3 | 5 |
| ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Mar 4, 2024 | Denoising, Image Generation | Code Available | 3 | 5 |
| Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | Feb 28, 2024 | Decoder, Image Generation | Code Available | 3 | 5 |
| Time Series Classification from Scratch with Deep Neural Networks: A Strong Baseline | Nov 20, 2016 | General Classification, Time Series | Code Available | 3 | 5 |
| DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes | Sep 6, 2024 | Video Generation | Code Available | 3 | 5 |
| SwinIR: Image Restoration Using Swin Transformer | Aug 23, 2021 | Color Image Denoising, Denoising | Code Available | 3 | 5 |
| PiML Toolbox for Interpretable Machine Learning Model Development and Diagnostics | May 7, 2023 | Fairness, Interpretable Machine Learning | Code Available | 3 | 5 |
| Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer | May 23, 2024 | 3D Generation, 3D Reconstruction | Code Available | 3 | 5 |
| SemCity: Semantic Scene Generation with Triplane Diffusion | Mar 12, 2024 | Scene Generation | Code Available | 3 | 5 |
| MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts | Jan 8, 2024 | Mamba, Mixture-of-Experts | Code Available | 3 | 5 |
| Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics | Apr 25, 2024 | Audio Classification, Transfer Learning | Code Available | 3 | 5 |
| VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks | Mar 1, 2024 | Image Classification, Image Generation | Code Available | 3 | 5 |
| StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis | Jan 23, 2023 | Image Generation, Text-to-Image Generation | Code Available | 3 | 5 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Apr 25, 2024 | GSM8K, HellaSwag | Code Available | 3 | 5 |
| Revisiting Image Pyramid Structure for High Resolution Salient Object Detection | Sep 20, 2022 | Dichotomous Image Segmentation, Object Detection | Code Available | 3 | 5 |
| Travel Time Prediction using Tree-Based Ensembles | May 28, 2020 | Prediction | Code Available | 3 | 5 |
| All-atom Diffusion Transformers: Unified generative modelling of molecules and materials | Mar 5, 2025 | All, Unconditional Crystal Generation | Code Available | 3 | 5 |
| CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation | Oct 12, 2024 | Conditional Image Generation, GPU | Code Available | 3 | 5 |
| MambaGlue: Fast and Robust Local Feature Matching With Mamba | Feb 1, 2025 | Mamba | Code Available | 3 | 5 |
| Sparser, Better, Faster, Stronger: Sparsity Detection for Efficient Automatic Differentiation | Jan 29, 2025 | | Code Available | 3 | 5 |
| Neural networks for abstraction and reasoning: Towards broad generalization in machines | Feb 5, 2024 | ARC, Visual Reasoning | Code Available | 3 | 5 |
| Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models | Nov 20, 2023 | Image Generation | Code Available | 3 | 5 |
| RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Jan 9, 2024 | GPU, Math | Code Available | 3 | 5 |
| Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation | Aug 20, 2024 | Code Completion, Code Generation | Code Available | 3 | 5 |
| OneFormer: One Transformer to Rule Universal Image Segmentation | Nov 10, 2022 | Instance Segmentation, Panoptic Segmentation | Code Available | 3 | 5 |
| The Surprising Effectiveness of Test-Time Training for Few-Shot Learning | Nov 11, 2024 | ARC, Few-Shot Learning | Code Available | 3 | 5 |
| Prefix-Tuning: Optimizing Continuous Prompts for Generation | Jan 1, 2021 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| Tina: Tiny Reasoning Models via LoRA | Apr 22, 2025 | Reinforcement Learning (RL) | Code Available | 3 | 5 |
| Pushing the limits of raw waveform speaker recognition | Mar 16, 2022 | Self-Supervised Learning, Speaker Recognition | Code Available | 3 | 5 |
| Discovering and exploring cases of educational source code plagiarism with Dolos | Feb 16, 2024 | | Code Available | 3 | 5 |
| UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation | Jun 15, 2021 | Speech Synthesis, text-to-speech | Code Available | 3 | 5 |
| LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models | Apr 4, 2023 | Arithmetic Reasoning, Language Modelling | Code Available | 3 | 5 |
| PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies | Jun 9, 2022 | 3D Classification, 3D Part Segmentation | Code Available | 3 | 5 |