| High-Fidelity Neural Phonetic Posteriorgrams | Feb 27, 2024 | Voice Conversion | Code Available | 2 |
| Advances in APPFL: A Comprehensive and Extensible Federated Learning Framework | Sep 17, 2024 | Benchmarking, Federated Learning | Code Available | 2 |
| ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs | Nov 22, 2023 | | Code Available | 2 |
| KV Shifting Attention Enhances Language Modeling | Nov 29, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| MixFormer: End-to-End Tracking with Iterative Mixed Attention | Mar 21, 2022 | Semi-Supervised Video Object Segmentation, Video Object Tracking | Code Available | 2 |
| A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet | Mar 28, 2019 | Speech Synthesis | Code Available | 2 |
| Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement | May 20, 2023 | Protein Design, Retrieval | Code Available | 2 |
| CodeRAG-Bench: Can Retrieval Augment Code Generation? | Jun 20, 2024 | Code Generation, RAG | Code Available | 2 |
| Diffusion-SDF: Text-to-Shape via Voxelized Diffusion | Dec 6, 2022 | | Code Available | 2 |
| 3DTeethSeg'22: 3D Teeth Scan Segmentation and Labeling Challenge | May 29, 2023 | Anatomy, Segmentation | Code Available | 2 |
| General-purpose, long-context autoregressive modeling with Perceiver AR | Feb 15, 2022 | Density Estimation, Language Modelling | Code Available | 2 |
| Investigating the Scalability of Approximate Sparse Retrieval Algorithms to Massive Datasets | Jan 20, 2025 | Retrieval | Code Available | 2 |
| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action Classification, Action Recognition | Code Available | 2 |
| Vim4Path: Self-Supervised Vision Mamba for Histopathology Images | Apr 20, 2024 | Diagnostic, Mamba | Code Available | 2 |
| Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching | Jun 1, 2024 | Audio Generation, Video-to-Sound Generation | Code Available | 2 |
| FITS: Modeling Time Series with 10k Parameters | Jul 6, 2023 | Anomaly Detection, Time Series | Code Available | 2 |
| HMT: Hierarchical Memory Transformer for Long Context Language Processing | May 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| InstanceGen: Image Generation with Instance-level Instructions | May 8, 2025 | Image Generation | Code Available | 2 |
| VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft | Jun 9, 2024 | Management, Minecraft | Code Available | 2 |
| Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model | Apr 2, 2024 | Video Generation | Code Available | 2 |
| Manify: A Python Library for Learning Non-Euclidean Representations | Mar 12, 2025 | Representation Learning | Code Available | 2 |
| Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting | Jun 18, 2022 | Graph Learning, Graph Neural Network | Code Available | 2 |
| Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation | Jun 26, 2024 | Hallucination, Knowledge Base Question Answering | Code Available | 2 |
| TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis | May 20, 2025 | Contrastive Learning, Singing Voice Synthesis | Code Available | 2 |
| ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association | May 14, 2024 | 3D Multi-Object Tracking, Decoder | Code Available | 2 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Jul 17, 2023 | Instruction Following, Sentence | Code Available | 2 |
| PyCM: Multiclass confusion matrix library in Python | May 29, 2018 | General Classification | Code Available | 2 |
| WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose | May 20, 2020 | Head Pose Estimation, Pose Estimation | Code Available | 2 |
| 3D Clothed Human Reconstruction in the Wild | Jul 20, 2022 | Garment Reconstruction | Code Available | 2 |
| SCAMPS: Synthetics for Camera Measurement of Physiological Signals | Jun 8, 2022 | Descriptive, Diversity | Code Available | 2 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly Detection, Image-text matching | Code Available | 2 |
| CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection | Aug 28, 2024 | Anomaly Detection, Segmentation | Code Available | 2 |
| LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Mar 18, 2024 | | Code Available | 2 |
| Enhancing the Utility of Privacy-Preserving Cancer Classification using Synthetic Data | Jul 17, 2024 | Breast Cancer Detection, Cancer Classification | Code Available | 2 |
| An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Apr 18, 2024 | Contrastive Learning, CPU | Code Available | 2 |
| Global Tracking Transformers | Mar 24, 2022 | Multi-Object Tracking, Object | Code Available | 2 |
| Morphological Analyzer and Generator for Russian and Ukrainian Languages | Mar 25, 2015 | Morphological Analysis | Code Available | 2 |
| Multi-scale convolutional transformer network for motor imagery brain-computer interface | Apr 15, 2025 | 4-task Classification, Brain Computer Interface | Code Available | 2 |
| Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting | Jun 29, 2019 | Time Series, Time Series Analysis | Code Available | 2 |
| CGCOD: Class-Guided Camouflaged Object Detection | Dec 25, 2024 | Object, object-detection | Code Available | 2 |
| Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing Integration | Jan 24, 2025 | compressed sensing, Federated Learning | Code Available | 2 |
| Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition | Dec 15, 2024 | Computational Efficiency, Video Recognition | Code Available | 2 |
| Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions | Apr 1, 2022 | Self-Supervised Learning | Code Available | 2 |
| DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | Nov 8, 2024 | Pose Estimation | Code Available | 2 |
| BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents | Aug 11, 2023 | Benchmarking, Decision Making | Code Available | 2 |
| AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models | Oct 3, 2023 | Decision Making | Code Available | 2 |
| Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants | Oct 1, 2023 | Instruction Following | Code Available | 2 |
| A Closer Look into Mixture-of-Experts in Large Language Models | Jun 26, 2024 | Computational Efficiency, Diversity | Code Available | 2 |
| Snap-and-tune: combining deep learning and test-time optimization for high-fidelity cardiovascular volumetric meshing | Jun 9, 2025 | | Code Available | 2 |
| Universal Score-based Speech Enhancement with High Content Preservation | Jun 18, 2024 | Speech Enhancement | Code Available | 2 |