| EmoFace: Audio-driven Emotional 3D Face Animation | Jul 17, 2024 | 3D Face Animation | CodeCode Available | 2 |
| OmniBench: Towards The Future of Universal Omni-Language Models | Sep 23, 2024 | Instruction Following | CodeCode Available | 2 |
| ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series Data | Mar 15, 2022 | BenchmarkingDomain Adaptation | CodeCode Available | 2 |
| ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Jul 9, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features | Apr 9, 2025 | Computational Efficiency | CodeCode Available | 2 |
| Specializing Smaller Language Models towards Multi-Step Reasoning | Jan 30, 2023 | MathModel Selection | CodeCode Available | 2 |
| Stitchable Neural Networks | Feb 13, 2023 | Image Classification | CodeCode Available | 2 |
| Respecting causality is all you need for training physics-informed neural networks | Mar 14, 2022 | AllAttribute | CodeCode Available | 2 |
| Towards Interpretable Mental Health Analysis with Large Language Models | Apr 6, 2023 | Causal Emotion EntailmentEmotion Recognition | CodeCode Available | 2 |
| Cross-Modality Safety Alignment | Jun 21, 2024 | Safety Alignment | CodeCode Available | 2 |
| FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization | Apr 21, 2024 | Anomaly DetectionPosition | CodeCode Available | 2 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Apr 8, 2025 | CPUGPU | CodeCode Available | 2 |
| Target conversation extraction: Source separation using turn-taking dynamics | Jul 15, 2024 | | CodeCode Available | 2 |
| Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Mar 14, 2024 | DenoisingMixture-of-Experts | CodeCode Available | 2 |
| GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models | Sep 6, 2023 | | CodeCode Available | 2 |
| BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions | Aug 19, 2023 | MMEOptical Character Recognition (OCR) | CodeCode Available | 2 |
| H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Sep 21, 2023 | | CodeCode Available | 2 |
| Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Jul 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Jul 18, 2023 | Knowledge Distillationobject-detection | CodeCode Available | 2 |
| normflows: A PyTorch Package for Normalizing Flows | Jan 26, 2023 | Image GenerationVariational Inference | CodeCode Available | 2 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System | Jun 27, 2023 | | CodeCode Available | 2 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentationobject-detection | CodeCode Available | 2 |
| Fortuna: A Library for Uncertainty Quantification in Deep Learning | Feb 8, 2023 | Bayesian InferenceBenchmarking | CodeCode Available | 2 |
| BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | Apr 3, 2022 | DecoderDepth Estimation | CodeCode Available | 2 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models | Mar 18, 2025 | AnatomyAttribute | CodeCode Available | 2 |
| Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems | Jul 9, 2024 | | CodeCode Available | 2 |
| Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark | May 14, 2024 | | CodeCode Available | 2 |
| A Diffusion-Based Generative Equalizer for Music Restoration | Mar 27, 2024 | Bandwidth ExtensionHallucination | CodeCode Available | 2 |
| Omnizart: A General Toolbox for Automatic Music Transcription | Jun 1, 2021 | Chord RecognitionDownbeat Tracking | CodeCode Available | 2 |
| MARLIN: Masked Autoencoder for facial video Representation LearnINg | Nov 12, 2022 | Action ClassificationAttribute | CodeCode Available | 2 |
| Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential | Jul 23, 2022 | | CodeCode Available | 2 |
| GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Sep 27, 2023 | Contrastive Learninggeo-localization | CodeCode Available | 2 |
| Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey | Sep 3, 2024 | Out-of-Distribution Detection | CodeCode Available | 2 |
| Towards Scalable Automated Alignment of LLMs: A Survey | Jun 3, 2024 | Survey | CodeCode Available | 2 |
| ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting | Jul 10, 2024 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams | Jun 10, 2025 | 3DGS3D Reconstruction | CodeCode Available | 2 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | BenchmarkingDocument Ranking | CodeCode Available | 2 |
| eVAE: Evolutionary Variational Autoencoder | Jan 1, 2023 | DisentanglementImage Generation | CodeCode Available | 2 |
| Let Images Give You More:Point Cloud Cross-Modal Training for Shape Analysis | Oct 9, 2022 | 3D Point Cloud ClassificationKnowledge Distillation | CodeCode Available | 2 |
| Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Jun 20, 2025 | Scene Generation | CodeCode Available | 2 |
| EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Nov 3, 2023 | Optical Flow EstimationSemantic Segmentation | CodeCode Available | 2 |
| L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Jan 27, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| Omni-Video: Democratizing Unified Video Understanding and Generation | Jul 8, 2025 | Video GenerationVideo Understanding | CodeCode Available | 2 |
| From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Jun 24, 2024 | BenchmarkingNeRF | CodeCode Available | 2 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 |
| MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Aug 22, 2024 | | CodeCode Available | 2 |
| Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Nov 26, 2024 | GPUImage Generation | CodeCode Available | 2 |
| Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Jun 24, 2021 | MuJoCoOpenAI Gym | CodeCode Available | 2 |