| SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Jan 22, 2024 | Diversity, GSM8K | Code Available | 2 |
| ChainerCV: a Library for Deep Learning in Computer Vision | Aug 28, 2017 | Deep Learning, object-detection | Code Available | 2 |
| Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Sep 17, 2024 | In-Context Learning, RAG | Code Available | 2 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Object, object-detection | Code Available | 2 |
| Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models | Mar 30, 2023 | Video Alignment, Video Editing | Code Available | 2 |
| Conformal Symplectic Optimization for Stable Reinforcement Learning | Dec 3, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 2 |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Mar 4, 2022 | Decoder, GPU | Code Available | 2 |
| STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion | Jan 3, 2024 | 3D Human Pose Estimation, Human Mesh Recovery | Code Available | 2 |
| LongReward: Improving Long-context Large Language Models with AI Feedback | Oct 28, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 2 |
| Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey | Feb 8, 2025 | Fairness, RAG | Code Available | 2 |
| Deformable One-shot Face Stylization via DINO Semantic Guidance | Mar 1, 2024 | One-Shot Face Stylization | Code Available | 2 |
| ProcessPainter: Learn Painting Process from Sequence Data | Jun 10, 2024 | Denoising, Image Generation | Code Available | 2 |
| Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation | Aug 24, 2023 | Image-to-Image Translation | Code Available | 2 |
| Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Oct 6, 2023 | Code Generation, Decision Making | Code Available | 2 |
| Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Feb 22, 2024 | Music Generation | Code Available | 2 |
| Learning to Compress Prompts with Gist Tokens | Apr 17, 2023 | Decoder | Code Available | 2 |
| TRADES: Generating Realistic Market Simulations with Diffusion Models | Jan 31, 2025 | Denoising | Code Available | 2 |
| SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals | May 28, 2024 | Contrastive Learning, Representation Learning | Code Available | 2 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | Code Available | 2 |
| PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction | Jan 16, 2024 | Surface Reconstruction | Code Available | 2 |
| FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference | Feb 28, 2025 | | Code Available | 2 |
| Heterogeneous Multi-Robot Reinforcement Learning | Jan 17, 2023 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 2 |
| DETR Doesn't Need Multi-Scale or Locality Design | Aug 3, 2023 | Decoder | Code Available | 2 |
| Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Apr 25, 2024 | Decoder, Semantic Segmentation | Code Available | 2 |
| Segment and Caption Anything | Dec 1, 2023 | Caption Generation, object-detection | Code Available | 2 |
| Attention as a Hypernetwork | Jun 9, 2024 | | Code Available | 2 |
| Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions | Mar 25, 2024 | Attribute | Code Available | 2 |
| Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Mar 27, 2025 | Text to 3D | Code Available | 2 |
| Ontology Embedding: A Survey of Methods, Applications and Resources | Jun 16, 2024 | Logical Reasoning, Ontology Embedding | Code Available | 2 |
| 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Aug 25, 2024 | Computational Efficiency, Hyperspectral Image Classification | Code Available | 2 |
| Scaling Diffusion Transformers Efficiently via μP | May 21, 2025 | Image Generation, Text to Image Generation | Code Available | 2 |
| MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Jul 23, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond | Oct 9, 2024 | Benchmarking | Code Available | 2 |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Jan 12, 2023 | Semantic Segmentation, Time Series | Code Available | 2 |
| RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Mar 8, 2024 | Audio Generation, Computational Efficiency | Code Available | 2 |
| LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings | Oct 1, 2022 | Graph Representation Learning, Knowledge Graph Completion | Code Available | 2 |
| Optimal Flow Matching: Learning Straight Trajectories in Just One Step | Mar 19, 2024 | | Code Available | 2 |
| DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences | Mar 8, 2024 | | Code Available | 2 |
| Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Apr 19, 2024 | Earth Observation, Segmentation | Code Available | 2 |
| g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin | Mar 20, 2022 | Part-Of-Speech Tagging, Polyphone disambiguation | Code Available | 2 |
| LangProp: A code optimization framework using Large Language Models applied to driving | Jan 18, 2024 | Autonomous Driving, Code Generation | Code Available | 2 |
| LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Feb 28, 2022 | Document Image Classification, document understanding | Code Available | 2 |
| GrootVL: Tree Topology is All You Need in State Space Model | Jun 4, 2024 | All, image-classification | Code Available | 2 |
| Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning | Jun 6, 2024 | Multi-Task Learning, Vulnerability Detection | Code Available | 2 |
| Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | May 29, 2023 | Contrastive Learning, Image Reconstruction | Code Available | 2 |
| CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models | Nov 28, 2023 | Dialogue Generation | Code Available | 2 |
| Towards Evaluating and Building Versatile Large Language Models for Medicine | Aug 22, 2024 | Multiple-choice, named-entity-recognition | Code Available | 2 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis | Mar 24, 2022 | Denoising, Image Denoising | Code Available | 2 |
| AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms | Feb 21, 2025 | Scheduling | Code Available | 2 |