| SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models | Mar 14, 2024 | BlockingGPU | CodeCode Available | 4 | 5 |
| Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Aug 12, 2024 | GSM8KMath | CodeCode Available | 4 | 5 |
| Data quality dimensions for fair AI | May 11, 2023 | ClassificationFairness | CodeCode Available | 4 | 5 |
| AnyText: Multilingual Visual Text Generation And Editing | Nov 6, 2023 | Image GenerationOptical Character Recognition (OCR) | CodeCode Available | 4 | 5 |
| Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders | Dec 12, 2024 | Gaze Target Estimation | CodeCode Available | 4 | 5 |
| BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation | May 26, 2022 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 4 | 5 |
| SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | May 7, 2024 | Image ManipulationLanguage Modeling | CodeCode Available | 4 | 5 |
| TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control | Feb 24, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 4 | 5 |
| CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models | Mar 24, 2025 | | CodeCode Available | 4 | 5 |
| Kubric: A scalable dataset generator | Mar 7, 2022 | FairnessNeRF | CodeCode Available | 4 | 5 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Aug 8, 2024 | ChunkingFact Checking | CodeCode Available | 4 | 5 |
| R^2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | May 31, 2024 | 3DGSNeRF | CodeCode Available | 4 | 5 |
| AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Jun 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| RecBole 2.0: Towards a More Up-to-Date Recommendation Library | Jun 15, 2022 | BenchmarkingData Augmentation | CodeCode Available | 4 | 5 |
| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement LearningLanguage Modeling | CodeCode Available | 4 | 5 |
| IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching | Sep 1, 2024 | Patch MatchingStereo Matching | CodeCode Available | 4 | 5 |
| Long Context Transfer from Language to Vision | Jun 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| RealisDance: Equip controllable character animation with realistic hands | Sep 10, 2024 | | CodeCode Available | 4 | 5 |
| NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Jul 18, 2024 | Experimental DesignGPU | CodeCode Available | 4 | 5 |
| Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning | May 6, 2025 | Image Generation | CodeCode Available | 4 | 5 |
| TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Oct 30, 2024 | model | CodeCode Available | 4 | 5 |
| A Closer Look at Deep Learning Methods on Tabular Datasets | Jul 1, 2024 | AttributeDeep Learning | CodeCode Available | 4 | 5 |
| Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking | Mar 14, 2024 | GSM8KLanguage Modelling | CodeCode Available | 4 | 5 |
| Magicoder: Empowering Code Generation with OSS-Instruct | Dec 4, 2023 | Code GenerationHumanEval | CodeCode Available | 4 | 5 |
| Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models | Apr 15, 2025 | Humanoid ControlReinforcement Learning (RL) | CodeCode Available | 4 | 5 |
| Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | Oct 6, 2023 | GPUImage Generation | CodeCode Available | 4 | 5 |
| XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL | Jul 7, 2025 | Text to SQLText-To-SQL | CodeCode Available | 4 | 5 |
| VM-UNet: Vision Mamba UNet for Medical Image Segmentation | Feb 4, 2024 | Image SegmentationMamba | CodeCode Available | 4 | 5 |
| FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy | Jul 1, 2023 | Federated LearningPersonalized Federated Learning | CodeCode Available | 4 | 5 |
| Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering | Feb 26, 2024 | Evidence SelectionOpen-Ended Question Answering | CodeCode Available | 4 | 5 |
| NExT-GPT: Any-to-Any Multimodal LLM | Sep 11, 2023 | AI Agent | CodeCode Available | 4 | 5 |
| Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning | Jun 3, 2025 | Code Generationreinforcement-learning | CodeCode Available | 4 | 5 |
| Eliminating Domain Bias for Federated Learning in Representation Space | Nov 25, 2023 | Federated LearningPrivacy Preserving | CodeCode Available | 4 | 5 |
| MotionClone: Training-Free Motion Cloning for Controllable Video Generation | Jun 8, 2024 | DenoisingMotion Generation | CodeCode Available | 4 | 5 |
| Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation | Feb 23, 2025 | Benchmarking | CodeCode Available | 4 | 5 |
| GIM: Learning Generalizable Image Matcher From Internet Videos | Feb 16, 2024 | 3D ReconstructionCamera Pose Estimation | CodeCode Available | 4 | 5 |
| Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Dec 4, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 4 | 5 |
| Pearl: A Production-ready Reinforcement Learning Agent | Dec 6, 2023 | Benchmarkingreinforcement-learning | CodeCode Available | 4 | 5 |
| Towards All-in-One Medical Image Re-Identification | Mar 11, 2025 | All | CodeCode Available | 4 | 5 |
| LocAgent: Graph-Guided LLM Agents for Code Localization | Mar 12, 2025 | GitHub issue resolutionNavigate | CodeCode Available | 4 | 5 |
| GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated Learning | Aug 20, 2023 | FairnessFederated Learning | CodeCode Available | 4 | 5 |
| Data-centric Artificial Intelligence: A Survey | Mar 17, 2023 | Survey | CodeCode Available | 4 | 5 |
| Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Mar 9, 2025 | Domain GeneralizationObject Detection | CodeCode Available | 4 | 5 |
| KeyPoint Relative Position Encoding for Face Recognition | Mar 21, 2024 | Face RecognitionGait Recognition | CodeCode Available | 4 | 5 |
| Diffusion Model-Based Image Editing: A Survey | Feb 27, 2024 | DenoisingImage Generation | CodeCode Available | 4 | 5 |
| Planning-oriented Autonomous Driving | Dec 20, 2022 | Autonomous DrivingBench2Drive | CodeCode Available | 4 | 5 |
| Generation of Training Data from HD Maps in the Lanelet2 Framework | Jul 24, 2024 | | CodeCode Available | 4 | 5 |
| NAFSSR: Stereo Image Super-Resolution Using NAFNet | Apr 19, 2022 | Image RestorationImage Super-Resolution | CodeCode Available | 4 | 5 |
| Visual Mamba: A Survey and New Outlooks | Apr 29, 2024 | MambaSurvey | CodeCode Available | 4 | 5 |
| Weighted-Reward Preference Optimization for Implicit Model Fusion | Dec 4, 2024 | model | CodeCode Available | 4 | 5 |