| MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data | Jun 24, 2024 | Data AugmentationOptical Character Recognition (OCR) | CodeCode Available | 5 |
| LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention | Mar 28, 2023 | Instruction FollowingLanguage Modelling | CodeCode Available | 5 |
| InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds | Mar 29, 2024 | 3D ReconstructionNovel View Synthesis | CodeCode Available | 5 |
| AugLy: Data Augmentations for Robustness | Jan 17, 2022 | Adversarial RobustnessData Augmentation | CodeCode Available | 5 |
| The Rise and Potential of Large Language Model Based Agents: A Survey | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Jun 24, 2024 | Representation LearningVisual Grounding | CodeCode Available | 5 |
| Feature Refinement to Improve High Resolution Image Inpainting | Jun 27, 2022 | Image InpaintingVocal Bursts Intensity Prediction | CodeCode Available | 5 |
| Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection | Nov 23, 2024 | Face SwappingSynthetic Image Detection | CodeCode Available | 5 |
| 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion | Sep 19, 2024 | | CodeCode Available | 5 |
| Tree of Thoughts: Deliberate Problem Solving with Large Language Models | May 17, 2023 | Arithmetic ReasoningDecision Making | CodeCode Available | 5 |
| SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation | Jan 16, 2025 | Benchmarking | CodeCode Available | 5 |
| Monolith: Real Time Recommendation System With Collisionless Embedding Table | Sep 16, 2022 | | CodeCode Available | 5 |
| Consistency Models | Mar 2, 2023 | ColorizationImage Generation | CodeCode Available | 5 |
| Process Reinforcement through Implicit Rewards | Feb 3, 2025 | MathReinforcement Learning (RL) | CodeCode Available | 5 |
| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Mar 13, 2023 | CPUGPU | CodeCode Available | 5 |
| RealFusion: 360° Reconstruction of Any Object from a Single Image | Feb 21, 2023 | 3D ReconstructionObject | CodeCode Available | 5 |
| YOLOv6 v3.0: A Full-Scale Reloading | Jan 13, 2023 | GPUObject Detection | CodeCode Available | 5 |
| Text-to-Image Rectified Flow as Plug-and-Play Priors | Jun 5, 2024 | 3D GenerationText to 3D | CodeCode Available | 5 |
| Agents: An Open-source Framework for Autonomous Language Agents | Sep 14, 2023 | | CodeCode Available | 5 |
| MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention | Apr 22, 2025 | GPU | CodeCode Available | 5 |
| ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models | Jun 21, 2024 | | CodeCode Available | 5 |
| LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning | Jun 23, 2025 | Reinforcement Learning (RL)Text Generation | CodeCode Available | 5 |
| LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models | Oct 9, 2023 | GSM8KIn-Context Learning | CodeCode Available | 5 |
| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 5 |
| Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Dec 18, 2024 | 3D Reconstruction4k | CodeCode Available | 5 |
| OPT: Open Pre-trained Transformer Language Models | May 2, 2022 | DecoderHate Speech Detection | CodeCode Available | 5 |
| Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer | Oct 10, 2024 | | CodeCode Available | 5 |
| CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X | Mar 30, 2023 | BenchmarkingCode Generation | CodeCode Available | 5 |
| Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Feb 28, 2024 | Blind Docking | CodeCode Available | 5 |
| Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI | Oct 11, 2024 | Uncertainty Quantification | CodeCode Available | 5 |
| skfolio: Portfolio Optimization in Python | Jul 5, 2025 | ManagementPortfolio Optimization | CodeCode Available | 5 |
| Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG | Jan 15, 2025 | Natural Language UnderstandingRAG | CodeCode Available | 5 |
| Instruction-Following Evaluation for Large Language Models | Nov 14, 2023 | Instruction Following | CodeCode Available | 5 |
| ShowUI: One Vision-Language-Action Model for GUI Visual Agent | Nov 26, 2024 | Instruction FollowingNatural Language Visual Grounding | CodeCode Available | 5 |
| NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 5 |
| SpatialTracker: Tracking Any 2D Pixels in 3D Space | Apr 5, 2024 | | CodeCode Available | 5 |
| Autoformalization in the Era of Large Language Models: A Survey | May 29, 2025 | Automated Theorem Proving | CodeCode Available | 5 |
| BM25S: Orders of magnitude faster lexical search via eager sparse scoring | Jul 4, 2024 | Passage RetrievalRetrieval | CodeCode Available | 5 |
| DEIM: DETR with Improved Matching for Fast Convergence | Dec 5, 2024 | Data AugmentationGPU | CodeCode Available | 5 |
| UQLM: A Python Package for Uncertainty Quantification in Large Language Models | Jul 8, 2025 | HallucinationUncertainty Quantification | CodeCode Available | 5 |
| Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Nov 2, 2022 | Contrastive Learningimage-classification | CodeCode Available | 5 |
| ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Aug 12, 2024 | Video Generation | CodeCode Available | 5 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation | Jan 12, 2025 | RAGRetrieval | CodeCode Available | 5 |
| SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Aug 8, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 5 |
| WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Jun 14, 2023 | Code GenerationHumanEval | CodeCode Available | 5 |
| Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Feb 2, 2024 | Acoustic Scene ClassificationAudio captioning | CodeCode Available | 5 |
| Long-term Forecasting with TiDE: Time-series Dense Encoder | Apr 17, 2023 | Anomaly DetectionDecoder | CodeCode Available | 5 |
| From System 1 to System 2: A Survey of Reasoning Large Language Models | Feb 24, 2025 | Logical Reasoning | CodeCode Available | 5 |
| Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions | Feb 10, 2025 | | CodeCode Available | 5 |