| Improving Diffusion Models for Authentic Virtual Try-on in the Wild | Mar 8, 2024 | Virtual Try-on | CodeCode Available | 7 |
| Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena | Jun 9, 2023 | ChatbotLanguage Modelling | CodeCode Available | 7 |
| The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search | Apr 10, 2025 | scientific discovery | CodeCode Available | 7 |
| Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | Feb 29, 2024 | Language ModellingMamba | CodeCode Available | 7 |
| Skywork-R1V3 Technical Report | Jul 8, 2025 | cross-modal alignmentMathematical Reasoning | CodeCode Available | 7 |
| Interactive Prompt Debugging with Sequence Salience | Apr 11, 2024 | Sentencetext-classification | CodeCode Available | 7 |
| gsplat: An Open-Source Library for Gaussian Splatting | Sep 10, 2024 | | CodeCode Available | 7 |
| GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers | Oct 31, 2022 | GPULanguage Modelling | CodeCode Available | 7 |
| EvoAgentX: An Automated Framework for Evolving Agentic Workflows | Jul 4, 2025 | Code GenerationMath | CodeCode Available | 7 |
| DataComp-LM: In search of the next generation of training sets for language models | Jun 17, 2024 | Language ModellingMMLU | CodeCode Available | 7 |
| VITA: Towards Open-Source Interactive Omni Multimodal LLM | Aug 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| Segment Anything in Medical Images and Videos: Benchmark and Deployment | Aug 6, 2024 | BenchmarkingSegmentation | CodeCode Available | 7 |
| LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models | Apr 8, 2024 | | CodeCode Available | 7 |
| Cradle: Empowering Foundation Agents Towards General Computer Control | Mar 5, 2024 | Efficient Exploration | CodeCode Available | 7 |
| OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | Apr 11, 2024 | Benchmarking | CodeCode Available | 7 |
| Efficient Track Anything | Nov 28, 2024 | ObjectSegmentation | CodeCode Available | 7 |
| Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach | Apr 7, 2024 | Efficient ExplorationHyperparameter Optimization | CodeCode Available | 7 |
| Embedding Atlas: Low-Friction, Interactive Embedding Visualization | May 9, 2025 | Friction | CodeCode Available | 7 |
| A Library for Learning Neural Operators | Dec 13, 2024 | Operator learning | CodeCode Available | 7 |
| Kimi k1.5: Scaling Reinforcement Learning with LLMs | Jan 22, 2025 | Mathreinforcement-learning | CodeCode Available | 7 |
| AutoCodeRover: Autonomous Program Improvement | Apr 8, 2024 | Bug fixingCode Search | CodeCode Available | 7 |
| S*: Test Time Scaling for Code Generation | Feb 20, 2025 | Code GenerationMath | CodeCode Available | 7 |
| RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer | Jul 24, 2024 | Data AugmentationDecoder | CodeCode Available | 7 |
| AI-Researcher: Autonomous Scientific Innovation | May 24, 2025 | scientific discovery | CodeCode Available | 7 |
| HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models | May 23, 2024 | HippocampusKnowledge Graphs | CodeCode Available | 7 |
| PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Jan 10, 2024 | GPUImage Generation | CodeCode Available | 7 |
| Large Language Model Agent: A Survey on Methodology, Applications and Challenges | Mar 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers | May 9, 2024 | | CodeCode Available | 7 |
| SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild | Mar 24, 2025 | Instruction FollowingMath | CodeCode Available | 7 |
| Logo-LLM: Local and Global Modeling with Large Language Models for Time Series Forecasting | May 16, 2025 | Time SeriesTime Series Forecasting | CodeCode Available | 7 |
| DragAnything: Motion Control for Anything using Entity Representation | Mar 12, 2024 | ObjectVideo Generation | CodeCode Available | 7 |
| EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning | Jan 25, 2025 | BenchmarkingEvolutionary Algorithms | CodeCode Available | 7 |
| Efficient MedSAMs: Segment Anything in Medical Images on Laptop | Dec 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 7 |
| Aligning Anime Video Generation with Human Feedback | Apr 14, 2025 | Video Generation | CodeCode Available | 7 |
| Chronos: Learning the Language of Time Series | Mar 12, 2024 | Gaussian ProcessesLanguage Modeling | CodeCode Available | 7 |
| Adding Conditional Control to Text-to-Image Diffusion Models | Feb 10, 2023 | Image GenerationLayout-to-Image Generation | CodeCode Available | 7 |
| OASIS: Open Agent Social Interaction Simulations with One Million Agents | Nov 18, 2024 | Large Language ModelRecommendation Systems | CodeCode Available | 7 |
| Muon is Scalable for LLM Training | Feb 24, 2025 | Computational Efficiency | CodeCode Available | 7 |
| An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents | May 21, 2025 | Reinforcement Learning (RL) | CodeCode Available | 7 |
| Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Jun 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions | Jul 11, 2024 | Image Animation | CodeCode Available | 7 |
| CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases | Aug 7, 2024 | HumanEvalmbpp | CodeCode Available | 7 |
| Adaptive In-conversation Team Building for Language Model Agents | May 29, 2024 | DiversityLanguage Modeling | CodeCode Available | 7 |
| Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Mar 8, 2023 | | CodeCode Available | 7 |
| MiniMax-01: Scaling Foundation Models with Lightning Attention | Jan 14, 2025 | Mixture-of-Experts | CodeCode Available | 7 |
| Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold | May 18, 2023 | Image ManipulationPoint Tracking | CodeCode Available | 7 |
| MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers | Jun 14, 2024 | Decoder | CodeCode Available | 7 |
| BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO | Jun 25, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 7 |
| EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Jun 24, 2024 | | CodeCode Available | 7 |
| DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference | Jan 9, 2024 | BenchmarkingText Generation | CodeCode Available | 7 |