| Large Language Models for Generative Information Extraction: A Survey | Dec 29, 2023 | Survey | Code Available | 3 |
| The Rise of Diffusion Models in Time-Series Forecasting | Jan 5, 2024 | Time Series, Time Series Analysis | Code Available | 3 |
| Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Feb 22, 2024 | 3D Human Pose Estimation, 3D Human Reconstruction | Code Available | 3 |
| Segment Anything Model for Road Network Graph Extraction | Mar 24, 2024 | Graph Learning, Graph Neural Network | Code Available | 3 |
| RS-Mamba for Large Remote Sensing Image Dense Prediction | Apr 3, 2024 | Building change detection for remote sensing images, Change Detection | Code Available | 3 |
| Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions | Apr 4, 2024 | Survey | Code Available | 3 |
| Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer | Apr 21, 2024 | Face Parsing, Semantic Parsing | Code Available | 3 |
| SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction | May 24, 2024 | Autonomous Driving, Motion Generation | Code Available | 3 |
| MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds | May 27, 2024 | 4D reconstruction, Pose Estimation | Code Available | 3 |
| Generative AI for Autonomous Driving: Frontiers and Opportunities | May 13, 2025 | Autonomous Driving, Video Generation | Code Available | 3 |
| Understanding and Minimising Outlier Features in Neural Network Training | May 29, 2024 | | Code Available | 3 |
| GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation | Jun 19, 2024 | Benchmarking, Image Generation | Code Available | 3 |
| LoRA-GA: Low-Rank Adaptation with Gradient Approximation | Jul 6, 2024 | GSM8K, parameter-efficient fine-tuning | Code Available | 3 |
| Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Jul 15, 2024 | Quantization | Code Available | 3 |
| UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization | Aug 12, 2024 | Layout Generation | Code Available | 3 |
| OpenResearcher: Unleashing AI for Accelerated Scientific Research | Aug 13, 2024 | RAG, Retrieval | Code Available | 3 |
| REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents | Nov 20, 2024 | GPU, Video Generation | Code Available | 3 |
| SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer | Dec 14, 2024 | Denoising, Image Generation | Code Available | 3 |
| Automating the Search for Artificial Life with Foundation Models | Dec 23, 2024 | Artificial Life, Ingenuity | Code Available | 3 |
| RadGPT: Constructing 3D Image-Text Tumor Datasets | Jan 8, 2025 | AI Agent, Anatomy | Code Available | 3 |
| Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval | Jun 9, 2025 | Dataset Generation, RAG | Code Available | 3 |
| Generalized Trajectory Scoring for End-to-end Multimodal Planning | Jun 7, 2025 | Autonomous Driving, Domain Generalization | Code Available | 3 |
| General-Reasoner: Advancing LLM Reasoning Across All Domains | May 20, 2025 | All, Math | Code Available | 3 |
| A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Oct 17, 2022 | CPU, GPU | Code Available | 3 |
| SNR-Aware Low-Light Image Enhancement | Jan 1, 2022 | Image Enhancement, Low-Light Image Enhancement | Code Available | 3 |
| Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models | May 4, 2023 | Instruction Following | Code Available | 3 |
| Scaling up Masked Diffusion Models on Text | Oct 24, 2024 | GSM8K, Language Modeling | Code Available | 3 |
| A Phylogenetic Approach to Genomic Language Modeling | Mar 4, 2025 | Language Modeling, Language Modelling | Code Available | 3 |
| SplatFormer: Point Transformer for Robust 3D Gaussian Splatting | Nov 10, 2024 | 3DGS, Novel View Synthesis | Code Available | 3 |
| Automated Hypothesis Validation with Agentic Sequential Falsifications | Feb 14, 2025 | Decision Making, Hallucination | Code Available | 3 |
| BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes | Mar 11, 2025 | Point Cloud Registration | Code Available | 3 |
| The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech | Sep 14, 2024 | Self-Supervised Learning, Transfer Learning | Code Available | 3 |
| Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Oct 9, 2024 | Benchmarking, Decision Making | Code Available | 3 |
| VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Jun 13, 2024 | Dense Video Captioning, MVBench | Code Available | 3 |
| AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models | Aug 29, 2023 | Anomaly Detection, In-Context Learning | Code Available | 3 |
| BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark | Jul 10, 2024 | Imitation Learning | Code Available | 3 |
| MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion | Apr 28, 2025 | | Code Available | 3 |
| Recurrent Drafter for Fast Speculative Decoding in Large Language Models | Mar 14, 2024 | Benchmarking, Knowledge Distillation | Code Available | 3 |
| MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR Odometry | May 9, 2024 | All | Code Available | 3 |
| MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Feb 6, 2025 | Diagnostic, Large Language Model | Code Available | 3 |
| AER: Auto-Encoder with Regression for Time Series Anomaly Detection | Dec 27, 2022 | Anomaly Detection, Benchmarking | Code Available | 3 |
| Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | May 27, 2024 | GPU, Language Modeling | Code Available | 3 |
| DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models | Feb 8, 2022 | Diagnostic, Image Captioning | Code Available | 3 |
| Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning | Aug 22, 2024 | Data Integration | Code Available | 3 |
| Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference | Jun 16, 2024 | | Code Available | 3 |
| MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Nov 29, 2022 | GPU, Mixture-of-Experts | Code Available | 3 |
| When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Apr 18, 2024 | Contrastive Learning, Few-Shot Learning | Code Available | 3 |
| Blind Image Restoration via Fast Diffusion Inversion | May 29, 2024 | Deblurring, Image Restoration | Code Available | 3 |
| An Investigation of Incorporating Mamba for Speech Enhancement | May 10, 2024 | Mamba, Speech Enhancement | Code Available | 3 |
| Nuclei instance segmentation and classification in histopathology images with StarDist | Mar 3, 2022 | Classification, Instance Segmentation | Code Available | 3 |