| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Dec 16, 2024 | 3D Reconstruction, 4k | Code Available | 3 |
| MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio | Mar 7, 2025 | Video Generation | Code Available | 3 |
| Efficient and Robust Automated Machine Learning | Dec 1, 2015 | AutoML, Bayesian Optimization | Code Available | 3 |
| MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem | May 20, 2025 | Mathematical Reasoning, scientific discovery | Code Available | 3 |
| SynSin: End-to-end View Synthesis from a Single Image | Dec 18, 2019 | Novel View Synthesis | Code Available | 3 |
| An Extensible Framework for Open Heterogeneous Collaborative Perception | Jan 25, 2024 | | Code Available | 3 |
| Multi-Head RAG: Solving Multi-Aspect Problems with LLMs | Jun 7, 2024 | Benchmarking, Decoder | Code Available | 3 |
| Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence | Feb 12, 2020 | BIG-bench Machine Learning, GPU | Code Available | 3 |
| MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales | Oct 20, 2018 | BIG-bench Machine Learning, Distributed Computing | Code Available | 3 |
| Simulating the Real World: A Unified Survey of Multimodal Generative Models | Mar 6, 2025 | 3D Generation, Survey | Code Available | 3 |
| AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment | Mar 30, 2021 | AutoML, Stock Prediction | Code Available | 3 |
| VideoRoPE: What Makes for Good Video Rotary Position Embedding? | Feb 7, 2025 | Hallucination, Position | Code Available | 3 |
| Green AI | Jul 22, 2019 | Deep Learning | Code Available | 3 |
| Bag of Freebies for Training Object Detection Neural Networks | Feb 11, 2019 | General Classification, image-classification | Code Available | 3 |
| Characterizing signal propagation to close the performance gap in unnormalized ResNets | Jan 21, 2021 | | Code Available | 3 |
| SnapKV: LLM Knows What You are Looking for Before Generation | Apr 22, 2024 | 16k, GPU | Code Available | 3 |
| Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond | Oct 10, 2024 | Large Language Model, Recommendation Systems | Code Available | 3 |
| Distributional Generalization: A New Kind of Generalization | Sep 17, 2020 | 2D Object Detection | Code Available | 3 |
| Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space | May 21, 2025 | | Code Available | 3 |
| ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning | Dec 4, 2024 | Attribute, Time Series | Code Available | 3 |
| Bilinear Attention Networks | May 21, 2018 | Visual Question Answering, Visual Question Answering (VQA) | Code Available | 3 |
| Caption Anything: Interactive Image Description with Diverse Multimodal Controls | May 4, 2023 | controllable image captioning, Image Captioning | Code Available | 3 |
| Parametric UMAP embeddings for representation and semi-supervised learning | Sep 27, 2020 | Dimensionality Reduction | Code Available | 3 |
| Taming Diffusion Probabilistic Models for Character Control | Apr 23, 2024 | Computational Efficiency, Diversity | Code Available | 3 |
| Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Nov 24, 2016 | 2D Human Pose Estimation, 2D Pose Estimation | Code Available | 3 |
| FIFO-Diffusion: Generating Infinite Videos from Text without Training | May 19, 2024 | Text-to-Video Generation, Video Generation | Code Available | 3 |
| Neural Volume Rendering: NeRF And Beyond | Dec 17, 2020 | NeRF | Code Available | 3 |
| Aggregated Contextual Transformations for High-Resolution Image Inpainting | Apr 3, 2021 | Image Inpainting, Texture Synthesis | Code Available | 3 |
| Training-Free Long-Context Scaling of Large Language Models | Feb 27, 2024 | 16k | Code Available | 3 |
| CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization | Apr 30, 2020 | Deep Learning | Code Available | 3 |
| An RML-FNML module for Python user-defined functions in Morph-KGC | Apr 1, 2024 | Data Integration, Knowledge Graphs | Code Available | 3 |
| Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform | Apr 9, 2018 | Image Super-Resolution, Semantic Segmentation | Code Available | 3 |
| A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications | Jun 14, 2025 | Information Retrieval, Survey | Code Available | 3 |
| Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning | Sep 19, 2023 | EEG, NMT | Code Available | 3 |
| Unity: A General Platform for Intelligent Agents | Sep 7, 2018 | Reinforcement Learning, Unity | Code Available | 3 |
| MA-Net: A Multi-Scale Attention Network for Liver and Tumor Segmentation | Sep 21, 2020 | Image Segmentation, Medical Image Segmentation | Code Available | 3 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language Modeling, Language Modelling | Code Available | 3 |
| TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation | May 12, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration | Dec 2, 2024 | Image Restoration, Incremental Learning | Code Available | 3 |
| PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements | Jul 22, 2024 | Chatbot | Code Available | 3 |
| Wake Word Detection with Alignment-Free Lattice-Free MMI | May 17, 2020 | Decoder | Code Available | 3 |
| RemoteSAM: Towards Segment Anything for Earth Observation | May 23, 2025 | Attribute, Earth Observation | Code Available | 3 |
| Large-Scale Intelligent Microservices | Sep 17, 2020 | Anomaly Detection | Code Available | 3 |
| TF-GNN: Graph Neural Networks in TensorFlow | Jul 7, 2022 | Graph Learning, Graph Sampling | Code Available | 3 |
| VmambaIR: Visual State Space Model for Image Restoration | Mar 18, 2024 | Denoising, Image Restoration | Code Available | 3 |
| MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning | May 20, 2024 | Continual Pretraining, Mathematical Reasoning | Code Available | 3 |
| UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Aug 1, 2024 | | Code Available | 3 |
| Cubify Anything: Scaling Indoor 3D Object Detection | Dec 5, 2024 | 3D Object Detection, Object | Code Available | 3 |
| Lossless data compression by large models | Jun 24, 2024 | Data Compression | Code Available | 3 |
| LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens | Feb 21, 2024 | 8k | Code Available | 3 |