| ANIRA: An Architecture for Neural Network Inference in Real-Time Audio Applications | Jun 14, 2025 | Benchmarking | CodeCode Available | 3 | 5 |
| WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset | May 9, 2023 | ArticlesImage Captioning | CodeCode Available | 3 | 5 |
| A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Oct 19, 2024 | AllImage Restoration | CodeCode Available | 3 | 5 |
| Probabilistic Forecasting with Temporal Convolutional Neural Network | Jun 11, 2019 | Multivariate Time Series ForecastingProbabilistic Time Series Forecasting | CodeCode Available | 3 | 5 |
| Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation | Feb 12, 2025 | cross-modal alignmentmultimodal generation | CodeCode Available | 3 | 5 |
| MNN: A Universal and Efficient Inference Engine | Feb 27, 2020 | Deep LearningDiversity | CodeCode Available | 3 | 5 |
| OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning | May 13, 2025 | Reinforcement Learning (RL)Visual Reasoning | CodeCode Available | 3 | 5 |
| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question AnsweringKnowledge Graphs | CodeCode Available | 3 | 5 |
| Biomedical and Clinical English Model Packages in the Stanza Python NLP Library | Jul 29, 2020 | GPUNamed Entity Recognition | CodeCode Available | 3 | 5 |
| PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models | Mar 26, 2024 | Code CompletionFew-Shot Learning | CodeCode Available | 3 | 5 |
| Training-Free Efficient Video Generation via Dynamic Token Carving | May 22, 2025 | DenoisingVideo Generation | CodeCode Available | 3 | 5 |
| Position: Graph Foundation Models are Already Here | Feb 3, 2024 | Position | CodeCode Available | 3 | 5 |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Mar 24, 2024 | 2D Object DetectionComputational Efficiency | CodeCode Available | 3 | 5 |
| "Hey, that's not an ODE": Faster ODE Adjoints via Seminorms | Sep 20, 2020 | Time SeriesTime Series Analysis | CodeCode Available | 3 | 5 |
| BEVPoolv2: A Cutting-edge Implementation of BEVDet Toward Deployment | Nov 30, 2022 | | CodeCode Available | 3 | 5 |
| Texture Memory-Augmented Deep Patch-Based Image Inpainting | Sep 28, 2020 | Image InpaintingRetrieval | CodeCode Available | 3 | 5 |
| Towards ML Engineering: A Brief History Of TensorFlow Extended (TFX) | Sep 28, 2020 | | CodeCode Available | 3 | 5 |
| Unifying 3D Representation and Control of Diverse Robots with a Single Camera | Jul 11, 2024 | | CodeCode Available | 3 | 5 |
| Learning Neural Event Functions for Ordinary Differential Equations | Nov 8, 2020 | Point Processes | CodeCode Available | 3 | 5 |
| Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling | Mar 5, 2024 | Mamba | CodeCode Available | 3 | 5 |
| Multi-Concept Customization of Text-to-Image Diffusion | Dec 8, 2022 | Diffusion Personalization | CodeCode Available | 3 | 5 |
| ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora | Dec 31, 2020 | SentenceTranslation | CodeCode Available | 3 | 5 |
| GroundGrid:LiDAR Point Cloud Ground Segmentation and Terrain Estimation | May 24, 2024 | Autonomous VehiclesSegmentation | CodeCode Available | 3 | 5 |
| Build a Deep Neural Network model using CPUs Builds a feed-forward multilayer artificial neural network on an H2OFrame | Sep 1, 2015 | Fraud Detection | CodeCode Available | 3 | 5 |
| A Systematic Evaluation of Large Language Models of Code | Feb 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models | Sep 16, 2024 | DecoderDiversity | CodeCode Available | 3 | 5 |
| MetaFormer Baselines for Vision | Oct 24, 2022 | Domain GeneralizationImage Classification | CodeCode Available | 3 | 5 |
| A Parallelizable Lattice Rescoring Strategy with Neural Language Models | Mar 8, 2021 | ARCAutomatic Speech Recognition | CodeCode Available | 3 | 5 |
| LangSplat: 3D Language Gaussian Splatting | Dec 26, 2023 | NeRFObject Localization | CodeCode Available | 3 | 5 |
| End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation | Feb 23, 2022 | Speech Synthesis | CodeCode Available | 3 | 5 |
| A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal | Dec 12, 2022 | General KnowledgeGraph Embedding | CodeCode Available | 3 | 5 |
| BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical Search | Aug 13, 2024 | Information RetrievalRetrieval | CodeCode Available | 3 | 5 |
| Spectral Pruning for Recurrent Neural Networks | May 23, 2021 | Edge-computing | CodeCode Available | 3 | 5 |
| Pivotal Tuning for Latent-based Editing of Real Images | Jun 10, 2021 | Facial EditingImage Manipulation | CodeCode Available | 3 | 5 |
| Star Attention: Efficient LLM Inference over Long Sequences | Nov 26, 2024 | Computational Efficiency | CodeCode Available | 3 | 5 |
| NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine Learning | Mar 20, 2025 | Feature EngineeringPhysics-informed machine learning | CodeCode Available | 3 | 5 |
| The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report | Apr 14, 2025 | Super-Resolutionvalid | CodeCode Available | 3 | 5 |
| Longformer: The Long-Document Transformer | Apr 10, 2020 | DecoderLanguage Modeling | CodeCode Available | 3 | 5 |
| Multi-objective Asynchronous Successive Halving | Jun 23, 2021 | FairnessHyperparameter Optimization | CodeCode Available | 3 | 5 |
| Corrective Retrieval Augmented Generation | Jan 29, 2024 | RAGRetrieval | CodeCode Available | 3 | 5 |
| mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Feb 3, 2025 | Audio-Visual Speech RecognitionDecoder | CodeCode Available | 3 | 5 |
| Highly accurate protein structure prediction with AlphaFold | Jul 15, 2021 | PredictionProtein Folding | CodeCode Available | 3 | 5 |
| Artificial Intelligence Index Report 2024 | May 29, 2024 | | CodeCode Available | 3 | 5 |
| Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge | Jul 27, 2021 | 2D Object DetectionAutonomous Driving | CodeCode Available | 3 | 5 |
| LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs | Nov 3, 2021 | Few-Shot Learning | CodeCode Available | 3 | 5 |
| FiT: Flexible Vision Transformer for Diffusion Model | Feb 19, 2024 | Computational EfficiencyImage Cropping | CodeCode Available | 3 | 5 |
| OpenGraph: Towards Open Graph Foundation Models | Mar 2, 2024 | Data AugmentationGraph Learning | CodeCode Available | 3 | 5 |
| The Hidden Attention of Mamba Models | Mar 3, 2024 | Mamba | CodeCode Available | 3 | 5 |
| EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation | Jun 10, 2024 | Speech Enhancement | CodeCode Available | 3 | 5 |
| Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration | Jun 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |