| Iterated Denoising Energy Matching for Sampling from Boltzmann Densities | Feb 9, 2024 | Denoising, Efficient Exploration | Code Available | 2 | 5 |
| CATCH: Channel-Aware multivariate Time Series Anomaly Detection via Frequency Patching | Oct 16, 2024 | Anomaly Detection, Time Series | Code Available | 2 | 5 |
| PennyLane: Automatic differentiation of hybrid quantum-classical computations | Nov 12, 2018 | BIG-bench Machine Learning, Quantum Machine Learning | Code Available | 2 | 5 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information Retrieval, Language Modeling | Code Available | 2 | 5 |
| Integrate Any Omics: Towards genome-wide data integration for patient stratification | Jan 15, 2024 | Data Integration, Diversity | Code Available | 2 | 5 |
| pyhgf: A neural network library for predictive coding | Oct 11, 2024 | Causal Discovery, Meta-Learning | Code Available | 2 | 5 |
| European Space Agency Benchmark for Anomaly Detection in Satellite Telemetry | Jun 25, 2024 | Anomaly Detection, Time Series | Code Available | 2 | 5 |
| From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models | Apr 24, 2024 | Instruction Following | Code Available | 2 | 5 |
| Soft Masked Mamba Diffusion Model for CT to MRI Conversion | Jun 22, 2024 | Computed Tomography (CT), Image Generation | Code Available | 2 | 5 |
| Med-Flamingo: a Multimodal Medical Few-shot Learner | Jul 27, 2023 | Medical Visual Question Answering, Question Answering | Code Available | 2 | 5 |
| Vision language models are blind: Failing to translate detailed visual features into words | Jul 9, 2024 | | Code Available | 2 | 5 |
| SNAKE: Shape-aware Neural 3D Keypoint Field | Jun 3, 2022 | Keypoint Detection | Code Available | 2 | 5 |
| Predicting Human Brain States with Transformer | Dec 11, 2024 | Language Modelling, Music Generation | Code Available | 2 | 5 |
| COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Sep 23, 2024 | Robot Task Planning, Task Planning | Code Available | 2 | 5 |
| CausalFormer: An Interpretable Transformer for Temporal Causal Discovery | Jun 24, 2024 | Causal Discovery, Time Series | Code Available | 2 | 5 |
| Synthetic Data RL: Task Definition Is All You Need | May 18, 2025 | All, GSM8K | Code Available | 2 | 5 |
| MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jan 30, 2024 | Autonomous Driving, Object | Code Available | 2 | 5 |
| Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data -- An Algorithm and A Benchmark | Jun 27, 2022 | | Code Available | 2 | 5 |
| Towards Reliable Advertising Image Generation Using Human Feedback | Aug 1, 2024 | Image Generation | Code Available | 2 | 5 |
| Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data | Jul 13, 2023 | 2D Human Pose Estimation, Pose Estimation | Code Available | 2 | 5 |
| Foundational Models Defining a New Era in Vision: A Survey and Outlook | Jul 25, 2023 | Benchmarking | Code Available | 2 | 5 |
| Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Sep 12, 2024 | Novel View Synthesis | Code Available | 2 | 5 |
| Registration based Few-Shot Anomaly Detection | Jul 15, 2022 | Anomaly Detection | Code Available | 2 | 5 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Sep 12, 2023 | Diagnostic, Natural Language Understanding | Code Available | 2 | 5 |
| VanillaNet: the Power of Minimalism in Deep Learning | May 22, 2023 | Deep Learning, Philosophy | Code Available | 2 | 5 |
| xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement | Jan 10, 2025 | Mamba, Speech Enhancement | Code Available | 2 | 5 |
| MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Mar 28, 2024 | Talking Head Generation | Code Available | 2 | 5 |
| FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching | Apr 1, 2024 | CPU, Image Registration | Code Available | 2 | 5 |
| HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting | Nov 29, 2023 | | Code Available | 2 | 5 |
| RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching | May 29, 2024 | Denoising, Protein Design | Code Available | 2 | 5 |
| High-dimensional mixed-categorical Gaussian processes with application to multidisciplinary design optimization for a green aircraft | Nov 10, 2023 | Bayesian Optimization, Cantilever Beam | Code Available | 2 | 5 |
| The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability | Jan 28, 2020 | BIG-bench Machine Learning, Fact Checking | Code Available | 2 | 5 |
| Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning | Feb 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval | Jan 31, 2024 | Retrieval, Text Retrieval | Code Available | 2 | 5 |
| FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression | Dec 5, 2024 | Descriptive, Visual Question Answering | Code Available | 2 | 5 |
| Unifying Voxel-based Representation with Transformer for 3D Object Detection | Jun 1, 2022 | 3D Object Detection, Decoder | Code Available | 2 | 5 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image Classification, Object Detection | Code Available | 2 | 5 |
| Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| PMaF: Deep Declarative Layers for Principal Matrix Features | Jun 26, 2023 | | Code Available | 2 | 5 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 | 5 |
| Nellie: Automated organelle segmentation, tracking, and hierarchical feature extraction in 2D/3D live-cell microscopy | Mar 20, 2024 | | Code Available | 2 | 5 |
| ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement | Dec 11, 2024 | Decoder, Image Segmentation | Code Available | 2 | 5 |
| EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing | Mar 13, 2025 | | Code Available | 2 | 5 |
| Improving Opus Low Bit Rate Quality with Neural Speech Synthesis | Aug 10, 2020 | Decoder, Speech Synthesis | Code Available | 2 | 5 |
| Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration | Apr 19, 2024 | Ensemble Learning | Code Available | 2 | 5 |
| Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert | Mar 29, 2023 | Contrastive Learning, Face Generation | Code Available | 2 | 5 |
| MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning | Nov 15, 2023 | Chart Understanding | Code Available | 2 | 5 |
| VBR: A Vision Benchmark in Rome | Apr 17, 2024 | Autonomous Vehicles, Benchmarking | Code Available | 2 | 5 |
| Eliminating Warping Shakes for Unsupervised Online Video Stitching | Mar 11, 2024 | Image Stitching, Video Stabilization | Code Available | 2 | 5 |
| Querying Databases with Function Calling | Jan 23, 2025 | | Code Available | 2 | 5 |