| LiSA: LiDAR Localization with Semantic Awareness | Jan 1, 2024 | Knowledge Distillation, Semantic Segmentation | Code Available | 2 |
| TensorFlow Quantum: A Software Framework for Quantum Machine Learning | Mar 6, 2020 | BIG-bench Machine Learning, Meta-Learning | Code Available | 2 |
| GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Mar 17, 2025 | Autonomous Driving, Image Generation | Code Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks | Feb 29, 2024 | Benchmarking, Disentanglement | Code Available | 2 |
| LITA: Language Instructed Temporal-Localization Assistant | Mar 27, 2024 | Instruction Following, Temporal Localization | Code Available | 2 |
| JaxLife: An Open-Ended Agentic Simulator | Sep 1, 2024 | Artificial Life | Code Available | 2 |
| SeD: Semantic-Aware Discriminator for Image Super-Resolution | Feb 29, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| Let Go of Your Labels with Unsupervised Transfer | Jun 11, 2024 | Image Clustering, Unsupervised Image Classification | Code Available | 2 |
| Iterated Denoising Energy Matching for Sampling from Boltzmann Densities | Feb 9, 2024 | Denoising, Efficient Exploration | Code Available | 2 |
| CATCH: Channel-Aware multivariate Time Series Anomaly Detection via Frequency Patching | Oct 16, 2024 | Anomaly Detection, Time Series | Code Available | 2 |
| PennyLane: Automatic differentiation of hybrid quantum-classical computations | Nov 12, 2018 | BIG-bench Machine Learning, Quantum Machine Learning | Code Available | 2 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information Retrieval, Language Modeling | Code Available | 2 |
| Integrate Any Omics: Towards genome-wide data integration for patient stratification | Jan 15, 2024 | Data Integration, Diversity | Code Available | 2 |
| pyhgf: A neural network library for predictive coding | Oct 11, 2024 | Causal Discovery, Meta-Learning | Code Available | 2 |
| European Space Agency Benchmark for Anomaly Detection in Satellite Telemetry | Jun 25, 2024 | Anomaly Detection, Time Series | Code Available | 2 |
| From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models | Apr 24, 2024 | Instruction Following | Code Available | 2 |
| Soft Masked Mamba Diffusion Model for CT to MRI Conversion | Jun 22, 2024 | Computed Tomography (CT), Image Generation | Code Available | 2 |
| Med-Flamingo: a Multimodal Medical Few-shot Learner | Jul 27, 2023 | Medical Visual Question Answering, Question Answering | Code Available | 2 |
| Vision language models are blind: Failing to translate detailed visual features into words | Jul 9, 2024 | | Code Available | 2 |
| SNAKE: Shape-aware Neural 3D Keypoint Field | Jun 3, 2022 | Keypoint Detection | Code Available | 2 |
| Predicting Human Brain States with Transformer | Dec 11, 2024 | Language Modelling, Music Generation | Code Available | 2 |
| COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Sep 23, 2024 | Robot Task Planning, Task Planning | Code Available | 2 |
| CausalFormer: An Interpretable Transformer for Temporal Causal Discovery | Jun 24, 2024 | Causal Discovery, Time Series | Code Available | 2 |
| Synthetic Data RL: Task Definition Is All You Need | May 18, 2025 | All, GSM8K | Code Available | 2 |
| MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jan 30, 2024 | Autonomous Driving, Object | Code Available | 2 |
| Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data -- An Algorithm and A Benchmark | Jun 27, 2022 | | Code Available | 2 |
| Towards Reliable Advertising Image Generation Using Human Feedback | Aug 1, 2024 | Image Generation | Code Available | 2 |
| Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data | Jul 13, 2023 | 2D Human Pose Estimation, Pose Estimation | Code Available | 2 |
| Foundational Models Defining a New Era in Vision: A Survey and Outlook | Jul 25, 2023 | Benchmarking | Code Available | 2 |
| Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Sep 12, 2024 | Novel View Synthesis | Code Available | 2 |
| Registration based Few-Shot Anomaly Detection | Jul 15, 2022 | Anomaly Detection | Code Available | 2 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Sep 12, 2023 | Diagnostic, Natural Language Understanding | Code Available | 2 |
| VanillaNet: the Power of Minimalism in Deep Learning | May 22, 2023 | Deep Learning, Philosophy | Code Available | 2 |
| xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement | Jan 10, 2025 | Mamba, Speech Enhancement | Code Available | 2 |
| MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Mar 28, 2024 | Talking Head Generation | Code Available | 2 |
| FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching | Apr 1, 2024 | CPU, Image Registration | Code Available | 2 |
| HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting | Nov 29, 2023 | | Code Available | 2 |
| RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching | May 29, 2024 | Denoising, Protein Design | Code Available | 2 |
| High-dimensional mixed-categorical Gaussian processes with application to multidisciplinary design optimization for a green aircraft | Nov 10, 2023 | Bayesian Optimization, Cantilever Beam | Code Available | 2 |
| The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability | Jan 28, 2020 | BIG-bench Machine Learning, Fact Checking | Code Available | 2 |
| Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning | Feb 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval | Jan 31, 2024 | Retrieval, Text Retrieval | Code Available | 2 |
| FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression | Dec 5, 2024 | Descriptive, Visual Question Answering | Code Available | 2 |
| Unifying Voxel-based Representation with Transformer for 3D Object Detection | Jun 1, 2022 | 3D Object Detection, Decoder | Code Available | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image Classification, Object Detection | Code Available | 2 |
| Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Oct 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| PMaF: Deep Declarative Layers for Principal Matrix Features | Jun 26, 2023 | | Code Available | 2 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| Nellie: Automated organelle segmentation, tracking, and hierarchical feature extraction in 2D/3D live-cell microscopy | Mar 20, 2024 | | Code Available | 2 |