| Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study | Oct 23, 2024 | | Code Available | 2 |
| Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | May 27, 2024 | Segmentation, Semantic correspondence | Code Available | 2 |
| Distributed Global Structure-from-Motion with a Deep Front-End | Nov 30, 2023 | | Code Available | 2 |
| Solver-in-the-Loop: Learning from Differentiable Physics to Interact with Iterative PDE-Solvers | Jun 30, 2020 | | Code Available | 2 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | May 28, 2024 | Novel View Synthesis, Triplet | Code Available | 2 |
| Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPG | Jul 24, 2023 | Benchmarking | Code Available | 2 |
| Stream of Search (SoS): Learning to Search in Language | Apr 1, 2024 | Language Modelling | Code Available | 2 |
| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | Epidemiology, Holdout Set | Code Available | 2 |
| Tracking Anything in High Quality | Jul 26, 2023 | Object, Object Tracking | Code Available | 2 |
| R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning | Jun 27, 2025 | Object Tracking, Template Matching | Code Available | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | Object, Semantic Segmentation | Code Available | 2 |
| Discovering Latent Knowledge in Language Models Without Supervision | Dec 7, 2022 | Imitation Learning, Language Modelling | Code Available | 2 |
| QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search | Feb 4, 2025 | | Code Available | 2 |
| DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome | Jun 26, 2023 | Computational Efficiency, Core Promoter Detection | Code Available | 2 |
| ProteinInvBench: Benchmarking Protein Inverse Folding on Diverse Tasks, Models, and Metrics | Sep 26, 2023 | | Code Available | 2 |
| Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning | Mar 3, 2025 | Reinforcement Learning (RL) | Code Available | 2 |
| Equivariant Graph Neural Operator for Modeling 3D Dynamics | Jan 19, 2024 | Operator learning | Code Available | 2 |
| Positional Encoder Graph Quantile Neural Networks for Geographic Data | Sep 27, 2024 | Density Estimation, Uncertainty Quantification | Code Available | 2 |
| Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference | Jul 18, 2023 | | Code Available | 2 |
| FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs | May 9, 2024 | Combinatorial Optimization | Code Available | 2 |
| PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change | Jun 21, 2022 | Common Sense Reasoning, Diversity | Code Available | 2 |
| SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Jun 16, 2025 | Simultaneous Localization and Mapping | Code Available | 2 |
| Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers | Jul 9, 2023 | Object Tracking | Code Available | 2 |
| Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? | Mar 11, 2024 | Prompt Engineering | Code Available | 2 |
| Idiosyncrasies in Large Language Models | Feb 17, 2025 | | Code Available | 2 |
| Longitudinal Segmentation of MS Lesions via Temporal Difference Weighting | Sep 20, 2024 | Inductive Bias, Lesion Detection | Code Available | 2 |
| Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Jul 7, 2025 | Inductive Bias, Mixture-of-Experts | Code Available | 2 |
| ICASSP 2022 Acoustic Echo Cancellation Challenge | Feb 27, 2022 | Acoustic echo cancellation, Speech Enhancement | Code Available | 2 |
| EASI-Tex: Edge-Aware Mesh Texturing from Single Image | May 27, 2024 | | Code Available | 2 |
| Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models | Apr 7, 2024 | Denoising | Code Available | 2 |
| Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood Diseases | Jan 1, 2024 | | Code Available | 2 |
| HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection | Mar 16, 2024 | channel selection, object-detection | Code Available | 2 |
| Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction | Apr 6, 2022 | Prediction, Stock Prediction | Code Available | 2 |
| IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS | Sep 9, 2024 | Denoising, Speech Enhancement | Code Available | 2 |
| Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning? | May 27, 2025 | Multimodal Reasoning | Code Available | 2 |
| SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Jul 16, 2024 | LIDAR Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models | Mar 14, 2025 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing | May 18, 2024 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 |
| FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching | May 26, 2025 | Quantization, Speech Enhancement | Code Available | 2 |
| EVOR: Evolving Retrieval for Code Generation | Feb 19, 2024 | Code Generation, RAG | Code Available | 2 |
| CenterFormer: Center-based Transformer for 3D Object Detection | Sep 12, 2022 | 3D Object Detection, Object | Code Available | 2 |
| Natural Language Fine-Tuning | Dec 29, 2024 | GSM8K, Large Language Model | Code Available | 2 |
| Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal | Feb 14, 2025 | Denoising, Image Restoration | Code Available | 2 |
| OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Feb 21, 2024 | Logical Fallacies | Code Available | 2 |
| Implicit Neural Representation in Medical Imaging: A Comparative Survey | Jul 30, 2023 | Domain Adaptation, Image Reconstruction | Code Available | 2 |
| LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models | Jun 7, 2024 | | Code Available | 2 |
| DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion | May 25, 2023 | Denoising, Style Transfer | Code Available | 2 |
| Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph | Jul 15, 2023 | Hallucination, Knowledge Graphs | Code Available | 2 |
| Quantifying the Plausibility of Context Reliance in Neural Machine Translation | Oct 2, 2023 | Machine Translation, Translation | Code Available | 2 |
| Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving | Feb 11, 2025 | Attribute, Autonomous Driving | Code Available | 2 |