| LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding | May 22, 2025 | Position | CodeCode Available | 1 |
| Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization | Mar 24, 2025 | NavigateScheduling | CodeCode Available | 1 |
| DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | Apr 3, 2024 | Image ClassificationInductive Bias | CodeCode Available | 1 |
| Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision | Jan 22, 2023 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Sliding Window FastEdit: A Framework for Lesion Annotation in Whole-body PET Images | Nov 24, 2023 | Interactive SegmentationSegmentation | CodeCode Available | 1 |
| SAGDFN: A Scalable Adaptive Graph Diffusion Forecasting Network for Multivariate Time Series Forecasting | Jun 18, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 1 |
| Semi-Supervised Deep Regression with Uncertainty Consistency and Variational Model Ensembling via Bayesian Neural Networks | Feb 15, 2023 | Age Estimationregression | CodeCode Available | 1 |
| A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection | Feb 7, 2023 | Federated LearningPrivacy Preserving | CodeCode Available | 1 |
| AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training Data | Oct 9, 2020 | AttributeNatural Questions | CodeCode Available | 1 |
| Self-Training Guided Disentangled Adaptation for Cross-Domain Remote Sensing Image Semantic Segmentation | Jan 13, 2023 | DecoderSemantic Segmentation | CodeCode Available | 1 |
| Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians | Jan 15, 2025 | Computational chemistryKnowledge Distillation | CodeCode Available | 1 |
| Self-Supervised Correspondence Estimation via Multiview Registration | Dec 6, 2022 | Diversity | CodeCode Available | 1 |
| Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning | Oct 14, 2022 | Contrastive Learning | CodeCode Available | 1 |
| Reinforced Structured State-Evolution for Vision-Language Navigation | Apr 20, 2022 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Visual Sound Localization in the Wild by Cross-Modal Interference Erasing | Feb 13, 2022 | Sound Source Localization | CodeCode Available | 1 |
| Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Nov 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications | Aug 7, 2022 | Activity RecognitionData Augmentation | CodeCode Available | 1 |
| Hyperbolic Geometric Latent Diffusion Model for Graph Generation | May 6, 2024 | Graph Generation | CodeCode Available | 1 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Oct 24, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation | Dec 5, 2023 | Self-Supervised LearningSpeech-to-Speech Translation | CodeCode Available | 1 |
| DevBench: A multimodal developmental benchmark for language learning | Jun 14, 2024 | | CodeCode Available | 1 |
| BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Sep 25, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| ForecastPFN: Synthetically-Trained Zero-Shot Forecasting | Nov 3, 2023 | Bayesian InferenceTime Series | CodeCode Available | 1 |
| Normalizing Flows are Capable Models for RL | May 29, 2025 | Imitation LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Multi-Stage Episodic Control for Strategic Exploration in Text Games | Jan 4, 2022 | | CodeCode Available | 1 |
| EWMoE: An effective model for global weather forecasting with mixture-of-experts | May 9, 2024 | Mixture-of-ExpertsWeather Forecasting | CodeCode Available | 1 |
| Benchmarking Data Science Agents | Feb 27, 2024 | BenchmarkingCode Generation | CodeCode Available | 1 |
| Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation | Jul 28, 2022 | | CodeCode Available | 1 |
| Introducing Thermodynamics-Informed Symbolic Regression -- A Tool for Thermodynamic Equations of State Development | Sep 6, 2023 | regressionSymbolic Regression | CodeCode Available | 1 |
| Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models | Oct 23, 2023 | AI2 Reasoning Challenge | CodeCode Available | 1 |
| Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QA | Jun 17, 2019 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations | Jun 2, 2023 | Contrastive LearningEmotion Recognition | CodeCode Available | 1 |
| Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction | Aug 30, 2023 | CT Reconstruction | CodeCode Available | 1 |
| Mosaic-IT: Free Compositional Data Augmentation Improves Instruction Tuning | May 22, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior | Apr 22, 2023 | DenoisingImage Denoising | CodeCode Available | 1 |
| Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning | Oct 27, 2023 | D4RLReinforcement Learning (RL) | CodeCode Available | 1 |
| LED: Light Enhanced Depth Estimation at Night | Sep 12, 2024 | Autonomous DrivingDecoder | CodeCode Available | 1 |
| Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Nov 3, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning | Oct 14, 2022 | Caption GenerationKnowledge Distillation | CodeCode Available | 1 |
| PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus | Jan 26, 2024 | | CodeCode Available | 1 |
| Neural Target Speech Extraction: An Overview | Jan 31, 2023 | Speech Extraction | CodeCode Available | 1 |
| ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts | Feb 8, 2025 | BenchmarkingSelf-Supervised Learning | CodeCode Available | 1 |
| Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning | Feb 22, 2024 | | CodeCode Available | 1 |
| ResiDual: Transformer with Dual Residual Connections | Apr 28, 2023 | Machine Translation | CodeCode Available | 1 |
| AMD-Hummingbird: Towards an Efficient Text-to-Video Model | Mar 24, 2025 | Computational EfficiencyVideo Generation | CodeCode Available | 1 |
| Unbiased Teacher for Semi-Supervised Object Detection | Feb 18, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Truncation Sampling as Language Model Desmoothing | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SatLM: Satisfiability-Aided Language Models Using Declarative Prompting | May 16, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Learning Weakly Convex Regularizers for Convergent Image-Reconstruction Algorithms | Aug 21, 2023 | Image ReconstructionMRI Reconstruction | CodeCode Available | 1 |
| MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution | Jun 17, 2025 | Facial Landmark DetectionMicro Expression Recognition | CodeCode Available | 1 |