| SASVi - Segment Any Surgical Video | Feb 12, 2025 | SegmentationVideo Segmentation | CodeCode Available | 1 |
| HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Feb 12, 2025 | Cell SegmentationImage Generation | CodeCode Available | 1 |
| InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs | Feb 12, 2025 | Computational Efficiency | CodeCode Available | 1 |
| Bidirectional Diffusion Bridge Models | Feb 12, 2025 | Translation | CodeCode Available | 1 |
| Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features | Feb 12, 2025 | Graph AttentionLoad Forecasting | CodeCode Available | 1 |
| IHEval: Evaluating Language Models on Following the Instruction Hierarchy | Feb 12, 2025 | Instruction Following | CodeCode Available | 1 |
| Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions | Feb 12, 2025 | Contrastive LearningImage Retrieval | CodeCode Available | 1 |
| HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting | Feb 12, 2025 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 1 |
| LDC-MTL: Balancing Multi-Task Learning through Scalable Loss Discrepancy Control | Feb 12, 2025 | Bilevel OptimizationMulti-Task Learning | CodeCode Available | 1 |
| Measuring Diversity in Synthetic Datasets | Feb 12, 2025 | ClassificationDiversity | CodeCode Available | 1 |
| Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems | Feb 12, 2025 | Reinforcement Learning (RL) | CodeCode Available | 1 |
| Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution | Feb 12, 2025 | Image Super-ResolutionMixture-of-Experts | CodeCode Available | 1 |
| Out-of-Distribution Detection on Graphs: A Survey | Feb 12, 2025 | Anomaly DetectionGraph Anomaly Detection | CodeCode Available | 1 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Feb 12, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence | Feb 12, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation | Feb 12, 2025 | Computational EfficiencyImage Segmentation | CodeCode Available | 1 |
| From Brainwaves to Brain Scans: A Robust Neural Network for EEG-to-fMRI Synthesis | Feb 11, 2025 | EEGSSIM | CodeCode Available | 1 |
| Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models | Feb 11, 2025 | Image GenerationStyle Transfer | CodeCode Available | 1 |
| Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning | Feb 11, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| EventEgo3D++: 3D Human Motion Capture from a Head-Mounted Event Camera | Feb 11, 2025 | | CodeCode Available | 1 |
| Time2Lang: Bridging Time-Series Foundation Models and Large Language Models for Health Sensing Beyond Prompting | Feb 11, 2025 | Time Series | CodeCode Available | 1 |
| Explaining 3D Computed Tomography Classifiers with Counterfactuals | Feb 11, 2025 | Computed Tomography (CT)counterfactual | CodeCode Available | 1 |
| EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering | Feb 11, 2025 | Question AnsweringVideo Question Answering | CodeCode Available | 1 |
| Graph RAG-Tool Fusion | Feb 11, 2025 | RAGRetrieval | CodeCode Available | 1 |
| DarwinLM: Evolutionary Structured Pruning of Large Language Models | Feb 11, 2025 | Model Compression | CodeCode Available | 1 |
| TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation | Feb 11, 2025 | Depth CompletionTransparent objects | CodeCode Available | 1 |
| Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss | Feb 11, 2025 | | CodeCode Available | 1 |
| Revisiting Non-Acyclic GFlowNets in Discrete Environments | Feb 11, 2025 | | CodeCode Available | 1 |
| Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering | Feb 11, 2025 | | CodeCode Available | 1 |
| MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification | Feb 11, 2025 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection | Feb 11, 2025 | Fake Image Detection | CodeCode Available | 1 |
| BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models | Feb 11, 2025 | Code GenerationInstruction Following | CodeCode Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| Generative Modeling with Bayesian Sample Inference | Feb 11, 2025 | Density EstimationImage Generation | CodeCode Available | 1 |
| EIQP: Execution-time-certified and Infeasibility-detecting QP Solver | Feb 11, 2025 | C++ codeModel Predictive Control | CodeCode Available | 1 |
| Instance-dependent Early Stopping | Feb 11, 2025 | Transfer Learning | CodeCode Available | 1 |
| PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning | Feb 11, 2025 | ObjectVideo Prediction | CodeCode Available | 1 |
| Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples | Feb 11, 2025 | | CodeCode Available | 1 |
| Joint Modelling Histology and Molecular Markers for Cancer Classification | Feb 11, 2025 | Cancer ClassificationPrognosis | CodeCode Available | 1 |
| VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification | Feb 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Space-Aware Instruction Tuning: Dataset and Benchmark for Guide Dog Robots Assisting the Visually Impaired | Feb 11, 2025 | | CodeCode Available | 1 |
| Integrating Physics and Data-Driven Approaches: An Explainable and Uncertainty-Aware Hybrid Model for Wind Turbine Power Prediction | Feb 11, 2025 | Fault Detectionquantile regression | CodeCode Available | 1 |
| Flow Matching for Collaborative Filtering | Feb 11, 2025 | Collaborative FilteringRecommendation Systems | CodeCode Available | 1 |
| On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o | Feb 11, 2025 | | CodeCode Available | 1 |
| MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces | Feb 11, 2025 | | CodeCode Available | 1 |
| Diffusion Suction Grasping with Large-Scale Parcel Dataset | Feb 11, 2025 | Denoising | CodeCode Available | 1 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MAAT: Mamba Adaptive Anomaly Transformer with association discrepancy for time series | Feb 11, 2025 | Anomaly DetectionAnomaly Localization | CodeCode Available | 1 |
| Bag of Tricks for Inference-time Computation of LLM Reasoning | Feb 11, 2025 | GPU | CodeCode Available | 1 |
| MiniF2F in Rocq: Automatic Translation Between Proof Assistants -- A Case Study | Feb 11, 2025 | Translation | CodeCode Available | 1 |