| 360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation | Sep 12, 2024 | Image-to-Image TranslationTranslation | CodeCode Available | 1 |
| DeCLIP: Decoding CLIP representations for deepfake localization | Sep 12, 2024 | DecoderDeepFake Detection | CodeCode Available | 1 |
| Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models | Sep 12, 2024 | Self-Supervised LearningState Space Models | CodeCode Available | 1 |
| OpenACE: An Open Benchmark for Evaluating Audio Coding Performance | Sep 12, 2024 | | CodeCode Available | 1 |
| Click2Mask: Local Editing with Dynamic Mask Generation | Sep 12, 2024 | Image GenerationImage Manipulation | CodeCode Available | 1 |
| InvDesFlow: An AI-driven materials inverse design workflow to explore possible high-temperature superconductors | Sep 12, 2024 | Physical Intuition | CodeCode Available | 1 |
| Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation? | Sep 12, 2024 | DecoderDomain Generalization | CodeCode Available | 1 |
| Improving Virtual Try-On with Garment-focused Diffusion Models | Sep 12, 2024 | Image GenerationVirtual Try-on | CodeCode Available | 1 |
| meds_reader: A fast and efficient EHR processing library | Sep 12, 2024 | | CodeCode Available | 1 |
| Learning incomplete factorization preconditioners for GMRES | Sep 12, 2024 | Graph Neural Networksubspace methods | CodeCode Available | 1 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality | Sep 12, 2024 | Transfer Learning | CodeCode Available | 1 |
| Fine-tuning Large Language Models for Entity Matching | Sep 12, 2024 | Data IntegrationEntity Resolution | CodeCode Available | 1 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Sep 12, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Estimating Atmospheric Variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models | Sep 12, 2024 | DenoisingImputation | CodeCode Available | 1 |
| Deep learning and machine learning techniques for head pose estimation: a survey | Sep 12, 2024 | ArticlesHead Pose Estimation | CodeCode Available | 1 |
| OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation | Sep 12, 2024 | Mamba | CodeCode Available | 1 |
| Scribble-Guided Diffusion for Training-free Text-to-Image Generation | Sep 12, 2024 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| FaVoR: Features via Voxel Rendering for Camera Relocalization | Sep 11, 2024 | Camera Relocalization | CodeCode Available | 1 |
| Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement | Sep 11, 2024 | DemosaickingDenoising | CodeCode Available | 1 |
| Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets | Sep 11, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| A Comprehensive Survey on Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Sep 11, 2024 | Autonomous DrivingSports Analytics | CodeCode Available | 1 |
| PiTe: Pixel-Temporal Alignment for Large Video-Language Model | Sep 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing adversarial robustness in Natural Language Inference using explanations | Sep 11, 2024 | Adversarial RobustnessNatural Language Inference | CodeCode Available | 1 |
| EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data | Sep 11, 2024 | Data-free Knowledge DistillationKnowledge Distillation | CodeCode Available | 1 |
| AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Sep 11, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Policy Filtration in RLHF to Fine-Tune LLM for Code Generation | Sep 11, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model | Sep 11, 2024 | Data-to-Text GenerationGraph-to-Sequence | CodeCode Available | 1 |
| Representation Tuning | Sep 11, 2024 | | CodeCode Available | 1 |
| Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region | Sep 11, 2024 | parameter-efficient fine-tuningWeather Forecasting | CodeCode Available | 1 |
| TLD-READY: Traffic Light Detection -- Relevance Estimation and Deployment Analysis | Sep 11, 2024 | Autonomous Vehicles | CodeCode Available | 1 |
| Using Neural Network Models to Estimate Stellar Ages from Lithium Equivalent Widths: An EAGLES Expansion | Sep 11, 2024 | | CodeCode Available | 1 |
| Salmon: A Suite for Acoustic Language Model Evaluation | Sep 11, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning | Sep 11, 2024 | SegmentationVideo Polyp Segmentation | CodeCode Available | 1 |
| Data Augmentation via Latent Diffusion for Saliency Prediction | Sep 11, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Sep 11, 2024 | | CodeCode Available | 1 |
| Event-based Mosaicing Bundle Adjustment | Sep 11, 2024 | | CodeCode Available | 1 |
| Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Sep 10, 2024 | Camera LocalizationContrastive Learning | CodeCode Available | 1 |
| Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts | Sep 10, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Multi-Source Music Generation with Latent Diffusion | Sep 10, 2024 | FADMusic Generation | CodeCode Available | 1 |
| When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking | Sep 10, 2024 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 1 |
| Static for Dynamic: Towards a Deeper Understanding of Dynamic Facial Expressions Using Static Expression Data | Sep 10, 2024 | Dynamic Facial Expression RecognitionFacial Expression Recognition | CodeCode Available | 1 |
| Deep Learning for Koopman Operator Estimation in Idealized Atmospheric Dynamics | Sep 10, 2024 | Deep LearningWeather Forecasting | CodeCode Available | 1 |
| LIME: Less Is More for MLLM Evaluation | Sep 10, 2024 | Image CaptioningQuestion Answering | CodeCode Available | 1 |
| Modelling Global Trade with Optimal Transport | Sep 10, 2024 | Uncertainty Quantification | CodeCode Available | 1 |
| Can Large Language Models Unlock Novel Scientific Research Ideas? | Sep 10, 2024 | | CodeCode Available | 1 |
| A Likelihood Ratio-Based Approach to Segmenting Unknown Objects | Sep 10, 2024 | Segmentation | CodeCode Available | 1 |
| Neural Laplacian Operator for 3D Point Clouds | Sep 10, 2024 | 3D geometry | CodeCode Available | 1 |
| Confident Teacher, Confident Student? A Novel User Study Design for Investigating the Didactic Potential of Explanations and their Impact on Uncertainty | Sep 10, 2024 | Experimental DesignExplainable artificial intelligence | CodeCode Available | 1 |
| DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement | Sep 10, 2024 | Code GenerationDenoising | CodeCode Available | 1 |