| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| Behavior Alignment: A New Perspective of Evaluating LLM-based Conversational Recommender Systems | Apr 17, 2024 | Conversational RecommendationRecommendation Systems | CodeCode Available | 2 |
| Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era | Apr 17, 2024 | FairnessInformation Retrieval | CodeCode Available | 2 |
| Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Apr 17, 2024 | Survey | CodeCode Available | 2 |
| Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution | Apr 17, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Vision-and-Language Navigation via Causal Learning | Apr 16, 2024 | Causal InferenceContrastive Learning | CodeCode Available | 2 |
| Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models | Apr 16, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Sustainability of Data Center Digital Twins with Reinforcement Learning | Apr 16, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Revealing data leakage in protein interaction benchmarks | Apr 16, 2024 | Benchmarking | CodeCode Available | 2 |
| TorchSurv: A Lightweight Package for Deep Survival Analysis | Apr 16, 2024 | Survival Analysis | CodeCode Available | 2 |
| Self-playing Adversarial Language Game Enhances LLM Reasoning | Apr 16, 2024 | | CodeCode Available | 2 |
| SRGS: Super-Resolution 3D Gaussian Splatting | Apr 16, 2024 | 3DGSNeRF | CodeCode Available | 2 |
| Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior | Apr 16, 2024 | Neural RenderingText to 3D | CodeCode Available | 2 |
| Can Language Models Solve Olympiad Programming? | Apr 16, 2024 | | CodeCode Available | 2 |
| Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards | Apr 16, 2024 | GSM8KMath | CodeCode Available | 2 |
| Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology | Apr 16, 2024 | Drug DiscoverySelf-Supervised Learning | CodeCode Available | 2 |
| Self-Supervised Visual Preference Alignment | Apr 16, 2024 | 8kMM-Vet | CodeCode Available | 2 |
| Confidential Federated Computations | Apr 16, 2024 | Federated Learning | CodeCode Available | 2 |
| Compression Represents Intelligence Linearly | Apr 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception | Apr 15, 2024 | | CodeCode Available | 2 |
| Towards a high-performance AI compiler with upstream MLIR | Apr 15, 2024 | | CodeCode Available | 2 |
| Map-Relative Pose Regression for Visual Re-Localization | Apr 15, 2024 | Novel View Synthesisregression | CodeCode Available | 2 |
| FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining | Apr 15, 2024 | MambaRain Removal | CodeCode Available | 2 |
| Convergence Analysis of Probability Flow ODE for Score-based Generative Models | Apr 15, 2024 | | CodeCode Available | 2 |
| FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENAHNCMENT | Apr 15, 2024 | Speech Enhancement | CodeCode Available | 2 |
| LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Apr 15, 2024 | GPU | CodeCode Available | 2 |
| in2IN: Leveraging individual Information to Generate Human INteractions | Apr 15, 2024 | DiversityLanguage Modelling | CodeCode Available | 2 |
| Salient Object-Aware Background Generation using Text-Guided Diffusion Models | Apr 15, 2024 | Object | CodeCode Available | 2 |
| Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Apr 15, 2024 | Autonomous Driving | CodeCode Available | 2 |
| Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models | Apr 15, 2024 | | CodeCode Available | 2 |
| ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs | Apr 15, 2024 | Uncertainty QuantificationWeather Forecasting | CodeCode Available | 2 |
| All-in-one simulation-based inference | Apr 15, 2024 | AllBayesian Inference | CodeCode Available | 2 |
| Foundational Challenges in Assuring Alignment and Safety of Large Language Models | Apr 15, 2024 | | CodeCode Available | 2 |
| Leveraging Temporal Contextualization for Video Action Recognition | Apr 15, 2024 | Action RecognitionTemporal Action Localization | CodeCode Available | 2 |
| σ-GPTs: A New Approach to Autoregressive Models | Apr 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| XoFTR: Cross-modal Feature Matching Transformer | Apr 15, 2024 | Image Augmentation | CodeCode Available | 2 |
| VideoSAGE: Video Summarization with Graph Representation Learning | Apr 14, 2024 | Graph Representation LearningNode Classification | CodeCode Available | 2 |
| HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images | Apr 14, 2024 | Change DetectionDeep Learning | CodeCode Available | 2 |
| Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery | Apr 14, 2024 | Change DetectionEdge Detection | CodeCode Available | 2 |
| TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Apr 14, 2024 | Dense Video CaptioningDescriptive | CodeCode Available | 2 |
| TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models | Apr 14, 2024 | | CodeCode Available | 2 |
| A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion | Apr 14, 2024 | MambaPansharpening | CodeCode Available | 2 |
| Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation | Apr 13, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning | Apr 13, 2024 | Few-Shot Learning | CodeCode Available | 2 |
| MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild | Apr 13, 2024 | cross-modal alignmentDynamic Facial Expression Recognition | CodeCode Available | 2 |
| Under pressure: learning-based analog gauge reading in the wild | Apr 12, 2024 | | CodeCode Available | 2 |
| LaSagnA: Language-based Segmentation Assistant for Complex Queries | Apr 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Apr 12, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking | Apr 12, 2024 | Contrastive LearningRetrieval | CodeCode Available | 2 |
| Learning representations of learning representations | Apr 12, 2024 | Sentence | CodeCode Available | 2 |