| Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention | Dec 7, 2024 | DeblurringImage Deblurring | CodeCode Available | 1 |
| RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Dec 7, 2024 | Change DetectionImage Comprehension | CodeCode Available | 1 |
| Training-Free Bayesianization for Low-Rank Adapters of Large Language Models | Dec 7, 2024 | Variational Inference | CodeCode Available | 1 |
| M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Dec 7, 2024 | D4RLmodel | CodeCode Available | 1 |
| Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising | Dec 7, 2024 | Denoising | CodeCode Available | 1 |
| Finite Element Neural Network Interpolation. Part I: Interpretable and Adaptive Discretization for Solving PDEs | Dec 7, 2024 | Transfer Learning | CodeCode Available | 1 |
| CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences | Dec 7, 2024 | | CodeCode Available | 1 |
| Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning | Dec 7, 2024 | Attribute | CodeCode Available | 1 |
| TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models | Dec 7, 2024 | ChatbotNatural Language Queries | CodeCode Available | 1 |
| PrivAgent: Agentic-based Red-teaming for LLM Privacy Leakage | Dec 7, 2024 | Red TeamingSafety Alignment | CodeCode Available | 1 |
| CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds | Dec 7, 2024 | Question Answering | CodeCode Available | 1 |
| Fragmented Layer Grouping in GUI Designs Through Graph Learning Based on Multimodal Information | Dec 7, 2024 | Graph LearningGraph Neural Network | CodeCode Available | 1 |
| SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts | Dec 7, 2024 | General KnowledgeMixture-of-Experts | CodeCode Available | 1 |
| Slicing Vision Transformer for Flexible Inference | Dec 6, 2024 | | CodeCode Available | 1 |
| PyTerrier-GenRank: The PyTerrier Plugin for Reranking with Large Language Models | Dec 6, 2024 | Reranking | CodeCode Available | 1 |
| Learning to Translate Noise for Robust Image Denoising | Dec 6, 2024 | DenoisingImage Denoising | CodeCode Available | 1 |
| DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling | Dec 6, 2024 | Dialogue GenerationImitation Learning | CodeCode Available | 1 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 |
| Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Dec 6, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Two stages domain invariant representation learners solve the large co-variate shift in unsupervised domain adaptation with two dimensional data domains | Dec 6, 2024 | Domain AdaptationRepresentation Learning | CodeCode Available | 1 |
| COOOL: Challenge Of Out-Of-Label A Novel Benchmark for Autonomous Driving | Dec 6, 2024 | Anomaly DetectionAutonomous Driving | CodeCode Available | 1 |
| Extrapolated Urban View Synthesis Benchmark | Dec 6, 2024 | Autonomous VehiclesNovel View Synthesis | CodeCode Available | 1 |
| Machine Learning-Based mmWave MIMO Beam Tracking in V2I Scenarios: Algorithms and Datasets | Dec 6, 2024 | | CodeCode Available | 1 |
| Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens | Dec 6, 2024 | Superpixels | CodeCode Available | 1 |
| Explingo: Explaining AI Predictions using Large Language Models | Dec 6, 2024 | | CodeCode Available | 1 |
| Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Dec 6, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Transformers Can Navigate Mazes With Multi-Step Prediction | Dec 6, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| NLP-ADBench: NLP Anomaly Detection Benchmark | Dec 6, 2024 | Anomaly DetectionFraud Detection | CodeCode Available | 1 |
| Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications | Dec 6, 2024 | | CodeCode Available | 1 |
| Neural Representation for Wireless Radiation Field Reconstruction: A 3D Gaussian Splatting Approach | Dec 6, 2024 | Computational Efficiency | CodeCode Available | 1 |
| DrIFT: Autonomous Drone Dataset with Integrated Real and Synthetic Data, Flexible Views, and Transformed Domains | Dec 6, 2024 | Domain AdaptationUnsupervised Domain Adaptation | CodeCode Available | 1 |
| Customized Generation Reimagined: Fidelity and Editability Harmonized | Dec 6, 2024 | Denoising | CodeCode Available | 1 |
| TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Dec 6, 2024 | Imitation LearningMinecraft | CodeCode Available | 1 |
| LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Dec 6, 2024 | Image Generation | CodeCode Available | 1 |
| SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models | Dec 6, 2024 | | CodeCode Available | 1 |
| MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale | Dec 6, 2024 | Multimodal ReasoningVisual Question Answering | CodeCode Available | 1 |
| KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Dec 6, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 |
| One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Dec 6, 2024 | Data-free Knowledge DistillationFederated Learning | CodeCode Available | 1 |
| SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization | Dec 6, 2024 | Motion Generation | CodeCode Available | 1 |
| Smoothie: Label Free Language Model Routing | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Dec 6, 2024 | Decision MakingRAG | CodeCode Available | 1 |
| Training MLPs on Graphs without Supervision | Dec 5, 2024 | Fraud DetectionGraph Classification | CodeCode Available | 1 |
| PDG2Seq: Periodic Dynamic Graph to Sequence Model for Traffic Flow Prediction | Dec 5, 2024 | feature selectionGraph-to-Sequence | CodeCode Available | 1 |
| Does your model understand genes? A benchmark of gene properties for biological and text models | Dec 5, 2024 | BenchmarkingMulti-class Classification | CodeCode Available | 1 |
| ProtBoost: protein function prediction with Py-Boost and Graph Neural Networks -- CAFA5 top2 solution | Dec 5, 2024 | Protein Function Prediction | CodeCode Available | 1 |
| GRAM: Generalization in Deep RL with a Robust Adaptation Module | Dec 5, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Dec 5, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Dec 5, 2024 | DecoderGPU | CodeCode Available | 1 |
| 3D Part Segmentation via Geometric Aggregation of 2D Visual Features | Dec 5, 2024 | 3D geometry3D Part Segmentation | CodeCode Available | 1 |
| HyperMARL: Adaptive Hypernetworks for Multi-Agent RL | Dec 5, 2024 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |