| CN-SBM: Categorical Block Modelling For Primary and Residual Copy Number Variation | Jun 28, 2025 | | Unverified | 0 |
| LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning | Jun 28, 2025 | | Code Available | 0 |
| A Systematic Study of Compositional Syntactic Transformer Language Models | Jun 28, 2025 | | Code Available | 0 |
| Residual Matrix Transformers: Scaling the Size of the Residual Stream | Jun 28, 2025 | | Code Available | 0 |
| Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate | Jun 28, 2025 | | Code Available | 0 |
| Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration | Jun 28, 2025 | | Code Available | 0 |
| Missing-Modality-Aware Graph Neural Network for Cancer Classification | Jun 28, 2025 | | Code Available | 0 |
| Probabilistic Prototype Calibration of Vision-Language Models for Generalized Few-shot Semantic Segmentation | Jun 28, 2025 | | Code Available | 0 |
| Enabling Precise Topic Alignment in Large Language Models Via Sparse Autoencoders | Jun 28, 2025 | | Code Available | 0 |
| OpenPath: Open-Set Active Learning for Pathology Image Classification via Pre-trained Vision-Language Models | Jun 28, 2025 | | Code Available | 0 |
| MedEthicsQA: A Comprehensive Question Answering Benchmark for Medical Ethics Evaluation of LLMs | Jun 28, 2025 | | Code Available | 0 |
| Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models | Jun 28, 2025 | Named Entity Recognition | Code Available | 0 |
| Listener-Rewarded Thinking in VLMs for Image Preferences | Jun 28, 2025 | Memorization, Reinforcement Learning (RL) | Unverified | 0 |
| Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | Jun 28, 2025 | Cross-Modal Retrieval, Image Captioning | Unverified | 0 |
| VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding | Jun 28, 2025 | 3DGS, Instance Segmentation | Unverified | 0 |
| SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds | Jun 28, 2025 | Disentanglement | Unverified | 0 |
| Deterministic Object Pose Confidence Region Estimation | Jun 28, 2025 | Conformal Prediction, Object | Unverified | 0 |
| Point Cloud Compression and Objective Quality Assessment: A Survey | Jun 28, 2025 | Autonomous Driving, Benchmarking | Unverified | 0 |
| FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition | Jun 28, 2025 | Attribute, Contrastive Learning | Unverified | 0 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image Segmentation, Large Language Model | Code Available | 1 |
| Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models | Jun 28, 2025 | Image Classification | Code Available | 0 |
| Prompt Mechanisms in Medical Imaging: A Comprehensive Survey | Jun 28, 2025 | Feature Engineering, Image Generation | Unverified | 0 |
| Attention to Burstiness: Low-Rank Bilinear Prompt Tuning | Jun 28, 2025 | Visual Prompt Tuning | Code Available | 0 |
| Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems | Jun 28, 2025 | RAG, Response Generation | Unverified | 0 |
| Sensing Security Oriented OFDM-ISAC Against Multi-Intercept Threats | Jun 28, 2025 | Integrated sensing and communication, ISAC | Unverified | 0 |
| ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment | Jun 28, 2025 | Dynamic Time Warping, Large Language Model | Code Available | 0 |
| Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models | Jun 28, 2025 | | Code Available | 0 |
| Few-Shot Segmentation of Historical Maps via Linear Probing of Vision Foundation Models | Jun 27, 2025 | | Code Available | 0 |
| FedCLAM: Client Adaptive Momentum with Foreground Intensity Matching for Federated Medical Image Segmentation | Jun 27, 2025 | | Code Available | 0 |
| BrainMT: A Hybrid Mamba-Transformer Architecture for Modeling Long-Range Dependencies in Functional MRI Data | Jun 27, 2025 | | Code Available | 0 |
| Smooth-Distill: A Self-distillation Framework for Multitask Learning with Wearable Sensor Data | Jun 27, 2025 | | Code Available | 0 |
| MiCo: Multi-image Contrast for Reinforcement Visual Reasoning | Jun 27, 2025 | Logical Reasoning, Representation Learning | Unverified | 0 |
| Visual Structures Helps Visual Reasoning: Addressing the Binding Problem in VLMs | Jun 27, 2025 | Visual Reasoning | Unverified | 0 |
| DAPFAM: A Domain-Aware Patent Retrieval Dataset Aggregated at the Family Level | Jun 27, 2025 | Patent classification, Retrieval | Unverified | 0 |
| A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis | Jun 27, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| CaO_2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation | Jun 27, 2025 | Dataset Distillation | Code Available | 1 |
| 3D Shape Generation: A Survey | Jun 27, 2025 | 3D Shape Generation, Diversity | Unverified | 0 |
| Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs | Jun 27, 2025 | MME, Video MME | Unverified | 0 |
| A Survey of Continual Reinforcement Learning | Jun 27, 2025 | Continual Learning, Decision Making | Unverified | 0 |
| Advancements and Challenges in Continual Reinforcement Learning: A Comprehensive Review | Jun 27, 2025 | Continual Learning, Diversity | Unverified | 0 |
| TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models | Jun 27, 2025 | Decoder, Segmentation | Unverified | 0 |
| MatChA: Cross-Algorithm Matching with Feature Augmentation | Jun 27, 2025 | Visual Localization | Unverified | 0 |
| Task-Agnostic Contrastive Pretraining for Relational Deep Learning | Jun 27, 2025 | Deep Learning, Graph Neural Network | Unverified | 0 |
| EAMamba: Efficient All-Around Vision State Space Model for Image Restoration | Jun 27, 2025 | All, Deblurring | Code Available | 2 |
| Integrating Multi-Modal Sensors: A Review of Fusion Techniques for Intelligent Vehicles | Jun 27, 2025 | Autonomous Driving, Sensor Fusion | Unverified | 0 |
| The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements | Jun 27, 2025 | | Code Available | 2 |
| SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding | Jun 27, 2025 | 3D visual grounding, Natural Language Queries | Unverified | 0 |
| ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts | Jun 27, 2025 | Image Segmentation, Segmentation | Unverified | 0 |
| R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning | Jun 27, 2025 | Object Tracking, Template Matching | Code Available | 2 |
| Risk-Averse Best Arm Set Identification with Fixed Budget and Fixed Confidence | Jun 27, 2025 | Decision Making | Unverified | 0 |