| Fundamental Limits of Game-Theoretic LLM Alignment: Smith Consistency and Preference Matching | May 27, 2025 | Diversity | Unverified | 0 |
| Learning Annotation Consensus for Continuous Emotion Recognition | May 27, 2025 | Emotion Recognition | Unverified | 0 |
| Leveraging GANs for citation intent classification and its impact on citation network analysis | May 27, 2025 | Citation Intent Classification, intent-classification | Code Available | 0 |
| MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems | May 27, 2025 | | Code Available | 1 |
| NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation | May 27, 2025 | Computational Efficiency, Graph Neural Network | Code Available | 3 |
| R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning | May 27, 2025 | Code Generation, Reinforcement Learning (RL) | Code Available | 1 |
| Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space | May 27, 2025 | Prompt Engineering | Code Available | 1 |
| R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing | May 27, 2025 | Math | Code Available | 2 |
| SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations | May 27, 2025 | | Code Available | 0 |
| Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs | May 27, 2025 | Audio-visual Question Answering, Question Answering | Code Available | 0 |
| Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration | May 27, 2025 | Hallucination, Visual Grounding | Unverified | 0 |
| FeatInv: Spatially resolved mapping from feature space to input space using conditional diffusion models | May 27, 2025 | | Code Available | 0 |
| MLMC-based Resource Adequacy Assessment with Active Learning Trained Surrogate Models | May 27, 2025 | Active Learning | Code Available | 0 |
| MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition | May 27, 2025 | Activity Recognition, Human Activity Recognition | Unverified | 0 |
| Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech | May 27, 2025 | Style Transfer, text-to-speech | Unverified | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | May 27, 2025 | Diagnostic, Knowledge Graphs | Unverified | 0 |
| HTMNet: A Hybrid Network with Transformer-Mamba Bottleneck Multimodal Fusion for Transparent and Reflective Objects Depth Completion | May 27, 2025 | Decoder, Depth Completion | Unverified | 0 |
| Fully Spiking Neural Networks for Unified Frame-Event Object Tracking | May 27, 2025 | Object Tracking, Visual Object Tracking | Unverified | 0 |
| Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | May 27, 2025 | Denoising, Language Modeling | Unverified | 0 |
| Sci-Fi: Symmetric Constraint for Frame Inbetweening | May 27, 2025 | | Unverified | 0 |
| Supervised and self-supervised land-cover segmentation & classification of the Biesbosch wetlands | May 27, 2025 | Land Cover Classification, Self-Supervised Learning | Unverified | 0 |
| Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models | May 27, 2025 | Diagnostic, Spatial Reasoning | Unverified | 0 |
| Dual-Polarization Stacked Intelligent Metasurfaces for Holographic MIMO | May 27, 2025 | | Code Available | 1 |
| Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | May 27, 2025 | Multi-hop Question Answering, Question Answering | Code Available | 1 |
| RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images | May 27, 2025 | Anomaly Detection, Binarization | Code Available | 1 |
| OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models | May 27, 2025 | Safety Alignment | Code Available | 0 |
| A domain adaptation neural network for digital twin-supported fault diagnosis | May 27, 2025 | Diagnostic, Domain Adaptation | Code Available | 0 |
| VibE-SVC: Vibrato Extraction with High-frequency F0 Contour for Singing Voice Conversion | May 27, 2025 | Voice Conversion | Unverified | 0 |
| Stereo Radargrammetry Using Deep Learning from Airborne SAR Images | May 27, 2025 | Deep Learning | Unverified | 0 |
| Counterfactual Multi-player Bandits for Explainable Recommendation Diversification | May 27, 2025 | counterfactual, Diversity | Code Available | 0 |
| Stationary MMD Points for Cubature | May 27, 2025 | Data Compression | Code Available | 0 |
| DeSocial: Blockchain-based Decentralized Social Networks | May 27, 2025 | Model Selection, Prediction | Code Available | 1 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Stochastic Geometry-Based Performance Evaluation for LEO Satellite-Assisted Space Caching | May 27, 2025 | Edge-computing | Unverified | 0 |
| RefAV: Towards Planning-Centric Scenario Mining | May 27, 2025 | Autonomous Vehicles, Motion Planning | Code Available | 1 |
| The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project | May 26, 2025 | | Code Available | 2 |
| LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs | May 26, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Probabilistic Spatial Interpolation of Sparse Data using Diffusion Models | May 26, 2025 | Imputation, Spatial Interpolation | Unverified | 0 |
| An Open-Source Python Framework and Synthetic ECG Image Datasets for Digitization, Lead and Lead Name Detection, and Overlapping Signal Segmentation | May 26, 2025 | ECG Digitization, Segmentation | Code Available | 0 |
| A Novel Shape-Aware Topological Representation for GPR Data with DNN Integration | May 26, 2025 | GPR, object-detection | Unverified | 0 |
| Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting | May 26, 2025 | Chatbot, Computational Efficiency | Unverified | 0 |
| VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration | May 26, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Enhancing Contrastive Learning-based Electrocardiogram Pretrained Model with Patient Memory Queue | May 26, 2025 | Contrastive Learning, Data Augmentation | Code Available | 0 |
| HAMburger: Accelerating LLM Inference via Token Smashing | May 26, 2025 | Large Language Model | Unverified | 0 |
| GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation | May 26, 2025 | Question Answering, Synthetic Data Generation | Code Available | 4 |
| In-context Language Learning for Endangered Languages in Speech Recognition | May 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Electrolyzers-HSI: Close-Range Multi-Scene Hyperspectral Imaging Benchmark Dataset | May 26, 2025 | Material Classification | Code Available | 0 |
| MAS-Zero: Designing Multi-Agent Systems with Zero Supervision | May 26, 2025 | Math, Problem Decomposition | Code Available | 2 |
| R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning | May 26, 2025 | Hallucination, RAG | Code Available | 1 |
| Detection of Suicidal Risk on Social Media: A Hybrid Model | May 26, 2025 | Data Augmentation, Multi-class Classification | Unverified | 0 |