| Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor | Jul 4, 2025 | Descriptive, image-classification | Code Available | 0 |
| Molecular Machine Learning Using Euler Characteristic Transforms | Jul 4, 2025 | molecular representation, Representation Learning | Code Available | 0 |
| Rectifying Adversarial Sample with Low Entropy Prior for Test-Time Defense | Jul 4, 2025 | Adversarial Robustness | Unverified | 0 |
| Effects of structure on reasoning in instance-level Self-Discover | Jul 4, 2025 | Math | Code Available | 0 |
| Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices | Jul 4, 2025 | Change Detection | Code Available | 1 |
| Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary Model | Jul 4, 2025 | Pseudo Label, Segmentation | Code Available | 0 |
| Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition | Jul 4, 2025 | Visual Place Recognition | Unverified | 0 |
| Adaptive Gate-Aware Mamba Networks for Magnetic Resonance Fingerprinting | Jul 4, 2025 | Magnetic Resonance Fingerprinting, Mamba | Unverified | 0 |
| Flow-Anchored Consistency Models | Jul 4, 2025 | Image Generation | Code Available | 2 |
| Hybrid-View Attention for csPCa Classification in TRUS | Jul 4, 2025 | Classification, Diagnostic | Code Available | 0 |
| Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky | Jul 4, 2025 | Response Generation | Unverified | 0 |
| STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Jul 4, 2025 | Benchmarking, Navigate | Code Available | 0 |
| CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble Fusion | Jul 4, 2025 | Domain Adaptation, Person Re-Identification | Code Available | 0 |
| Cross-domain Hyperspectral Image Classification based on Bi-directional Domain Adaptation | Jul 3, 2025 | | Code Available | 0 |
| MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis | Jul 3, 2025 | | Code Available | 0 |
| A Fuzzy Supervisor Agent Design for Clinical Reasoning Assistance in a Multi-Agent Educational Clinical Scenario Simulation | Jul 3, 2025 | | Code Available | 0 |
| JoyTTS: LLM-based Spoken Chatbot With Voice Cloning | Jul 3, 2025 | | Code Available | 0 |
| Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback | Jul 3, 2025 | | Unverified | 0 |
| GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling | Jul 3, 2025 | | Code Available | 0 |
| LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling | Jul 3, 2025 | | Unverified | 0 |
| Wildlife Target Re-Identification Using Self-supervised Learning in Non-Urban Settings | Jul 3, 2025 | | Code Available | 0 |
| Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving | Jul 3, 2025 | | Unverified | 0 |
| Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers | Jul 3, 2025 | | Unverified | 0 |
| Prompt learning with bounding box constraints for medical image segmentation | Jul 3, 2025 | | Code Available | 0 |
| LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion | Jul 3, 2025 | | Unverified | 0 |
| Answer Matching Outperforms Multiple Choice for Language Model Evaluation | Jul 3, 2025 | | Unverified | 0 |
| Explainable AI for Comprehensive Risk Assessment for Financial Reports: A Lightweight Hierarchical Transformer Network Approach | Jul 3, 2025 | | Code Available | 0 |
| IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep Learning | Jul 3, 2025 | | Code Available | 0 |
| CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing | Jul 3, 2025 | | Code Available | 0 |
| RGC-VQA: An Exploration Database for Robotic-Generated Video Quality Assessment | Jul 3, 2025 | | Code Available | 0 |
| MTCNet: Motion and Topology Consistency Guided Learning for Mitral Valve Segmentation in 4D Ultrasound | Jul 3, 2025 | | Code Available | 0 |
| Listwise Preference Alignment Optimization for Tail Item Recommendation | Jul 3, 2025 | | Code Available | 0 |
| DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning | Jul 3, 2025 | | Code Available | 0 |
| CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios | Jul 3, 2025 | | Code Available | 0 |
| MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs | Jul 3, 2025 | | Code Available | 0 |
| Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching | Jul 3, 2025 | | Code Available | 0 |
| From Sentences to Sequences: Rethinking Languages in Biological System | Jul 3, 2025 | | Code Available | 0 |
| Temporally-Aware Supervised Contrastive Learning for Polyp Counting in Colonoscopy | Jul 3, 2025 | | Code Available | 0 |
| MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning | Jul 3, 2025 | | Code Available | 0 |
| PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View | Jul 3, 2025 | | Code Available | 0 |
| F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning | Jul 3, 2025 | | Code Available | 0 |
| DistZO2: High-Throughput and Memory-Efficient Zeroth-Order Fine-tuning LLMs with Distributed Parallel Computing | Jul 3, 2025 | | Code Available | 0 |
| Cautious Next Token Prediction | Jul 3, 2025 | Prediction | Code Available | 1 |
| From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images | Jul 3, 2025 | Depth Estimation, Semantic Segmentation | Unverified | 0 |
| AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench | Jul 3, 2025 | Navigate | Code Available | 2 |
| Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics | Jul 3, 2025 | image-classification, Image Classification | Code Available | 1 |
| RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents | Jul 3, 2025 | Emotional Intelligence, reinforcement-learning | Code Available | 3 |
| MathOptAI.jl: Embed trained machine learning predictors into JuMP models | Jul 3, 2025 | CPU, Gaussian Processes | Code Available | 2 |
| ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation | Jul 3, 2025 | Few-Shot Learning, Segmentation | Unverified | 0 |
| Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization | Jul 3, 2025 | Descriptive, Disentanglement | Unverified | 0 |