| MedGemma Technical Report | Jul 12, 2025 | | —Unverified | 0 |
| When Small Guides Large: Cross-Model Co-Learning for Test-Time Adaptation | Jul 12, 2025 | | CodeCode Available | 0 |
| DeltaSHAP: Explaining Prediction Evolutions in Online Patient Monitoring with Shapley Values | Jul 12, 2025 | | CodeCode Available | 0 |
| Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning | Jul 12, 2025 | | CodeCode Available | 0 |
| CoVAE: Consistency Training of Variational Autoencoders | Jul 12, 2025 | | CodeCode Available | 0 |
| Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning | Jul 12, 2025 | | CodeCode Available | 0 |
| RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking | Jul 12, 2025 | | CodeCode Available | 0 |
| Optimizing Basis Function Selection in Constructive Wavelet Neural Networks and Its Applications | Jul 12, 2025 | | CodeCode Available | 0 |
| Cross Knowledge Distillation between Artificial and Spiking Neural Networks | Jul 12, 2025 | | CodeCode Available | 0 |
| Geometric Generative Modeling with Noise-Conditioned Graph Networks | Jul 12, 2025 | | CodeCode Available | 0 |
| ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans | Jul 12, 2025 | | CodeCode Available | 0 |
| DLBAcalib: Robust Extrinsic Calibration for Non-Overlapping LiDARs Based on Dual LBA | Jul 12, 2025 | | CodeCode Available | 0 |
| Ambiguity-Aware and High-Order Relation Learning for Multi-Grained Image-Text Matching | Jul 12, 2025 | | CodeCode Available | 0 |
| AlphaVAE: Unified End-to-End RGBA Image Reconstruction and Generation with Alpha-Aware Representation Learning | Jul 12, 2025 | | CodeCode Available | 0 |
| DS@GT at Touché: Large Language Models for Retrieval-Augmented Debate | Jul 12, 2025 | | CodeCode Available | 0 |
| DTECT: Dynamic Topic Explorer & Context Tracker | Jul 12, 2025 | | CodeCode Available | 0 |
| ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching | Jul 12, 2025 | Dialogue Generationtext-to-speech | CodeCode Available | 4 |
| CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards | Jul 12, 2025 | | CodeCode Available | 2 |
| Meta-autoencoders: An approach to discovery and representation of relationships between dynamically evolving classes | Jul 12, 2025 | Decoder | —Unverified | 0 |
| Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift | Jul 12, 2025 | BenchmarkingTransfer Learning | —Unverified | 0 |
| Adversarial Activation Patching: A Framework for Detecting and Mitigating Emergent Deception in Safety-Aligned Transformers | Jul 12, 2025 | Anomaly Detection | —Unverified | 0 |
| LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing | Jul 12, 2025 | Decision MakingMisinformation | —Unverified | 0 |
| SnapMoGen: Human Motion Generation from Expressive Texts | Jul 12, 2025 | Motion Generation | —Unverified | 0 |
| Continual Reinforcement Learning by Planning with Online World Models | Jul 12, 2025 | Continual LearningModel Predictive Control | —Unverified | 0 |
| RoHOI: Robustness Benchmark for Human-Object Interaction Detection | Jul 12, 2025 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation | Jul 12, 2025 | Few-Shot Image Classificationimage-classification | CodeCode Available | 0 |
| I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting | Jul 12, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP Alignment | Jul 12, 2025 | Large Language ModelPose Estimation | CodeCode Available | 0 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Generative Latent Kernel Modeling for Blind Motion Deblurring | Jul 12, 2025 | DeblurringGenerative Adversarial Network | CodeCode Available | 0 |
| Robust Spatiotemporal Epidemic Modeling with Integrated Adaptive Outlier Detection | Jul 12, 2025 | Outlier Detectionparameter estimation | CodeCode Available | 0 |
| PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution | Jul 12, 2025 | Super-Resolution | CodeCode Available | 0 |
| BayesTTA: Continual-Temporal Test-Time Adaptation for Vision-Language Models via Gaussian Discriminant Analysis | Jul 11, 2025 | | CodeCode Available | 0 |
| Visual Semantic Description Generation with MLLMs for Image-Text Matching | Jul 11, 2025 | | CodeCode Available | 0 |
| PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection | Jul 11, 2025 | | CodeCode Available | 0 |
| Spectral Manifold Harmonization for Graph Imbalanced Regression | Jul 11, 2025 | | CodeCode Available | 0 |
| Multimodal Cardiovascular Risk Profiling Using Self-Supervised Learning of Polysomnography | Jul 11, 2025 | | CodeCode Available | 0 |
| Leanabell-Prover-V2: Verifier-integrated Reasoning for Formal Theorem Proving via Reinforcement Learning | Jul 11, 2025 | | CodeCode Available | 0 |
| Fair-FLIP: Fair Deepfake Detection with Fairness-Oriented Final Layer Input Prioritising | Jul 11, 2025 | | CodeCode Available | 0 |
| Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models | Jul 11, 2025 | | —Unverified | 0 |
| Single-Step Latent Diffusion for Underwater Image Restoration | Jul 11, 2025 | | —Unverified | 0 |
| Cycle Context Verification for In-Context Medical Image Segmentation | Jul 11, 2025 | | CodeCode Available | 0 |
| One-Pass to Reason: Token Duplication and Block-Sparse Mask for Efficient Fine-Tuning on Multi-Turn Reasoning | Jul 11, 2025 | | CodeCode Available | 0 |
| Predicting Air Pollution in Cork, Ireland Using Machine Learning | Jul 11, 2025 | | CodeCode Available | 0 |
| Transfer Learning and Mixup for Fine-Grained Few-Shot Fungi Classification | Jul 11, 2025 | | CodeCode Available | 0 |
| LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning | Jul 11, 2025 | | CodeCode Available | 0 |
| OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique | Jul 11, 2025 | Code Generation | —Unverified | 0 |
| Multilingual Multimodal Software Developer for Code Generation | Jul 11, 2025 | Code GenerationInstruction Following | —Unverified | 0 |
| Droid: A Resource Suite for AI-Generated Code Detection | Jul 11, 2025 | Metric Learning | —Unverified | 0 |
| Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins | Jul 11, 2025 | PredictionProtein Structure Prediction | CodeCode Available | 1 |