SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1295113000 of 474278 papers

TitleStatusHype
Few-Shot Learning by Explicit Physics Integration: An Application to Groundwater Heat TransportCode0
CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation0
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement LearningCode2
AI-Based Demand Forecasting and Load Balancing for Optimising Energy use in Healthcare Systems: A real case study0
Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen SubstrateCode0
A Survey on Latent ReasoningCode3
CriticLean: Critic-Guided Reinforcement Learning for Mathematical FormalizationCode1
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures0
Tile-Based ViT Inference with Visual-Cluster Priors for Zero-Shot Multi-Species Plant IdentificationCode0
USIGAN: Unbalanced Self-Information Feature Transport for Weakly Paired Image IHC Virtual StainingCode0
What ZTF Saw Where Rubin Looked: Anomaly Hunting in DR230
PaddleOCR 3.0 Technical Report0
Hierarchical Task Offloading for UAV-Assisted Vehicular Edge Computing via Deep Reinforcement Learning0
MP-ALOE: An r2SCAN dataset for universal machine learning interatomic potentials0
TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model0
Fast and Accurate Collision Probability Estimation for Autonomous Vehicles using Adaptive Sigma-Point Sampling0
Remember Past, Anticipate Future: Learning Continual Multimodal Misinformation DetectorsCode0
RIS-Enabled Transmitter Design for Joint Radar and Communication0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data0
Critical Nodes Identification in Complex Networks: A Survey0
Communication-Efficient Module-Wise Federated Learning for Grasp Pose Detection in Cluttered Environments0
Generative Head-Mounted Camera Captures for Photorealistic Avatars0
Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework0
A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem0
LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving0
AdaptaGen: Domain-Specific Image Generation through Hierarchical Semantic Optimization Framework0
Multi-Modal Face Anti-Spoofing via Cross-Modal Feature Transitions0
Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval0
High-Fidelity and Generalizable Neural Surface Reconstruction with Sparse Feature Volumes0
TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems0
FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models0
GSVR: 2D Gaussian-based Video Representation for 800+ FPS with Hybrid Deformation Field0
QS4D: Quantization-aware training for efficient hardware deployment of structured state-space sequential models0
TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data0
Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization0
DESIGN: Encrypted GNN Inference via Server-Side Input Graph Pruning0
DS@GT at CheckThat! 2025: Ensemble Methods for Detection of Scientific Discourse on Social MediaCode0
PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer LearningCode0
SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor VariationsCode0
Deep Learning Optimization of Two-State Pinching Antennas Systems0
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data AugmentationCode0
Differentiable Reward Optimization for LLM based TTS systemCode2
UQLM: A Python Package for Uncertainty Quantification in Large Language ModelsCode5
Prototype-Guided and Lightweight Adapters for Inherent Interpretation and Generalisation in Federated LearningCode0
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMsCode2
NeoBabel: A Multilingual Open Tower for Visual GenerationCode1
DreamArt: Generating Interactable Articulated Objects from a Single Image0
ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion ModelsCode1
Show:102550
← PrevPage 260 of 9486Next →