SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1840118450 of 474278 papers

TitleStatusHype
Enhancing Convergence, Privacy and Fairness for Wireless Personalized Federated Learning: Quantization-Assisted Min-Max Fair Scheduling0
Reconciling Hessian-Informed Acceleration and Scalar-Only Communication for Efficient Federated Zeroth-Order Fine-Tuning0
Probabilistic Online Event Downsampling0
Multi-Spectral Gaussian Splatting with Neural Color Representation0
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models0
FlexPainter: Flexible and Multi-View Consistent Texture Generation0
A Machine Learning Theory Perspective on Strategic Litigation0
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud ProviderCode2
GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region RemovalCode1
EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VRCode0
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction ScenariosCode1
Comparative Analysis of AI Agent Architectures for Entity Relationship ClassificationCode0
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at ScaleCode2
Causal Explainability of Machine Learning in Heart Failure Prediction from Electronic Health Records0
Generative AI for Predicting 2D and 3D Wildfire Spread: Beyond Physics-Based Models and Traditional Deep Learning0
PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples0
From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance AdaptationCode0
Occlusion-Aware Ground Target Tracking by a Dubins Vehicle using Visibility VolumesCode0
Comparison of different Unique hard attention transformer models by the formal languages they can recognize0
VolTex: Food Volume Estimation using Text-Guided Segmentation and Neural Surface ReconstructionCode0
Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff0
How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation AlignmentCode0
BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream CamouflageCode0
Labelling Data with Unknown ReferencesCode0
HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers0
Enhancing Automatic PT Tagging for MEDLINE Citations Using Transformer-Based Models0
Overcoming Challenges of Partial Client Participation in Federated Learning : A Comprehensive Review0
A Review of Various Datasets for Machine Learning Algorithm-Based Intrusion Detection System: Advances and Challenges0
MISLEADER: Defending against Model Extraction with Ensembles of Distilled ModelsCode0
A Multimodal, Multilingual, and Multidimensional Pipeline for Fine-grained Crowdsourcing Earthquake Damage EvaluationCode0
PhysGaia: A Physics-Aware Dataset of Multi-Body Interactions for Dynamic Novel View SynthesisCode1
Impact of Rankings and Personalized Recommendations in Marketplaces0
Dense Match Summarization for Faster Two-view EstimationCode1
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement LearningCode1
VPI-Bench: Visual Prompt Injection Attacks for Computer-Use AgentsCode1
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding0
Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains0
EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language NavigationCode0
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning0
Small Language Models are the Future of Agentic AI0
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents0
Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models0
CVC: A Large-Scale Chinese Value Rule Corpus for Value Alignment of Large Language ModelsCode0
PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization0
Can We Trust Machine Learning? The Reliability of Features from Open-Source Speech Analysis Tools for Speech Modeling0
MODS: Multi-source Observations Conditional Diffusion Model for Meteorological State Downscaling0
Embedded Acoustic Intelligence for Automotive Systems0
Alternates, Assemble! Selecting Optimal Alternates for Citizens' Assemblies0
Cross-Lingual Transfer of Cultural Knowledge: An Asymmetric Phenomenon0
Show:102550
← PrevPage 369 of 9486Next →