SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 13401-13450 of 474278 papers

Title | Status | Hype
STELLA: Self-Evolving LLM Agent for Biomedical Research | - | 0
Geometry-aware 4D Video Generation for Robot Manipulation | - | 0
TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation | - | 0
Real-Time Inverse Kinematics for Generating Multi-Constrained Movements of Virtual Human Characters | Code | 1
GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models | Code | 0
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning | Code | 7
CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs | - | 0
HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning | - | 0
Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays | - | 0
Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations | - | 0
RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles | - | 0
Out-of-distribution detection in 3D applications: a review | - | 0
LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling | Code | 1
ShapeEmbed: a self-supervised learning framework for 2D contour quantification | - | 0
UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions | Code | 1
Why Multi-Interest Fairness Matters: Hypergraph Contrastive Multi-Interest Learning for Fair Conversational Recommender System | Code | 0
Empirical Analysis Of Heuristic and Approximation Algorithms for the The Mutual-Visibility Problem | Code | 0
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis | - | 0
Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment | Code | 1
Enhancing LLM Agent Safety via Causal Influence Prompting | Code | 0
Process-aware and high-fidelity microstructure generation using stable diffusion | - | 0
Understanding Generalization in Node and Link Prediction | - | 0
Instant Particle Size Distribution Measurement Using CNNs Trained on Synthetic Data | Code | 0
A Unified Transformer-Based Framework with Pretraining For Whole Body Grasping Motion Generation | Code | 0
TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality | Code | 1
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement | Code | 2
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs | Code | 1
Imbalance Prime Sieving: Every Prime Gap Is a Result of a Möbius Imbalance Obstruction | Code | 0
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model | Code | 0
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data | - | 0
LLMs are Capable of Misaligned Behavior Under Explicit Prohibition and Surveillance | Code | 0
AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval | - | 0
Unified Multimodal Understanding via Byte-Pair Visual Encoding | - | 0
VMoBA: Mixture-of-Block Attention for Video Diffusion Models | - | 0
EfficientXLang: Towards Improving Token Efficiency Through Cross-Lingual Reasoning | Code | 0
BIMgent: Towards Autonomous Building Modeling via Computer-use Agents | - | 0
AQUA20: A Benchmark Dataset for Underwater Species Classification under Challenging Conditions | - | 0
Single Image Test-Time Adaptation via Multi-View Co-Training | Code | 0
Spatially Gene Expression Prediction using Dual-Scale Contrastive Learning | Code | 0
A Closer Look at Conditional Prompt Tuning for Vision-Language Models | Code | 0
Calligrapher: Freestyle Text Image Customization | - | 0
FreeLong++: Training-Free Long Video Generation via Multi-band SpectralFusion | - | 0
MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments | Code | 0
Evaluation of Geolocation Capabilities of Multimodal Large Language Models and Analysis of Associated Privacy Risks | Code | 0
Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration | Code | 0
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer | Code | 0
Visual Textualization for Image Prompted Object Detection | Code | 0
EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations | Code | 0
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes | Code | 0
When Will It Fail?: Anomaly to Prompt for Forecasting Future Anomalies in Time Series | Code | 0
Page 269 of 9486