SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 62266250 of 474278 papers

TitleStatusHype
Pose-Based Sign Language Spotting via an End-to-End Encoder ArchitectureCode0
Enhancing Floor Plan Recognition: A Hybrid Mix-Transformer and U-Net Approach for Precise Wall SegmentationCode0
Llama-based source code vulnerability detection: Prompt engineering vs Fine tuningCode0
Guiding WaveMamba with Frequency Maps for Image DebandingCode0
Open Polymer Challenge: Post-Competition ReportCode0
UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMsCode0
Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive GenerationCode0
HealthcareNLP: where are we and what is next?Code0
What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute ImportanceCode0
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language ModelsCode0
Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining DisastersCode0
End-to-End Fine-Tuning of 3D Texture Generation using Differentiable RewardsCode0
Pay Less Attention to Function Words for Free Robustness of Vision-Language ModelsCode0
From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise ProductionCode0
LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions0
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL0
EgoX: Egocentric Video Generation from a Single Exocentric Video0
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels0
LapFM: A Laparoscopic Segmentation Foundation Model via Hierarchical Concept Evolving Pre-trainingCode0
Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference RelationshipsCode0
SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from VideosCode0
FRWKV:Frequency-Domain Linear Attention for Long-Term Time Series ForecastingCode0
Fine-grained Spatiotemporal Grounding on Egocentric VideosCode0
Language Models for Controllable DNA Sequence DesignCode0
IRPO: Boosting Image Restoration via Post-training GRPOCode0
Show:102550
← PrevPage 250 of 18972Next →