SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1360113650 of 474278 papers

TitleStatusHype
FixMatch: Simplifying Semi-Supervised Learning with Consistency and ConfidenceCode2
Mask-Free Video Instance SegmentationCode2
Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced AnalysisCode2
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot LearningCode2
ARF: Artistic Radiance FieldsCode2
A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-ResolutionCode2
Bolt: Accelerated Data Mining with Fast Vector CompressionCode2
Learning Dense Representations of Phrases at ScaleCode2
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3DCode2
GIS Copilot: Towards an Autonomous GIS Agent for Spatial AnalysisCode2
FSGS: Real-Time Few-shot View Synthesis using Gaussian SplattingCode2
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View SynthesisCode2
Simple Guidance Mechanisms for Discrete Diffusion ModelsCode2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
Geoopt: Riemannian Optimization in PyTorchCode2
RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion ModelsCode2
Visual Generation Without GuidanceCode2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic CorrespondenceCode2
Fast convolutional neural networks on FPGAs with hls4mlCode2
Conformal prediction under ambiguous ground truthCode2
Torsional Diffusion for Molecular Conformer GenerationCode2
Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband RangingCode2
TSFEL: Time Series Feature Extraction LibraryCode2
BEACON: Benchmark for Comprehensive RNA Tasks and Language ModelsCode2
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
Fast R-CNNCode2
ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsCode2
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal AlignmentCode2
TableBank: Table Benchmark for Image-based Table Detection and RecognitionCode2
Towards Garment Sewing Pattern Reconstruction from a Single ImageCode2
Deep TEN: Texture Encoding NetworkCode2
3D Human Mesh Estimation from Virtual MarkersCode2
TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language ProcessingCode2
DensePose From WiFiCode2
MGMap: Mask-Guided Learning for Online Vectorized HD Map ConstructionCode2
Stand-Alone Self-Attention in Vision ModelsCode2
FASTER: Fast and Safe Trajectory Planner for Navigation in Unknown EnvironmentsCode2
CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for ChineseCode2
Data-Free Learning of Student NetworksCode2
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language ModelsCode2
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMsCode2
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR SystemCode2
Binary Neural Networks: A SurveyCode2
Is Space-Time Attention All You Need for Video Understanding?Code2
Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language ModelsCode2
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-ImprovementCode2
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorchCode2
Training Graph Neural Networks with 1000 LayersCode2
Construction of a Japanese Financial Benchmark for Large Language ModelsCode2
JAX MD: A Framework for Differentiable PhysicsCode2
Show:102550
← PrevPage 273 of 9486Next →