SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 64516500 of 661570 papers

TitleStatusHype
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory ComputingCode2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
Towards In-the-wild 3D Plane Reconstruction from a Single ImageCode2
Test3R: Learning to Reconstruct 3D at Test TimeCode2
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and TrendsCode2
Flow-Anchored Consistency ModelsCode2
Feed-Forward SceneDINO for Unsupervised Semantic Scene CompletionCode2
EAMamba: Efficient All-Around Vision State Space Model for Image RestorationCode2
Open Source Planning & Control System with Language Agents for Autonomous Scientific DiscoveryCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal ModelCode2
Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic ThresholdingCode2
The Best of Both Worlds: Combining Recent Advances in Neural Machine TranslationCode2
Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM EraCode2
Generating Benchmarks for Factuality Evaluation of Language ModelsCode2
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?Code2
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction TuningCode2
WavJourney: Compositional Audio Creation with Large Language ModelsCode2
OpenNRE: An Open and Extensible Toolkit for Neural Relation ExtractionCode2
Temporal Action Detection with Structured Segment NetworksCode2
Flash normalization: fast RMSNorm for LLMsCode2
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph StructuresCode2
PivotNet: Vectorized Pivot Learning for End-to-end HD Map ConstructionCode2
An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval InterfaceCode2
Neptune: The Long Orbit to Benchmarking Long Video UnderstandingCode2
Unified Vision-Language Pre-Training for Image Captioning and VQACode2
Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous VehiclesCode2
SpecReason: Fast and Accurate Inference-Time Compute via Speculative ReasoningCode2
MatMamba: A Matryoshka State Space ModelCode2
High-dimensional Convolutional Networks for Geometric Pattern RecognitionCode2
Boosting Neural Representations for Videos with a Conditional DecoderCode2
A Pilot Study for Chinese SQL Semantic ParsingCode2
Differentiable Convex Optimization LayersCode2
Thought Cloning: Learning to Think while Acting by Imitating Human ThinkingCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
Transferability of Adversarial Examples to Attack Cloud-based Image Classifier ServiceCode2
A Little Fog for a Large TurnCode2
Torch-Struct: Deep Structured Prediction LibraryCode2
Don't be lazy: CompleteP enables compute-efficient deep transformersCode2
Semantically-Guided Representation Learning for Self-Supervised Monocular DepthCode2
Unbiased Scene Graph Generation from Biased TrainingCode2
Knowledge GraphsCode2
Compressing deep neural networks on FPGAs to binary and ternary precision with HLS4MLCode2
UnetTSF: A Better Performance Linear Complexity Time Series Prediction ModelCode2
Detection in Crowded Scenes: One Proposal, Multiple PredictionsCode2
Quantile Encoder: Tackling High Cardinality Categorical Features in Regression ProblemsCode2
Fixing the train-test resolution discrepancy: FixEfficientNetCode2
Self-Supervised Log ParsingCode2
Augmenting Differentiable Simulators with Neural Networks to Close the Sim2Real GapCode2
COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest X-Ray ImagesCode2
Show:102550
← PrevPage 130 of 13232Next →