SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 96019650 of 661570 papers

TitleStatusHype
G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid ModelCode2
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text ClassificationCode2
U-KAN Makes Strong Backbone for Medical Image Segmentation and GenerationCode2
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and RecognitionCode2
ConceptNet 5.5: An Open Multilingual Graph of General KnowledgeCode2
Efficient One-Pass End-to-End Entity Linking for QuestionsCode2
On the Emergence of Thinking in LLMs I: Searching for the Right IntuitionCode2
3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level SupervisionsCode2
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing IndustryCode2
A Better Variant of Self-Critical Sequence TrainingCode2
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map GenerationCode2
Pedagogical Alignment of Large Language ModelsCode2
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNetCode2
EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian SplattingCode2
Preble: Efficient Distributed Prompt Scheduling for LLM ServingCode2
Deep Learning-based Compression Detection for explainable Face Image Quality AssessmentCode2
Debiasing Multimodal Large Language ModelsCode2
Representation Learning and Identity Adversarial Training for Facial Behavior UnderstandingCode2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image AlignmentCode2
Streaming Anomaly DetectionCode2
FP8-LM: Training FP8 Large Language ModelsCode2
Causal structure learning with momentum: Sampling distributions over Markov Equivalence Classes of DAGsCode2
A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal VerificationCode2
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based AgentsCode2
Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood AttentionCode2
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data EfficiencyCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
Dual Aggregation Transformer for Image Super-ResolutionCode2
Zooming Out on Zooming In: Advancing Super-Resolution for Remote SensingCode2
Curiosity-driven Red-teaming for Large Language ModelsCode2
Gradient Alignment for Cross-Domain Face Anti-SpoofingCode2
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument LeakageCode2
Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous DrivingCode2
Event-based Stereo Depth Estimation: A SurveyCode2
Aesthetic Text Logo Synthesis via Content-aware Layout InferringCode2
Neural 3D Scene Reconstruction with the Manhattan-world AssumptionCode2
Text2Performer: Text-Driven Human Video GenerationCode2
Large language models can be zero-shot anomaly detectors for time series?Code2
JADE: A Linguistics-based Safety Evaluation Platform for Large Language ModelsCode2
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling PriorCode2
Koopa: Learning Non-stationary Time Series Dynamics with Koopman PredictorsCode2
StyleDubber: Towards Multi-Scale Style Learning for Movie DubbingCode2
Learning to Solve Job Shop Scheduling under UncertaintyCode2
MatchTime: Towards Automatic Soccer Game Commentary GenerationCode2
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 LanguagesCode2
Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse ProblemsCode2
u-μP: The Unit-Scaled Maximal Update ParametrizationCode2
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion ModelsCode2
Drone-assisted Road Gaussian Splatting with Cross-view UncertaintyCode2
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content GenerationCode2
Show:102550
← PrevPage 193 of 13232Next →