SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 89518975 of 177340 papers

TitleStatusHype
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document UnderstandingCode2
Centerline Boundary Dice Loss for Vascular SegmentationCode2
Benchmarking Predictive Coding Networks -- Made SimpleCode2
A Survey of Personalization: From RAG to AgentCode2
Discovering symbolic expressions with parallelized tree searchCode2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language ModelsCode2
See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of DecompositionCode2
RPN: Reconciled Polynomial Network Towards Unifying PGMs, Kernel SVMs, MLP and KANCode2
Language Representations Can be What Recommenders Need: Findings and PotentialsCode2
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion RecognitionCode2
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention MapsCode2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous ExplorationCode2
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view VideosCode2
Adaptive Parametric ActivationCode2
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous DrivingCode2
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address LocalizationCode2
xLSTMTime : Long-term Time Series Forecasting With xLSTMCode2
Image Compression for Machine and Human Vision with Spatial-Frequency AdaptationCode2
GOFA: A Generative One-For-All Model for Joint Graph Language ModelingCode2
TTSDS -- Text-to-Speech Distribution ScoreCode2
UrbanWorld: An Urban World Model for 3D City GenerationCode2
GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure DetectionCode2
A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and BeyondCode2
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted FeaturesCode2
Weak-to-Strong ReasoningCode2
Show:102550
← PrevPage 359 of 7094Next →