SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1020110225 of 177340 papers

TitleStatusHype
Frozen Transformers in Language Models Are Effective Visual Encoder LayersCode2
The Russian Legislative CorpusCode2
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instructionCode2
FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher InformationCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
Generative Pretraining from PixelsCode2
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)Code2
Invertible Diffusion Models for Compressed SensingCode2
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionCode2
OpenCity: Open Spatio-Temporal Foundation Models for Traffic PredictionCode2
Toward Unified Practices in Trajectory Prediction Research on Bird's-Eye-View DatasetsCode2
Closing the Gap Between Synthetic and Ground Truth Time Series Distributions via Neural MappingCode2
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space DualityCode2
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic ResolutionCode2
An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLMCode2
EMOv2: Pushing 5M Vision Model FrontierCode2
PSP-HDRI+: A Synthetic Dataset Generator for Pre-Training of Human-Centric Computer Vision ModelsCode2
OpenBox: A Python Toolkit for Generalized Black-box OptimizationCode2
When Attention Meets Fast Recurrence: Training Language Models with Reduced ComputeCode2
ICML 2023 Topological Deep Learning Challenge : Design and ResultsCode2
Longhorn: State Space Models are Amortized Online LearnersCode2
CCPL: Contrastive Coherence Preserving Loss for Versatile Style TransferCode2
A mmWave Software-Defined Array Platform for Wireless Experimentation at 24-29.5 GHzCode2
Empirical Asset Pricing with Large Language Model AgentsCode2
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT ImprovementsCode2
Show:102550
← PrevPage 409 of 7094Next →