SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 9301-9350 of 661570 papers

Title | Status | Hype
EMOv2: Pushing 5M Vision Model Frontier | Code | 2
PSP-HDRI+: A Synthetic Dataset Generator for Pre-Training of Human-Centric Computer Vision Models | Code | 2
OpenBox: A Python Toolkit for Generalized Black-box Optimization | Code | 2
When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute | Code | 2
ICML 2023 Topological Deep Learning Challenge : Design and Results | Code | 2
Longhorn: State Space Models are Amortized Online Learners | Code | 2
CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer | Code | 2
A mmWave Software-Defined Array Platform for Wireless Experimentation at 24-29.5 GHz | Code | 2
Empirical Asset Pricing with Large Language Model Agents | Code | 2
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements | Code | 2
DCoM: Active Learning for All Learners | Code | 2
Foundation Models for Remote Sensing and Earth Observation: A Survey | Code | 2
PMC-LLaMA: Towards Building Open-source Language Models for Medicine | Code | 2
SWE-bench Goes Live! | Code | 2
Uncertainty-Informed Deep Learning Models Enable High-Confidence Predictions for Digital Histopathology | Code | 2
Accelerated Policy Learning with Parallel Differentiable Simulation | Code | 2
SimVP: Simpler yet Better Video Prediction | Code | 2
Rethinking Imitation-based Planner for Autonomous Driving | Code | 2
Contrastive Flow Matching | Code | 2
Conformal prediction interval for dynamic time-series | Code | 2
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale | Code | 2
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Code | 2
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | Code | 2
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models | Code | 2
Investigating image-based fallow weed detection performance on Raphanus sativus and Avena sativa at speeds up to 30 km h^-1 | Code | 2
Training Socially Aligned Language Models on Simulated Social Interactions | Code | 2
Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Code | 2
End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Code | 2
Solving Data Quality Problems with Desbordante: a Demo | Code | 2
Dense Text-to-Image Generation with Attention Modulation | Code | 2
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Code | 2
PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips | Code | 2
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model | Code | 2
Joint Audio and Speech Understanding | Code | 2
AdaLomo: Low-memory Optimization with Adaptive Learning Rate | Code | 2
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? | Code | 2
Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting | Code | 2
Learning for CasADi: Data-driven Models in Numerical Optimization | Code | 2
Tokenize Anything via Prompting | Code | 2
Diffusion Models without Classifier-free Guidance | Code | 2
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning | Code | 2
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Code | 2
General Flow as Foundation Affordance for Scalable Robot Learning | Code | 2
VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Code | 2
DBConformer: Dual-Branch Convolutional Transformer for EEG Decoding | Code | 2
CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification | Code | 2
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction | Code | 2
RRHF: Rank Responses to Align Language Models with Human Feedback without tears | Code | 2
GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats | Code | 2
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning | Code | 2