SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9551–9575 of 474,278 papers

Title | Status | Hype
Single Domain Generalization for Crowd Counting | Code | 2
What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Code | 2
AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting | Code | 2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Code | 2
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Code | 2
Scattered Mixture-of-Experts Implementation | Code | 2
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency | Code | 2
Caltech Aerial RGB-Thermal Dataset in the Wild | Code | 2
A Decade's Battle on Dataset Bias: Are We There Yet? | Code | 2
Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study | Code | 2
Knowledge Conflicts for LLMs: A Survey | Code | 2
MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Code | 2
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Code | 2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Code | 2
Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model | Code | 2
Envision3D: One Image to 3D with Anchor Views Interpolation | Code | 2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model | Code | 2
Language models scale reliably with over-training and on downstream tasks | Code | 2
Pairwise Comparisons Are All You Need | Code | 2
CleanAgent: Automating Data Standardization with LLM-based Agents | Code | 2
JAXbind: Bind any function to JAX | Code | 2
Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era | Code | 2
AcademiaOS: Automating Grounded Theory Development in Qualitative Research with Large Language Models | Code | 2
FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Code | 2
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Code | 2