SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 45014525 of 177340 papers

TitleStatusHype
Accelerating Neural Network Training: An Analysis of the AlgoPerf CompetitionCode3
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological CounselingCode3
U^2-Net: Going Deeper with Nested U-Structure for Salient Object DetectionCode3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-ServeCode3
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding ModelsCode3
Robust High-Resolution Video Matting with Temporal GuidanceCode3
DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingCode3
A Practical Review of Mechanistic Interpretability for Transformer-Based Language ModelsCode3
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model AgentsCode3
TristouNet: Triplet Loss for Speaker Turn EmbeddingCode3
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The WildCode3
LION: Linear Group RNN for 3D Object Detection in Point CloudsCode3
Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture GenerationCode3
Robust and Accurate Object Detection via Adversarial LearningCode3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid ArchitectureCode3
MultiModal-GPT: A Vision and Language Model for Dialogue with HumansCode3
Behavior Generation with Latent ActionsCode3
OmniAudio: Generating Spatial Audio from 360-Degree VideoCode3
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSONCode3
Ludwig: a type-based declarative deep learning toolboxCode3
Comparison of Syntactic and Semantic Representations of Programs in Neural EmbeddingsCode3
WantWords: An Open-source Online Reverse Dictionary SystemCode3
Implicit Style-Content Separation using B-LoRACode3
EfficientQAT: Efficient Quantization-Aware Training for Large Language ModelsCode3
Show:102550
← PrevPage 181 of 7094Next →