SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 9376-9400 of 177340 papers

Title | Status | Hype
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Code | 2
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Code | 2
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model | Code | 2
Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition | Code | 2
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation | Code | 2
Overview of the PromptCBLUE Shared Task in CHIP2023 | Code | 2
DebugBench: Evaluating Debugging Capability of Large Language Models | Code | 2
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Code | 2
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs | Code | 2
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Code | 2
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion | Code | 2
VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation | Code | 2
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Code | 2
An Efficient and Mixed Heterogeneous Model for Image Restoration | Code | 2
Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Code | 2
Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development | Code | 2
Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras | Code | 2
2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection | Code | 2
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Code | 2
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models | Code | 2
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Code | 2
Bottleneck Transformers for Visual Recognition | Code | 2
HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution | Code | 2
Page 376 of 7094