SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 36013650 of 661570 papers

TitleStatusHype
ViTPose++: Vision Transformer for Generic Body Pose EstimationCode3
FAN: Fourier Analysis NetworksCode3
FilterNet: Harnessing Frequency Filters for Time Series ForecastingCode3
QuEst: Graph Transformer for Quantum Circuit Reliability EstimationCode3
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker ExtractionCode3
KVzip: Query-Agnostic KV Cache Compression with Context ReconstructionCode3
BERGEN: A Benchmarking Library for Retrieval-Augmented GenerationCode3
MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI ApplicationsCode3
Evaluating Text-to-Visual Generation with Image-to-Text GenerationCode3
Attention Is All You NeedCode3
CodeTF: One-stop Transformer Library for State-of-the-art Code LLMCode3
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character CustomizationCode3
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video GenerationCode3
Residual Kolmogorov-Arnold Network for Enhanced Deep LearningCode3
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head GenerationCode3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into OneCode3
A Survey on LoRA of Large Language ModelsCode3
VisionZip: Longer is Better but Not Necessary in Vision Language ModelsCode3
Humans in 4D: Reconstructing and Tracking Humans with TransformersCode3
Sigmoid Loss for Language Image Pre-TrainingCode3
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal UnderstandingCode3
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward FeedbackCode3
Husky: A Unified, Open-Source Language Agent for Multi-Step ReasoningCode3
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific AdaptationCode3
Restoring Images in Adverse Weather Conditions via Histogram TransformerCode3
MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware DiffusionCode3
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop QueriesCode3
NerfAcc: A General NeRF Acceleration ToolboxCode3
Llemma: An Open Language Model For MathematicsCode3
Datasets: A Community Library for Natural Language ProcessingCode3
Tri-Perspective View for Vision-Based 3D Semantic Occupancy PredictionCode3
ResNeSt: Split-Attention NetworksCode3
MedSegDiff-V2: Diffusion based Medical Image Segmentation with TransformerCode3
IEPile: Unearthing Large-Scale Schema-Based Information Extraction CorpusCode3
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIsCode3
Dynamic Cheatsheet: Test-Time Learning with Adaptive MemoryCode3
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMsCode3
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked ModelingCode3
Inferring Articulated Rigid Body Dynamics from RGBD VideoCode3
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual ComprehensionCode3
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts AdaptersCode3
Neural Network Verification with Branch-and-Bound for General NonlinearitiesCode3
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content CreationCode3
DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and PredictionCode3
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly DetectionCode3
Diffusion Model-Based Video Editing: A SurveyCode3
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter TransferCode3
BoT-SORT: Robust Associations Multi-Pedestrian TrackingCode3
TopoBench: A Framework for Benchmarking Topological Deep LearningCode3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image GenerationCode3
Show:102550
← PrevPage 73 of 13232Next →