SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 29513000 of 659983 papers

TitleStatusHype
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG GenerationCode3
CityWalker: Learning Embodied Urban Navigation from Web-Scale VideosCode3
Star Attention: Efficient LLM Inference over Long SequencesCode3
On the Efficiency of NLP-Inspired Methods for Tabular Deep LearningCode3
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous DrivingCode3
Cautious Optimizers: Improving Training with One Line of CodeCode3
BayLing 2: A Multilingual Large Language Model with Efficient Language AlignmentCode3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAMCode3
Nimbus: Secure and Efficient Two-Party Inference for TransformersCode3
MobileMamba: Lightweight Multi-Receptive Visual Mamba NetworkCode3
BIP3D: Bridging 2D Images and 3D Perception for Embodied IntelligenceCode3
TEXGen: a Generative Diffusion Model for Mesh TexturesCode3
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMsCode3
3D Convex Splatting: Radiance Field Rendering with 3D Smooth ConvexesCode3
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data ProcessingCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language ModelsCode3
Stable Flow: Vital Layers for Training-Free Image EditingCode3
Video-RAG: Visually-aligned Retrieval-Augmented Long Video ComprehensionCode3
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context TrainingCode3
REDUCIO! Generating 10241024 Video within 16 Seconds using Extremely Compressed Motion LatentsCode3
Interactive Medical Image Segmentation: A Benchmark Dataset and BaselineCode3
ACE2: Accurately learning subseasonal to decadal atmospheric variability and forced responsesCode3
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving ScenesCode3
FlipSketch: Flipping Static Drawings to Text-Guided Sketch AnimationsCode3
Model Inversion Attacks: A Survey of Approaches and CountermeasuresCode3
WavChat: A Survey of Spoken Dialogue ModelsCode3
Jailbreak Attacks and Defenses against Multimodal Generative Models: A SurveyCode3
Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and ForecastsCode3
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse AutoencodersCode3
CameraHMR: Aligning People with PerspectiveCode3
MureObjectStitch: Multi-reference Image CompositionCode3
The Surprising Effectiveness of Test-Time Training for Few-Shot LearningCode3
General Geospatial Inference with a Population Dynamics Foundation ModelCode3
SplatFormer: Point Transformer for Robust 3D Gaussian SplattingCode3
Game-theoretic LLM: Agent Workflow for Negotiation GamesCode3
Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storageCode3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile ManipulationCode3
ZipNN: Lossless Compression for AI ModelsCode3
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsCode3
Classification Done Right for Vision-Language Pre-TrainingCode3
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
AutoVFX: Physically Realistic Video Editing from Natural Language InstructionsCode3
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and TrustworthinessCode3
ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series TransformerCode3
Addressing Representation Collapse in Vector Quantized Models with One Linear LayerCode3
Show:102550
← PrevPage 60 of 13200Next →