SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 59265950 of 474278 papers

TitleStatusHype
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language ModelCode2
D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPSCode2
WritingBench: A Comprehensive Benchmark for Generative WritingCode2
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous DrivingCode2
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired SketchingCode2
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-ExpertsCode2
EDM: Efficient Deep Feature MatchingCode2
Encrypted Vector Similarity Computations Using Partially Homomorphic Encryption: Applications and Performance AnalysisCode2
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred ImagesCode2
PromptPex: Automatic Test Generation for Language Model PromptsCode2
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information RetrievalCode2
Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHACode2
Generalized Interpolating Discrete DiffusionCode2
Omnidirectional Multi-Object TrackingCode2
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur PriorCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
ProtComposer: Compositional Protein Structure Generation with 3D EllipsoidsCode2
An Egocentric Vision-Language Model based Portable Real-time Smart AssistantCode2
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLMCode2
Scaling Rich Style-Prompted Text-to-Speech DatasetsCode2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking CapabilitiesCode2
PDX: A Data Layout for Vector Similarity SearchCode2
Collaborative Expert LLMs Guided Multi-Objective Molecular OptimizationCode2
Universal Narrative Model: an Author-centric Storytelling Framework for Generative AICode2
Show:102550
← PrevPage 238 of 18972Next →