SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1405114100 of 474278 papers

TitleStatusHype
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language ModelsCode2
Language Model Crossover: Variation through Few-Shot PromptingCode2
Why do tree-based models still outperform deep learning on typical tabular data?Code2
AutoVerus: Automated Proof Generation for Rust CodeCode2
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving ScenesCode2
Preference Optimization for Reasoning with Pseudo FeedbackCode2
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music AudioCode2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene UnderstandingCode2
Video-P2P: Video Editing with Cross-attention ControlCode2
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language ModelsCode2
Focusing on Tracks for Online Multi-Object TrackingCode2
Consistency-diversity-realism Pareto fronts of conditional image generative modelsCode2
Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal LossCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Unleashing Large-Scale Video Generative Pre-training for Visual Robot ManipulationCode2
Character-Aware Models Improve Visual Text RenderingCode2
PetFace: A Large-Scale Dataset and Benchmark for Animal IdentificationCode2
MOODv2: Masked Image Modeling for Out-of-Distribution DetectionCode2
Clifford Neural Layers for PDE ModelingCode2
LamRA: Large Multimodal Model as Your Advanced Retrieval AssistantCode2
Mean Deviation Similarity Index: Efficient and Reliable Full-Reference Image Quality EvaluatorCode2
WavMark: Watermarking for Audio GenerationCode2
CodeEditorBench: Evaluating Code Editing Capability of Large Language ModelsCode2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic SegmentationCode2
Chemformer: a pre-trained transformer for computational chemistryCode2
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and DebuggingCode2
FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENAHNCMENTCode2
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech EditingCode2
Video-P2P: Video Editing with Cross-attention ControlCode2
MAUVE Scores for Generative Models: Theory and PracticeCode2
Label Efficient Visual Abstractions for Autonomous DrivingCode2
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration TransducerCode2
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
Self-Supervised Transformers for Unsupervised Object Discovery using Normalized CutCode2
Few-Shot Scene Classification of Optical Remote Sensing Images Leveraging Calibrated Pretext TasksCode2
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image RestorationCode2
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action RecognitionCode2
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single ViewCode2
AnoDDPM: Anomaly Detection With Denoising Diffusion Probabilistic Models Using Simplex NoiseCode2
Efficient Neural Audio SynthesisCode2
Muse: Text-To-Image Generation via Masked Generative TransformersCode2
A Generalist AgentCode2
Stochastic Interpolants: A Unifying Framework for Flows and DiffusionsCode2
Unsupervised Cross-Domain Image GenerationCode2
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive TasksCode2
Temporal Feature Matters: A Framework for Diffusion Model QuantizationCode2
ReFocus: Visual Editing as a Chain of Thought for Structured Image UnderstandingCode2
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality AssumptionCode2
Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network AnalysisCode2
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing EncoderCode2
Show:102550
← PrevPage 282 of 9486Next →