SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1390113950 of 177340 papers

TitleStatusHype
Retrieval Oriented Masking Pre-training Language Model for Dense Passage RetrievalCode2
TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese MedicineCode2
Synchromesh: Reliable code generation from pre-trained language modelsCode2
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion ModelsCode2
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion ModelsCode2
AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion DistillationCode2
Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading StrategiesCode2
Towards Automatically-Tuned Deep Neural NetworksCode2
Offline RL for Natural Language Generation with Implicit Language Q LearningCode2
Fully Test-Time Adaptation for Monocular 3D Object DetectionCode2
DeepAR: Probabilistic Forecasting with Autoregressive Recurrent NetworksCode2
Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based ApproachCode2
A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand predictionCode2
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language ModelsCode2
Language Model Crossover: Variation through Few-Shot PromptingCode2
Why do tree-based models still outperform deep learning on typical tabular data?Code2
AutoVerus: Automated Proof Generation for Rust CodeCode2
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving ScenesCode2
Preference Optimization for Reasoning with Pseudo FeedbackCode2
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music AudioCode2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene UnderstandingCode2
Video-P2P: Video Editing with Cross-attention ControlCode2
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language ModelsCode2
Focusing on Tracks for Online Multi-Object TrackingCode2
Consistency-diversity-realism Pareto fronts of conditional image generative modelsCode2
Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal LossCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Unleashing Large-Scale Video Generative Pre-training for Visual Robot ManipulationCode2
Character-Aware Models Improve Visual Text RenderingCode2
PetFace: A Large-Scale Dataset and Benchmark for Animal IdentificationCode2
MOODv2: Masked Image Modeling for Out-of-Distribution DetectionCode2
Clifford Neural Layers for PDE ModelingCode2
LamRA: Large Multimodal Model as Your Advanced Retrieval AssistantCode2
Mean Deviation Similarity Index: Efficient and Reliable Full-Reference Image Quality EvaluatorCode2
WavMark: Watermarking for Audio GenerationCode2
CodeEditorBench: Evaluating Code Editing Capability of Large Language ModelsCode2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic SegmentationCode2
Chemformer: a pre-trained transformer for computational chemistryCode2
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and DebuggingCode2
FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENAHNCMENTCode2
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech EditingCode2
Video-P2P: Video Editing with Cross-attention ControlCode2
MAUVE Scores for Generative Models: Theory and PracticeCode2
Label Efficient Visual Abstractions for Autonomous DrivingCode2
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration TransducerCode2
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
Self-Supervised Transformers for Unsupervised Object Discovery using Normalized CutCode2
Few-Shot Scene Classification of Optical Remote Sensing Images Leveraging Calibrated Pretext TasksCode2
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image RestorationCode2
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action RecognitionCode2
Show:102550
← PrevPage 279 of 3547Next →