SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5751-5800 of 661,570 papers

Title | Status | Hype
LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language | Code | 2
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations | Code | 2
Learning local equivariant representations for quantum operators | Code | 2
Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation | Code | 2
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context | Code | 2
ByT5 model for massively multilingual grapheme-to-phoneme conversion | Code | 2
Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal Datasets | Code | 2
Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Code | 2
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Code | 2
SEGAN: Speech Enhancement Generative Adversarial Network | Code | 2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection | Code | 2
Progressive Distillation for Fast Sampling of Diffusion Models | Code | 2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Code | 2
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes | Code | 2
Think While You Generate: Discrete Diffusion with Planned Denoising | Code | 2
VDT: General-purpose Video Diffusion Transformers via Mask Modeling | Code | 2
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba | Code | 2
Lost in the Middle: How Language Models Use Long Contexts | Code | 2
Representation Engineering: A Top-Down Approach to AI Transparency | Code | 2
WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting | Code | 2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Code | 2
Aligning Text-to-Image Diffusion Models with Reward Backpropagation | Code | 2
Temporal Graph Benchmark for Machine Learning on Temporal Graphs | Code | 2
A Survey on Data Augmentation in Large Model Era | Code | 2
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference | Code | 2
Active-Learning-as-a-Service: An Automatic and Efficient MLOps System for Data-Centric AI | Code | 2
XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation | Code | 2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Code | 2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Code | 2
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Code | 2
AnyLoc: Towards Universal Visual Place Recognition | Code | 2
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression | Code | 2
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Code | 2
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Code | 2
ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization | Code | 2
KVQ: Kwai Video Quality Assessment for Short-form Videos | Code | 2
MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis | Code | 2
On Embeddings for Numerical Features in Tabular Deep Learning | Code | 2
3D Vision with Transformers: A Survey | Code | 2
DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Code | 2
Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs | Code | 2
How to Merge Your Multimodal Models Over Time? | Code | 2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance | Code | 2
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation | Code | 2
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment | Code | 2
MM-IFEngine: Towards Multimodal Instruction Following | Code | 2
MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models | Code | 2
Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Code | 2
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs | Code | 2
Maintaining Plasticity in Deep Continual Learning | Code | 2
Page 116 of 13,232