SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 67516800 of 177340 papers

TitleStatusHype
Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal DatasetsCode2
Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object TrackingCode2
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene UnderstandingCode2
SEGAN: Speech Enhancement Generative Adversarial NetworkCode2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
Progressive Distillation for Fast Sampling of Diffusion ModelsCode2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic ScenesCode2
Think While You Generate: Discrete Diffusion with Planned DenoisingCode2
VDT: General-purpose Video Diffusion Transformers via Mask ModelingCode2
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMambaCode2
Lost in the Middle: How Language Models Use Long ContextsCode2
Representation Engineering: A Top-Down Approach to AI TransparencyCode2
WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian SplattingCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Aligning Text-to-Image Diffusion Models with Reward BackpropagationCode2
Temporal Graph Benchmark for Machine Learning on Temporal GraphsCode2
A Survey on Data Augmentation in Large Model EraCode2
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model InferenceCode2
Active-Learning-as-a-Service: An Automatic and Efficient MLOps System for Data-Centric AICode2
XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented GenerationCode2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsCode2
AnyLoc: Towards Universal Visual Place RecognitionCode2
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor RegressionCode2
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory MatchingCode2
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU LanguagesCode2
ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video ColorizationCode2
KVQ: Kwai Video Quality Assessment for Short-form VideosCode2
MedPromptX: Grounded Multimodal Prompting for Chest X-ray DiagnosisCode2
On Embeddings for Numerical Features in Tabular Deep LearningCode2
3D Vision with Transformers: A SurveyCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphsCode2
How to Merge Your Multimodal Models Over Time?Code2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for FinanceCode2
DS-1000: A Natural and Reliable Benchmark for Data Science Code GenerationCode2
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve AdjustmentCode2
MM-IFEngine: Towards Multimodal Instruction FollowingCode2
MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting ModelsCode2
Animal Avatars: Reconstructing Animatable 3D Animals from Casual VideosCode2
CFBench: A Comprehensive Constraints-Following Benchmark for LLMsCode2
Maintaining Plasticity in Deep Continual LearningCode2
Text-Only Training for Image Captioning using Noise-Injected CLIPCode2
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading SystemsCode2
Leveraging Temporal Contextualization for Video Action RecognitionCode2
Towards Building Text-To-Speech Systems for the Next Billion UsersCode2
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityCode2
Show:102550
← PrevPage 136 of 3547Next →