SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1905119100 of 474278 papers

TitleStatusHype
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position EncodingCode1
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-OptimizationCode1
DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed DatasetsCode1
Learning Open-vocabulary Semantic Segmentation Models From Natural Language SupervisionCode1
Sliding Window FastEdit: A Framework for Lesion Annotation in Whole-body PET ImagesCode1
SAGDFN: A Scalable Adaptive Graph Diffusion Forecasting Network for Multivariate Time Series ForecastingCode1
Semi-Supervised Deep Regression with Uncertainty Consistency and Variational Model Ensembling via Bayesian Neural NetworksCode1
A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime DetectionCode1
AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training DataCode1
Self-Training Guided Disentangled Adaptation for Cross-Domain Remote Sensing Image Semantic SegmentationCode1
Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy HessiansCode1
Self-Supervised Correspondence Estimation via Multiview RegistrationCode1
Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive LearningCode1
Reinforced Structured State-Evolution for Vision-Language NavigationCode1
Visual Sound Localization in the Wild by Cross-Modal Interference ErasingCode1
Unlocking State-Tracking in Linear RNNs Through Negative EigenvaluesCode1
Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and ApplicationsCode1
Hyperbolic Geometric Latent Diffusion Model for Graph GenerationCode1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech RepresentationCode1
DevBench: A multimodal developmental benchmark for language learningCode1
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained DevicesCode1
ForecastPFN: Synthetically-Trained Zero-Shot ForecastingCode1
Normalizing Flows are Capable Models for RLCode1
Multi-Stage Episodic Control for Strategic Exploration in Text GamesCode1
EWMoE: An effective model for global weather forecasting with mixture-of-expertsCode1
Benchmarking Data Science AgentsCode1
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution GeneralisationCode1
Introducing Thermodynamics-Informed Symbolic Regression -- A Tool for Thermodynamic Equations of State DevelopmentCode1
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language ModelsCode1
Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QACode1
Supervised Adversarial Contrastive Learning for Emotion Recognition in ConversationsCode1
Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT ReconstructionCode1
Mosaic-IT: Free Compositional Data Augmentation Improves Instruction TuningCode1
The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image PriorCode1
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement LearningCode1
LED: Light Enhanced Depth Estimation at NightCode1
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode1
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive PruningCode1
PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample ConsensusCode1
Neural Target Speech Extraction: An OverviewCode1
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution ShiftsCode1
Towards Few-Shot Adaptation of Foundation Models via Multitask FinetuningCode1
ResiDual: Transformer with Dual Residual ConnectionsCode1
AMD-Hummingbird: Towards an Efficient Text-to-Video ModelCode1
Unbiased Teacher for Semi-Supervised Object DetectionCode1
Truncation Sampling as Language Model DesmoothingCode1
SatLM: Satisfiability-Aided Language Models Using Declarative PromptingCode1
Learning Weakly Convex Regularizers for Convergent Image-Reconstruction AlgorithmsCode1
MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style ConvolutionCode1
Show:102550
← PrevPage 382 of 9486Next →