SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 19051-19100 of 474278 papers

Title | Status | Hype
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL | Code | 1
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts | Code | 1
CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information | Code | 1
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Code | 1
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection | Code | 1
Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics | Code | 1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Code | 1
FM2S: Towards Spatially-Correlated Noise Modeling in Zero-Shot Fluorescence Microscopy Image Denoising | Code | 1
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection | Code | 1
GraSP: Simple yet Effective Graph Similarity Predictions | Code | 1
Semi-IIN: Semi-supervised Intra-inter modal Interaction Learning Network for Multimodal Sentiment Analysis | Code | 1
ChainStream: An LLM-based Framework for Unified Synthetic Sensing | Code | 1
Multi-Head Encoding for Extreme Label Classification | Code | 1
The Complexity Dynamics of Grokking | Code | 1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data | Code | 1
Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation | Code | 1
CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Code | 1
waveOrder: generalist framework for label-agnostic computational microscopy | Code | 1
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers | Code | 1
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Code | 1
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM | Code | 1
Towards Open-Vocabulary Video Semantic Segmentation | Code | 1
Enhancing Implicit Neural Representations via Symmetric Power Transformation | Code | 1
Federated Foundation Models on Heterogeneous Time Series | Code | 1
Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification | Code | 1
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation | Code | 1
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Code | 1
A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problems | Code | 1
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Code | 1
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation | Code | 1
A Flexible Plug-and-Play Module for Generating Variable-Length | Code | 1
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios | Code | 1
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning | Code | 1
Selective Visual Prompting in Vision Mamba | Code | 1
Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals | Code | 1
Weighted Poisson-disk Resampling on Large-Scale Point Clouds | Code | 1
MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images | Code | 1
SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Code | 1
Can Modern LLMs Act as Agent Cores in Radiology Environments? | Code | 1
SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization | Code | 1
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark | Code | 1
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Code | 1
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Code | 1
OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs | Code | 1
CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning | Code | 1
Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Code | 1
Physics-Driven Autoregressive State Space Models for Medical Image Reconstruction | Code | 1
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model | Code | 1
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency | Code | 1
PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Code | 1
Page 382 of 9486