SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1300113050 of 474278 papers

TitleStatusHype
GTA1: GUI Test-time Scaling AgentCode2
TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision0
Predicting Graph Structure via Adapted Flux Balance AnalysisCode0
What You Have is What You Track: Adaptive and Robust Multimodal TrackingCode0
Unconditional Diffusion for Generative Sequential RecommendationCode0
CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal RepresentationsCode0
Chat-Ghosting: A Comparative Study of Methods for Auto-Completion in Dialog Systems0
ReLayout: Integrating Relation Reasoning for Content-aware Layout Generation with Multi-modal Large Language Models0
Enhancing Scientific Visual Question Answering through Multimodal Reasoning and Ensemble Modeling0
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation0
RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation0
Automated Neuron Labelling Enables Generative Steering and Interpretability in Protein Language ModelsCode0
Robust One-step Speech Enhancement via Consistency DistillationCode1
PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization0
EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow0
Contrastive and Transfer Learning for Effective Audio Fingerprinting through a Real-World Evaluation Protocol0
Skywork-R1V3 Technical ReportCode7
eegFloss: A Python package for refining sleep EEG recordings using machine learning modelsCode1
Accelerating GenAI Workloads by Enabling RISC-V Microkernel Support in IREE0
Text Detoxification: Data Efficiency, Semantic Preservation and Model GeneralizationCode0
VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents0
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding0
LAID: Lightweight AI-Generated Image Detection in Spatial and Spectral DomainsCode0
StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling0
DeepCS-TRD, a Deep Learning-based Cross-Section Tree Ring DetectorCode0
TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generationCode0
Efficient Unlearning with Privacy GuaranteesCode0
LCDS: A Logic-Controlled Discharge Summary Generation System Supporting Source Attribution and Expert ReviewCode0
Geometric-Guided Few-Shot Dental Landmark Detection with Human-Centric Foundation ModelCode0
Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy GradingCode0
Robust Incomplete-Modality Alignment for Ophthalmic Disease Grading and Diagnosis via Labeled Optimal TransportCode0
Going Beyond Heuristics by Imposing Policy Improvement as a ConstraintCode0
LumiCRS: Asymmetric Contrastive Prototype Learning for Long-Tail Conversational Movie Recommendation0
GIST: Cross-Domain Click-Through Rate Prediction via Guided Content-Behavior Distillation0
TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation0
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning0
PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes0
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer0
ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation0
Real-Time Graph-based Point Cloud Networks on FPGAs via Stall-Free Deep PipeliningCode0
Disappearing Ink: Obfuscation Breaks N-gram Code Watermarks in Theory and Practice0
MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents0
Hierarchical Intent-guided Optimization with Pluggable LLM-Driven Semantics for Session-based RecommendationCode0
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions0
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document RestorationCode2
Activation Steering for Chain-of-Thought CompressionCode0
Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning0
2048: Reinforcement Learning in a Delayed Reward Environment0
pFedMMA: Personalized Federated Fine-Tuning with Multi-Modal Adapter for Vision-Language ModelsCode0
FindRec: Stein-Guided Entropic Flow for Multi-Modal Sequential RecommendationCode1
Show:102550
← PrevPage 261 of 9486Next →