SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 27512800 of 177339 papers

TitleStatusHype
Learning to Use Tools via Cooperative and Interactive AgentsCode3
WhisperNER: Unified Open Named Entity and Speech RecognitionCode3
Theory, Analysis, and Best Practices for Sigmoid Self-AttentionCode3
Fairness in Serving Large Language ModelsCode3
Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image RestorationCode3
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution ImagesCode3
Language Models are Few-Shot LearnersCode3
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D ModelsCode3
A Smart Multimodal Healthcare Copilot with Powerful LLM ReasoningCode3
TCFormer: Visual Recognition via Token Clustering TransformerCode3
TSI-Bench: Benchmarking Time Series ImputationCode3
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-ConstraintCode3
A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and LocalizationCode3
Seamless Human Motion Composition with Blended Positional EncodingsCode3
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image DeblurringCode3
LocalMamba: Visual State Space Model with Windowed Selective ScanCode3
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language ModelsCode3
Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena PerspectiveCode3
Event-Enhanced Blurry Video Super-ResolutionCode3
Generative Data Augmentation using LLMs improves Distributional Robustness in Question AnsweringCode3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference LearningCode3
A Survey of Large Language Models in Finance (FinLLMs)Code3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction TuningCode3
DM-VIO: Delayed Marginalization Visual-Inertial OdometryCode3
Semi-supervised Credit Card Fraud Detection via Attribute-Driven Graph RepresentationCode3
UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal PredictionCode3
Multimodal-Conditioned Latent Diffusion Models for Fashion Image EditingCode3
Separate Anything You DescribeCode3
TAP-Vid: A Benchmark for Tracking Any Point in a VideoCode3
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software ImprovementCode3
A Practical Probabilistic Benchmark for AI Weather ModelsCode3
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language ModelCode3
Vision-Language Models for Medical Report Generation and Visual Question Answering: A ReviewCode3
ALS-HAR: Harnessing Wearable Ambient Light Sensors to Enhance IMU-based Human Activity RecogntionCode3
PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning MethodsCode3
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion PriorsCode3
Metadata Embeddings for User and Item Cold-start RecommendationsCode3
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPTCode3
Fundus: A Simple-to-Use News Scraper Optimized for High Quality ExtractionsCode3
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image GenerationCode3
SVIT: Scaling up Visual Instruction TuningCode3
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive GenerationCode3
Scaling Transformers for Low-Bitrate High-Quality Speech CodingCode3
Towards Controllable Speech Synthesis in the Era of Large Language Models: A SurveyCode3
Learning Bipedal Walking for Humanoids with Current FeedbackCode3
Resolution-robust Large Mask Inpainting with Fourier ConvolutionsCode3
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance SegmentationCode3
AutoVFX: Physically Realistic Video Editing from Natural Language InstructionsCode3
MAGREF: Masked Guidance for Any-Reference Video GenerationCode3
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-SpeechCode3
Show:102550
← PrevPage 56 of 3547Next →