SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 68516900 of 661570 papers

TitleStatusHype
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual GroundingCode2
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction TuningCode2
Number it: Temporal Grounding Videos like Flipping MangaCode2
SymbolFit: Automatic Parametric Modeling with Symbolic RegressionCode2
M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image GenerationCode2
Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM eraCode2
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D ReconstructionCode2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic SegmentationCode2
MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report GenerationCode2
Squeezed Attention: Accelerating Long Context Length LLM InferenceCode2
Image Matching Filtering and Refinement by Planes and BeyondCode2
Golden Noise for Diffusion Models: A Learning FrameworkCode2
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary SegmentationCode2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language InterpretationCode2
Isotropic Correlation Models for the Cross-Section of Equity ReturnsCode2
A Short Note on Evaluating RepNet for Temporal Repetition Counting in VideosCode2
PyGen: A Collaborative Human-AI Approach to Python Package CreationCode2
Searching Latent Program SpacesCode2
BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View SynthesisCode2
MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields RepresentationCode2
OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Fused Geometric and Semantic GuidanceCode2
LogLLM: Log-based Anomaly Detection Using Large Language ModelsCode2
Graph Neural Networks in Supply Chain Analytics and Optimization: Concepts, Perspectives, Dataset and BenchmarksCode2
Deep Learning Accelerated Quantum Transport Simulations in Nanoelectronics: From Break Junctions to Field-Effect TransistorsCode2
Physics Informed Distillation for Diffusion ModelsCode2
V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising DiffusionCode2
Retrieval Augmented Time Series ForecastingCode2
GTA: Global Tracklet Association for Multi-Object Tracking in SportsCode2
Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet EncodingsCode2
RedCode: Risky Code Execution and Generation Benchmark for Code AgentsCode2
Tucano: Advancing Neural Text Generation for PortugueseCode2
DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution DetectionCode2
TIPO: Text to Image with Text Presampling for Prompt OptimizationCode2
Large Language Models Can Self-Improve in Long-context ReasoningCode2
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space modelsCode2
The Super Weight in Large Language ModelsCode2
StoryTeller: Improving Long Video Description through Global Audio-Visual Character IdentificationCode2
Token Merging for Training-Free Semantic Binding in Text-to-Image SynthesisCode2
ScaleKD: Strong Vision Transformers Could Be Excellent TeachersCode2
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web AgentsCode2
InvisMark: Invisible and Robust Watermarking for AI-generated Image ProvenanceCode2
Reaction-conditioned De Novo Enzyme Design with GENzymeCode2
Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic ResponseCode2
Community Research Earth Digital Intelligence Twin (CREDIT)Code2
Reliable-loc: Robust sequential LiDAR global localization in large-scale street scenes based on verifiable cuesCode2
Concept Bottleneck Language Models For protein designCode2
GFT: Graph Foundation Model with Transferable Tree VocabularyCode2
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 TasksCode2
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-AnsweringCode2
LLM-PySC2: Starcraft II learning environment for Large Language ModelsCode2
Show:102550
← PrevPage 138 of 13232Next →