SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 16601-16650 of 474278 papers

Title | Status | Hype
Window Token Concatenation for Efficient Visual Large Language Models | Code | 1
Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAVTarget Detection | Code | 1
A Survey of Pathology Foundation Model: Progress and Future Directions | Code | 1
Learning-Based Conformal Tube MPC for Safe Control in Interactive Multi-Agent Systems | Code | 1
OLAF: An Open Life Science Analysis Framework for Conversational Bioinformatics Powered by Large Language Models | Code | 1
Discovering Partially Known Ordinary Differential Equations: a Case Study on the Chemical Kinetics of Cellulose Degradation | Code | 1
Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition | Code | 1
SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding | Code | 1
Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking | Code | 1
Single-Pass Document Scanning for Question Answering | Code | 1
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models | Code | 1
Sparsity-Promoting Reachability Analysis and Optimization of Constrained Zonotopes | Code | 1
The AI Cosmologist I: An Agentic System for Automated Data Analysis | Code | 1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1
Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images | Code | 1
Monte Carlo Graph Coloring | Code | 1
Language Models Are Implicitly Continuous | Code | 1
Multi-Flow: Multi-View-Enriched Normalizing Flows for Industrial Anomaly Detection | Code | 1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Code | 1
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling | Code | 1
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation | Code | 1
A Physics-Informed Meta-Learning Framework for the Continuous Solution of Parametric PDEs on Arbitrary Geometries | Code | 1
ESC: Erasing Space Concept for Knowledge Deletion | Code | 1
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers | Code | 1
Narrative Studio: Visual narrative exploration using LLMs and Monte Carlo Tree Search | Code | 1
MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators | Code | 1
Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Code | 1
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment | Code | 1
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception | Code | 1
Large (Vision) Language Models are Unsupervised In-Context Learners | Code | 1
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation | Code | 1
Generative Evaluation of Complex Reasoning in Large Language Models | Code | 1
AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology | Code | 1
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model | Code | 1
Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement | Code | 1
F-ViTA: Foundation Model Guided Visible to Thermal Translation | Code | 1
MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs | Code | 1
Robustly identifying concepts introduced during chat fine-tuning using crosscoders | Code | 1
Do Two AI Scientists Agree? | Code | 1
Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition | Code | 1
Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline | Code | 1
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Code | 1
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Code | 1
GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings | Code | 1
MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities | Code | 1
Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation | Code | 1
TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection | Code | 1
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection | Code | 1
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies | Code | 1
Representation Bending for Large Language Model Safety | Code | 1
Page 333 of 9486