SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 73517400 of 661570 papers

TitleStatusHype
QAEncoder: Towards Aligned Representation Learning in Question Answering SystemCode2
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge AugmentationCode2
Spiking Transformer with Spatial-Temporal AttentionCode2
Effective Diffusion Transformer Architecture for Image Super-ResolutionCode2
Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and InterpolationCode2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
A Survey on Graph Neural Networks for Remaining Useful Life Prediction: Methodologies, Evaluation and Future TrendsCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image SegmentationCode2
CycleBNN: Cyclic Precision Training in Binary Neural NetworksCode2
MicroFlow: An Efficient Rust-Based Inference Engine for TinyMLCode2
1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024Code2
Epidemiology-Aware Neural ODE with Continuous Disease Transmission GraphCode2
Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal MaskingCode2
Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image RestorationCode2
Conditional Image Synthesis with Diffusion Models: A SurveyCode2
Cross-video Identity Correlating for Person Re-identification Pre-trainingCode2
Positional Encoder Graph Quantile Neural Networks for Geographic DataCode2
Do We Need Domain-Specific Embedding Models? An Empirical InvestigationCode2
YOLOv8-ResCBAM: YOLOv8 Based on An Effective Attention Module for Pediatric Wrist Fracture DetectionCode2
DualDn: Dual-domain Denoising via Differentiable ISPCode2
Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic ScenesCode2
A Survey on the Honesty of Large Language ModelsCode2
Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion PerspectiveCode2
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal FusionCode2
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D ReconstructionCode2
Control Industrial Automation System with Large Language Model AgentsCode2
Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price PredictionCode2
From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with ReflectionCode2
PGN: The RNN's New Successor is Effective for Long-Range Time Series ForecastingCode2
Revisit Anything: Visual Place Recognition via Image Segment RetrievalCode2
EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image SegmentationCode2
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity RefinerCode2
A Survey of Spatio-Temporal EEG data Analysis: from Models to ApplicationsCode2
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image GenerationCode2
Event-based Stereo Depth Estimation: A SurveyCode2
Neural Light Spheres for Implicit Image Stitching and View SynthesisCode2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionCode2
MaskLLM: Learnable Semi-Structured Sparsity for Large Language ModelsCode2
E.T. Bench: Towards Open-Ended Event-Level Video-Language UnderstandingCode2
Source-Free Domain Adaptation for YOLO Object DetectionCode2
ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and AnalysisCode2
General Detection-based Text Line RecognitionCode2
Statewide Visual Geolocalization in the WildCode2
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token ReductionCode2
Attention Prompting on Image for Large Vision-Language ModelsCode2
Progressive Representation Learning for Real-Time UAV TrackingCode2
Empirical Asset Pricing with Large Language Model AgentsCode2
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D DiffusionCode2
Show:102550
← PrevPage 148 of 13232Next →