SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 64016450 of 661570 papers

TitleStatusHype
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System StrategiesCode2
Graph-Aware Isomorphic Attention for Adaptive Dynamics in TransformersCode2
Navigation Variable-based Multi-objective Particle Swarm Optimization for UAV Path Planning with Kinematic ConstraintsCode2
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose EstimationCode2
UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle ImageryCode2
VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo AlignmentCode2
PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware GroupingCode2
FLAME: Financial Large-Language Model Assessment and Metrics EvaluationCode2
Virgo: A Preliminary Exploration on Reproducing o1-like MLLMCode2
Metadata Conditioning Accelerates Language Model Pre-trainingCode2
Merging Context Clustering with Visual State Space Models for Medical Image SegmentationCode2
Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal LearningCode2
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual LocalizationCode2
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding ModelCode2
Click-Calib: A Robust Extrinsic Calibration Method for Surround-View SystemsCode2
RingFormer: A Neural Vocoder with Ring Attention and Convolution-Augmented TransformerCode2
High-Fidelity Lightweight Mesh Reconstruction from Point CloudsCode2
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic ResolutionCode2
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained AnalysisCode2
2.5 Years in Class: A Multimodal Textbook for Vision-Language PretrainingCode2
nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation BenchmarkCode2
Navigating Image Restoration with VAR's Distribution Alignment PriorCode2
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
ShiftwiseConv: Small Convolutional Kernel with Large Kernel EffectCode2
One-shot 3D Object Canonicalization based on Geometric and Semantic ConsistencyCode2
Adaptive Keyframe Sampling for Long Video UnderstandingCode2
MATCHA: Towards Matching AnythingCode2
MNE-SLAM: Multi-Agent Neural SLAM for Mobile RobotsCode2
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual PerceiverCode2
Structure-from-Motion with a Non-Parametric Camera ModelCode2
AutoPresent: Designing Structured Visuals from ScratchCode2
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with TransformerCode2
VoiceRestore: Flow-Matching Transformers for Speech Recording Quality RestorationCode2
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body ImagingCode2
TrustRAG: Enhancing Robustness and Trustworthiness in RAGCode2
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect DetectionCode2
Samba: A Unified Mamba-based Framework for General Salient Object DetectionCode2
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
RORem: Training a Robust Object Remover with Human-in-the-LoopCode2
RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented InstructionsCode2
Superposition in Transformers: A Novel Way of Building Mixture of ExpertsCode2
PyMilo: A Python Library for ML I/OCode2
Online Video Understanding: OVBench and VideoChat-OnlineCode2
MCP-Solver: Integrating Language Models with Constraint Programming SystemsCode2
Dual Diffusion for Unified Image Generation and UnderstandingCode2
Efficient Parallel Genetic Algorithm for Perturbed Substructure Optimization in Complex NetworkCode2
Varformer: Adapting VAR's Generative Prior for Image RestorationCode2
YOLO-UniOW: Efficient Universal Open-World Object DetectionCode2
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech RecognitionCode2
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing ControlCode2
Show:102550
← PrevPage 129 of 13232Next →