SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 47514800 of 177340 papers

TitleStatusHype
ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localizationCode2
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented GenerationCode2
Tuning Language Models by ProxyCode2
iFormer: Integrating ConvNet and Transformer for Mobile ApplicationCode2
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect DetectionCode2
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase IdentificationCode2
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language ModelsCode2
GrounDiT: Grounding Diffusion Transformers via Noisy Patch TransplantationCode2
Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at ScaleCode2
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the KeyCode2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AICode2
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMsCode2
Mitigating Object Hallucination via Concentric Causal AttentionCode2
Differential TransformerCode2
Bridging the Gap Between End-to-End and Two-Step Text SpottingCode2
Degradation-Aware Feature Perturbation for All-in-One Image RestorationCode2
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule GenerationCode2
Unicom: Universal and Compact Representation Learning for Image RetrievalCode2
SkateFormer: Skeletal-Temporal Transformer for Human Action RecognitionCode2
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech DetectionCode2
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel MethodCode2
Golden Cudgel Network for Real-Time Semantic SegmentationCode2
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model SizesCode2
Agent Attention: On the Integration of Softmax and Linear AttentionCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Scene Adaptive Sparse Transformer for Event-based Object DetectionCode2
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationCode2
Optimizing Large Language Models for OpenAPI Code CompletionCode2
Preference Alignment with Flow MatchingCode2
InstructUIE: Multi-task Instruction Tuning for Unified Information ExtractionCode2
Scaling Transformer to 1M tokens and beyond with RMTCode2
Occupancy as Set of PointsCode2
LangCoop: Collaborative Driving with LanguageCode2
PlanT: Explainable Planning Transformers via Object-Level RepresentationsCode2
Measuring and Narrowing the Compositionality Gap in Language ModelsCode2
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware PlatformsCode2
GrootVL: Tree Topology is All You Need in State Space ModelCode2
ViTs for SITS: Vision Transformers for Satellite Image Time SeriesCode2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic DirectionsCode2
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input FeaturesCode2
ProcessPainter: Learn Painting Process from Sequence DataCode2
ChainerCV: a Library for Deep Learning in Computer VisionCode2
Surg-3M: A Dataset and Foundation Model for Perception in Surgical SettingsCode2
Graph Diffusion Transformers for Multi-Conditional Molecular GenerationCode2
When and why vision-language models behave like bags-of-words, and what to do about it?Code2
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency ModelsCode2
FlexiDreamer: Single Image-to-3D Generation with FlexiCubesCode2
USP: Unified Self-Supervised Pretraining for Image Generation and UnderstandingCode2
Alpha^2: Discovering Logical Formulaic Alphas using Deep Reinforcement LearningCode2
FastInst: A Simple Query-Based Model for Real-Time Instance SegmentationCode2
Show:102550
← PrevPage 96 of 3547Next →