SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4751-4775 of 177340 papers

Title | Status | Hype
ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Code | 2
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Code | 2
Tuning Language Models by Proxy | Code | 2
iFormer: Integrating ConvNet and Transformer for Mobile Application | Code | 2
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection | Code | 2
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Code | 2
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Code | 2
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation | Code | 2
Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale | Code | 2
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Code | 2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Code | 2
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs | Code | 2
Mitigating Object Hallucination via Concentric Causal Attention | Code | 2
Differential Transformer | Code | 2
Bridging the Gap Between End-to-End and Two-Step Text Spotting | Code | 2
Degradation-Aware Feature Perturbation for All-in-One Image Restoration | Code | 2
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation | Code | 2
Unicom: Universal and Compact Representation Learning for Image Retrieval | Code | 2
SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition | Code | 2
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection | Code | 2
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method | Code | 2
Golden Cudgel Network for Real-Time Semantic Segmentation | Code | 2
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes | Code | 2
Agent Attention: On the Integration of Softmax and Linear Attention | Code | 2
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Code | 2
Page 191 of 7094