SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 77517800 of 661570 papers

TitleStatusHype
DaCapo: a modular deep learning framework for scalable 3D image segmentationCode2
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent SystemsCode2
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and RecognitionCode2
VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical KnowledgeCode2
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language ModelsCode2
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language ModelsCode2
radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave RadarCode2
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive ModelingCode2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic SegmentationCode2
Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image EnhancementCode2
CFBench: A Comprehensive Constraints-Following Benchmark for LLMsCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuningCode2
MESA: Effective Matching Redundancy Reduction by Semantic Area SegmentationCode2
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary SegmentationCode2
Towards Reliable Advertising Image Generation Using Human FeedbackCode2
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion ModelsCode2
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language ModelsCode2
Segment anything model 2: an application to 2D and 3D medical imagesCode2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language ModelCode2
Tamper-Resistant Safeguards for Open-Weight LLMsCode2
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of AttentionCode2
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization MethodsCode2
CAMAv2: A Vision-Centric Approach for Static Map Element AnnotationCode2
Detecting, Explaining, and Mitigating Memorization in Diffusion ModelsCode2
RainMamba: Enhanced Locality Learning with State Space Models for Video DerainingCode2
MetaOpenFOAM: an LLM-based multi-agent framework for CFDCode2
MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory PredictionCode2
MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation FrameworkCode2
MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected TrainingCode2
Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AICode2
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM AgentCode2
MSA^2Net: Multi-scale Adaptive Attention-guided Network for Medical Image SegmentationCode2
H-Watch: An Open, Connected Platform for AI-Enhanced COVID19 Infection Symptoms Monitoring and Contact TracingCode2
Accelerating Image Super-Resolution Networks with Pixel-Level ClassificationCode2
Revisiting Tampered Scene Text Detection in the Era of Generative AICode2
Maverick: Efficient and Accurate Coreference Resolution Defying Recent TrendsCode2
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement LearningCode2
Zero Shot Health Trajectory Prediction Using TransformerCode2
Interpretable Pre-Trained Transformers for Heart Time-Series DataCode2
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language ModelsCode2
Palu: Compressing KV-Cache with Low-Rank ProjectionCode2
Machine Unlearning in Generative AI: A SurveyCode2
Autonomous Improvement of Instruction Following Skills via Foundation ModelsCode2
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal ControlsCode2
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional RepresentationCode2
XHand: Real-time Expressive Hand AvatarCode2
Efficient Face Super-Resolution via Wavelet-based Feature Enhancement NetworkCode2
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local SimilaritiesCode2
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning ProcessCode2
Show:102550
← PrevPage 156 of 13232Next →