SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 80018050 of 661570 papers

TitleStatusHype
FairDiff: Fair Segmentation with Point-Image DiffusionCode2
Video-STaR: Self-Training Enables Video Instruction Tuning with Any SupervisionCode2
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 LanguagesCode2
SOLO: A Single Transformer for Scalable Vision-Language ModelingCode2
LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound VideosCode2
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight GenerationCode2
MEEG and AT-DGNN: Improving EEG Emotion Recognition with Music Introducing and Graph-based LearningCode2
WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question AnsweringCode2
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion ModelsCode2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie WorksheetsCode2
PsycoLLM: Enhancing LLM for Psychological Understanding and EvaluationCode2
Training-free CryoET Tomogram SegmentationCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvementCode2
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion RecognitionCode2
Language Representations Can be What Recommenders Need: Findings and PotentialsCode2
See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of DecompositionCode2
Just read twice: closing the recall gap for recurrent language modelsCode2
P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point CloudsCode2
HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient TuningCode2
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language ModelsCode2
MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific UnderstandingCode2
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical QuestionsCode2
SCSA: Exploring the Synergistic Effects Between Spatial and Channel AttentionCode2
Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Bridge Diffusion ModelCode2
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language ModelsCode2
Associative Recurrent Memory TransformerCode2
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and TransportationCode2
PartCraft: Crafting Creative Objects by PartsCode2
Isomorphic Pruning for Vision ModelsCode2
Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic UnitsCode2
RPN: Reconciled Polynomial Network Towards Unifying PGMs, Kernel SVMs, MLP and KANCode2
Discovering symbolic expressions with parallelized tree searchCode2
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detectionCode2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing IndustryCode2
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic ManipulationCode2
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-ResourceCode2
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsCode2
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual ManipulationCode2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the WildCode2
Occupancy as Set of PointsCode2
MiniGPT-Med: Large Language Model as a General Interface for Radiology DiagnosisCode2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language ModelsCode2
Craftium: An Extensible Framework for Creating Reinforcement Learning EnvironmentsCode2
Benchmarking Complex Instruction-Following with Multiple Constraints CompositionCode2
Mixture of A Million ExpertsCode2
Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for ChemistryCode2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image ClassificationCode2
Show:102550
← PrevPage 161 of 13232Next →