SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 94019450 of 661570 papers

TitleStatusHype
PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human ModelingCode2
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object TrackingCode2
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real WorldCode2
Omni-Kernel Network for Image RestorationCode2
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR SummarizationCode2
Towards Large-Scale Training of Pathology Foundation ModelsCode2
Space Group Informed Transformer for Crystalline Materials GenerationCode2
Adaptive Super Resolution For One-Shot Talking-Head GenerationCode2
In-Context MattingCode2
An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated LearningCode2
Neural Plasticity-Inspired Multimodal Foundation Model for Earth ObservationCode2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based RetrieversCode2
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse KernelsCode2
MedPromptX: Grounded Multimodal Prompting for Chest X-ray DiagnosisCode2
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow InstructionsCode2
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal ModelsCode2
InterFusion: Text-Driven Generation of 3D Human-Object InteractionCode2
Transfer CLIP for Generalizable Image DenoisingCode2
LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementCode2
YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging GeometriesCode2
Addressing Concept Shift in Online Time Series Forecasting: Detect-then-AdaptCode2
Construction of a Japanese Financial Benchmark for Large Language ModelsCode2
Shadow Generation for Composite Image Using Diffusion modelCode2
MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly DetectionCode2
SoftPatch: Unsupervised Anomaly Detection with Noisy DataCode2
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera NetworkCode2
Volumetric Environment Representation for Vision-Language NavigationCode2
Protein Conformation Generation via Force-Guided SE(3) Diffusion ModelsCode2
AutoRE: Document-Level Relation Extraction with Large Language ModelsCode2
SyncTweedies: A General Generative Framework Based on Synchronized DiffusionsCode2
Understanding the Ranking Loss for Recommendation with Sparse User FeedbackCode2
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion ModelsCode2
Model Uncertainty in Evolutionary Optimization and Bayesian Optimization: A Comparative AnalysisCode2
Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-LocalizationCode2
SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural NetworksCode2
Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy DataCode2
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object TrackingCode2
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language ModelsCode2
Certified Human Trajectory PredictionCode2
Modeling the Label Distributions for Weakly-Supervised Semantic SegmentationCode2
vid-TLDR: Training Free Token merging for Light-weight Video TransformerCode2
AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent BehaviorCode2
RAR: Retrieving And Ranking Augmented MLLMs for Visual RecognitionCode2
SocialBench: Sociality Evaluation of Role-Playing Conversational AgentsCode2
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration TransducerCode2
Nellie: Automated organelle segmentation, tracking, and hierarchical feature extraction in 2D/3D live-cell microscopyCode2
PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual PatternsCode2
DanceCamera3D: 3D Camera Movement Synthesis with Music and DanceCode2
Diversified and Personalized Multi-rater Medical Image SegmentationCode2
eRST: A Signaled Graph Theory of Discourse Relations and OrganizationCode2
Show:102550
← PrevPage 189 of 13232Next →