SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1180111850 of 474278 papers

TitleStatusHype
Transformer tricks: Removing weights for skipless transformersCode2
Hyper-3DG: Text-to-3D Gaussian Generation via HypergraphCode2
Listen, Think, and UnderstandCode2
Pre-training Differentially Private Models with Limited Public DataCode2
Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine SamplingCode2
BARS: Towards Open Benchmarking for Recommender SystemsCode2
RAP: Retrieval-Augmented Personalization for Multimodal Large Language ModelsCode2
Optimal Invariant Bases for Atomistic Machine LearningCode2
Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image RegistrationCode2
UNETR++: Delving into Efficient and Accurate 3D Medical Image SegmentationCode2
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video UnderstandingCode2
Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning MambaCode2
Light and Optimal Schrödinger Bridge MatchingCode2
Fuzz4All: Universal Fuzzing with Large Language ModelsCode2
VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly DetectionCode2
ProLLM: Protein Chain-of-Thoughts Enhanced LLM for Protein-Protein Interaction PredictionCode2
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden StatesCode2
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein DesignCode2
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined LevelsCode2
TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of ExpertsCode2
The Chosen One: Consistent Characters in Text-to-Image Diffusion ModelsCode2
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence UnderstandingCode2
MovieChat: From Dense Token to Sparse Memory for Long Video UnderstandingCode2
Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal LearningCode2
AlignBench: Benchmarking Chinese Alignment of Large Language ModelsCode2
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote SensingCode2
EMBER2024 -- A Benchmark Dataset for Holistic Evaluation of Malware ClassifiersCode2
Why are Visually-Grounded Language Models Bad at Image Classification?Code2
ICASSP 2023 Acoustic Echo Cancellation ChallengeCode2
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking CapabilitiesCode2
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and PruningCode2
ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light ImagesCode2
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement LearningCode2
Context-Guided Spatial Feature Reconstruction for Efficient Semantic SegmentationCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and PlanningCode2
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot CoordinationCode2
BrainMorph: A Foundational Keypoint Model for Robust and Flexible Brain MRI RegistrationCode2
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous DrivingCode2
Pose for Everything: Towards Category-Agnostic Pose EstimationCode2
Neural Optimal TransportCode2
Fast Context-Based Low-Light Image Enhancement via Neural Implicit RepresentationsCode2
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
What does a platypus look like? Generating customized prompts for zero-shot image classificationCode2
Towards Zero-shot Point Cloud Anomaly Detection: A Multi-View Projection FrameworkCode2
Skeleton-free Pose Transfer for Stylized 3D CharactersCode2
Ambiguous Medical Image Segmentation using Diffusion ModelsCode2
VQA^2: Visual Question Answering for Video Quality AssessmentCode2
PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout GenerationCode2
Class-Incremental Learning: A SurveyCode2
Show:102550
← PrevPage 237 of 9486Next →