SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5801 to 5850 of 661,570 papers

Title | Status | Hype
SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Code | 2
ChainerCV: a Library for Deep Learning in Computer Vision | Code | 2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Code | 2
CenterNet++ for Object Detection | Code | 2
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models | Code | 2
Conformal Symplectic Optimization for Stable Reinforcement Learning | Code | 2
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Code | 2
STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion | Code | 2
LongReward: Improving Long-context Large Language Models with AI Feedback | Code | 2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey | Code | 2
Deformable One-shot Face Stylization via DINO Semantic Guidance | Code | 2
ProcessPainter: Learn Painting Process from Sequence Data | Code | 2
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation | Code | 2
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Code | 2
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Code | 2
Learning to Compress Prompts with Gist Tokens | Code | 2
TRADES: Generating Realistic Market Simulations with Diffusion Models | Code | 2
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals | Code | 2
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Code | 2
PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction | Code | 2
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference | Code | 2
Heterogeneous Multi-Robot Reinforcement Learning | Code | 2
DETR Doesn't Need Multi-Scale or Locality Design | Code | 2
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Code | 2
Segment and Caption Anything | Code | 2
Attention as a Hypernetwork | Code | 2
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions | Code | 2
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Code | 2
Ontology Embedding: A Survey of Methods, Applications and Resources | Code | 2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Code | 2
Scaling Diffusion Transformers Efficiently via μP | Code | 2
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Code | 2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond | Code | 2
ViTs for SITS: Vision Transformers for Satellite Image Time Series | Code | 2
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Code | 2
LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings | Code | 2
Optimal Flow Matching: Learning Straight Trajectories in Just One Step | Code | 2
DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences | Code | 2
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Code | 2
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin | Code | 2
LangProp: A code optimization framework using Large Language Models applied to driving | Code | 2
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding | Code | 2
GrootVL: Tree Topology is All You Need in State Space Model | Code | 2
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning | Code | 2
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | Code | 2
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models | Code | 2
Towards Evaluating and Building Versatile Large Language Models for Medicine | Code | 2
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Code | 2
Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis | Code | 2
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms | Code | 2
Page 117 of 13,232