SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1370113750 of 474278 papers

TitleStatusHype
Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder ModelsCode2
Diffusion Models Beat GANs on Image SynthesisCode2
Towards Stable Test-Time Adaptation in Dynamic Wild WorldCode2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender SystemsCode2
Measuring Style Similarity in Diffusion ModelsCode2
LangBridge: Multilingual Reasoning Without Multilingual SupervisionCode2
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future OpportunitiesCode2
LEACE: Perfect linear concept erasure in closed formCode2
SEBERTNets: Sequence Enhanced BERT Networks for Event Entity Extraction Tasks Oriented to the Finance FieldCode2
Graph-enhanced Large Language Models in Asynchronous Plan ReasoningCode2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian EvaluationCode2
An OpenMind for 3D medical vision self-supervised learningCode2
Modality-Independent Graph Neural Networks with Global Transformers for Multimodal RecommendationCode2
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language ModelsCode2
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh OptimizationCode2
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video GenerationCode2
MathOptAI.jl: Embed trained machine learning predictors into JuMP modelsCode2
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space ModelCode2
DETRPose: Real-time end-to-end transformer model for multi-person pose estimationCode2
On the Role of Attention Heads in Large Language Model SafetyCode2
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space ModelCode2
All-in-one foundational models learning across quantum chemical levelsCode2
Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor controlCode2
Simplified and Generalized Masked Diffusion for Discrete DataCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
TextAtlas5M: A Large-scale Dataset for Dense Text Image GenerationCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference OptimizationCode2
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical ModalitiesCode2
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated ConvolutionsCode2
FG^2: Fine-Grained Cross-View Localization by Fine-Grained Feature MatchingCode2
Open-Vocabulary DETR with Conditional MatchingCode2
BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View SynthesisCode2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization AlignmentCode2
CompassJudger-2: Towards Generalist Judge Model via Verifiable RewardsCode2
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level LossCode2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge ComputingCode2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything ModelCode2
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale DatasetCode2
ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language ModelsCode2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic DataCode2
Multiview Scene GraphCode2
MovieBench: A Hierarchical Movie Level Dataset for Long Video GenerationCode2
N-HiTS: Neural Hierarchical Interpolation for Time Series ForecastingCode2
DepMamba: Progressive Fusion Mamba for Multimodal Depression DetectionCode2
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language ModelsCode2
Arbitrary-Scale Video Super-Resolution with Structural and Textural PriorsCode2
DiMeR: Disentangled Mesh Reconstruction ModelCode2
Can Large Language Model Agents Simulate Human Trust Behavior?Code2
TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree SequencingCode2
Show:102550
← PrevPage 275 of 9486Next →