SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 98019850 of 661570 papers

TitleStatusHype
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail PredictionCode2
Play to Generalize: Learning to Reason Through Game PlayCode2
ChineseHarm-Bench: A Chinese Harmful Content Detection BenchmarkCode2
Curve-Aware Gaussian Splatting for 3D Parametric Curve ReconstructionCode2
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation BoosterCode2
LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential RecommendationCode2
TESS 2: A Large-Scale Generalist Diffusion Language ModelCode2
Learning a Decision Tree Algorithm with TransformersCode2
On the Arbitrary-Oriented Object Detection: Classification based Approaches RevisitedCode2
Speaker-change Aware CRF for Dialogue Act ClassificationCode2
MMFashion: An Open-Source Toolbox for Visual Fashion AnalysisCode2
Point2Mesh: A Self-Prior for Deformable MeshesCode2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement LearningCode2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision SupportCode2
GreaseLM: Graph REASoning Enhanced Language Models for Question AnsweringCode2
EvoJAX: Hardware-Accelerated NeuroevolutionCode2
LCCDE: A Decision-Based Ensemble Framework for Intrusion Detection in The Internet of VehiclesCode2
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field InversionCode2
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)Code2
DETR Does Not Need Multi-Scale or Locality DesignCode2
Reconstructing Animatable Categories from VideosCode2
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERTCode2
SE(3) diffusion model with application to protein backbone generationCode2
GLAP: General contrastive audio-text pretraining across domains and languagesCode2
TimeZero: Temporal Video Grounding with Reasoning-Guided LVLMCode2
Rethinking Benchmark and Contamination for Language Models with Rephrased SamplesCode2
PG-Video-LLaVA: Pixel Grounding Large Video-Language ModelsCode2
QuIP: 2-Bit Quantization of Large Language Models With GuaranteesCode2
Machine Mindset: An MBTI Exploration of Large Language ModelsCode2
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph TransformersCode2
Subobject-level Image TokenizationCode2
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake AudioCode2
Large Language Models Must Be Taught to Know What They Don't KnowCode2
Text2Robot: Evolutionary Robot Design from Text DescriptionsCode2
Towards Reasoning in Large Language Models: A SurveyCode2
Shadow Generation for Composite Image Using Diffusion modelCode2
Universal Narrative Model: an Author-centric Storytelling Framework for Generative AICode2
REBEL: Reinforcement Learning via Regressing Relative RewardsCode2
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware SparsityCode2
QuEST: Stable Training of LLMs with 1-Bit Weights and ActivationsCode2
Without Paired Labeled Data: An End-to-End Self-Supervised Paradigm for UAV-View Geo-LocalizationCode2
MC-LLaVA: Multi-Concept Personalized Vision-Language ModelCode2
TC-RAG:Turing-Complete RAG's Case study on Medical LLM SystemsCode2
Hacking CTFs with Plain AgentsCode2
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?Code2
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity PreservationCode2
VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic DatasetCode2
VerilogEval: Evaluating Large Language Models for Verilog Code GenerationCode2
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free LunchCode2
Finding Transformer Circuits with Edge PruningCode2
Show:102550
← PrevPage 197 of 13232Next →