SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 9651–9700 of 177340 papers

Title | Status | Hype
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Code | 2
LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation | Code | 2
TESS 2: A Large-Scale Generalist Diffusion Language Model | Code | 2
Learning a Decision Tree Algorithm with Transformers | Code | 2
On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited | Code | 2
Speaker-change Aware CRF for Dialogue Act Classification | Code | 2
MMFashion: An Open-Source Toolbox for Visual Fashion Analysis | Code | 2
Point2Mesh: A Self-Prior for Deformable Meshes | Code | 2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Code | 2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Code | 2
GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Code | 2
EvoJAX: Hardware-Accelerated Neuroevolution | Code | 2
LCCDE: A Decision-Based Ensemble Framework for Intrusion Detection in The Internet of Vehicles | Code | 2
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | Code | 2
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint) | Code | 2
DETR Does Not Need Multi-Scale or Locality Design | Code | 2
Reconstructing Animatable Categories from Videos | Code | 2
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT | Code | 2
SE(3) diffusion model with application to protein backbone generation | Code | 2
GLAP: General contrastive audio-text pretraining across domains and languages | Code | 2
TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM | Code | 2
Rethinking Benchmark and Contamination for Language Models with Rephrased Samples | Code | 2
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models | Code | 2
QuIP: 2-Bit Quantization of Large Language Models With Guarantees | Code | 2
Machine Mindset: An MBTI Exploration of Large Language Models | Code | 2
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers | Code | 2
Subobject-level Image Tokenization | Code | 2
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio | Code | 2
Large Language Models Must Be Taught to Know What They Don't Know | Code | 2
Text2Robot: Evolutionary Robot Design from Text Descriptions | Code | 2
Towards Reasoning in Large Language Models: A Survey | Code | 2
Shadow Generation for Composite Image Using Diffusion model | Code | 2
Universal Narrative Model: an Author-centric Storytelling Framework for Generative AI | Code | 2
REBEL: Reinforcement Learning via Regressing Relative Rewards | Code | 2
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity | Code | 2
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations | Code | 2
Without Paired Labeled Data: An End-to-End Self-Supervised Paradigm for UAV-View Geo-Localization | Code | 2
MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Code | 2
TC-RAG:Turing-Complete RAG's Case study on Medical LLM Systems | Code | 2
Hacking CTFs with Plain Agents | Code | 2
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Code | 2
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Code | 2
VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset | Code | 2
VerilogEval: Evaluating Large Language Models for Verilog Code Generation | Code | 2
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch | Code | 2
Finding Transformer Circuits with Edge Pruning | Code | 2
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Code | 2
LoFormer: Local Frequency Transformer for Image Deblurring | Code | 2
RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements | Code | 2
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving | Code | 2