SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 92019250 of 661570 papers

TitleStatusHype
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM JailbreakersCode2
Numerical Association Rule Mining: A Systematic Literature ReviewCode2
SciAssess: Benchmarking LLM Proficiency in Scientific Literature AnalysisCode2
Deep Constrained Least Squares for Blind Image Super-ResolutionCode2
Beyond Accuracy: Behavioral Testing of NLP models with CheckListCode2
Unified Contrastive Learning in Image-Text-Label SpaceCode2
Monitoring and explainability of models in productionCode2
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language ModelsCode2
Shift-ConvNets: Small Convolutional Kernel with Large Kernel EffectsCode2
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language ModelsCode2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation ModelCode2
M^3-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and DiscoveryCode2
JoJoGAN: One Shot Face StylizationCode2
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object DetectionCode2
Reinforcing General Reasoning without VerifiersCode2
pyRDF2Vec: A Python Implementation and Extension of RDF2VecCode2
Search Arena: Analyzing Search-Augmented LLMsCode2
R3M: A Universal Visual Representation for Robot ManipulationCode2
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured EnvironmentsCode2
UV-free Texture Generation with Denoising and Geodesic Heat DiffusionsCode2
From Tiny Machine Learning to Tiny Deep Learning: A SurveyCode2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation GenerationCode2
Unishox: A hybrid encoder for Short Unicode StringsCode2
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion UsersCode2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D GenerationCode2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
A Novel Plug-in Module for Fine-Grained Visual ClassificationCode2
Tevatron: An Efficient and Flexible Toolkit for Dense RetrievalCode2
SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAMCode2
CLRNet: Cross Layer Refinement Network for Lane DetectionCode2
REGTR: End-to-end Point Cloud Correspondences with TransformersCode2
Can LLMs Follow Simple Rules?Code2
GIT: A Generative Image-to-text Transformer for Vision and LanguageCode2
URetinex-Net: Retinex-Based Deep Unfolding Network for Low-Light Image EnhancementCode2
WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large ModelsCode2
AnimeSR: Learning Real-World Super-Resolution Models for Animation VideosCode2
Scale-Aware Trident Networks for Object DetectionCode2
PrefixQuant: Eliminating Outliers by Prefixed Tokens for Large Language Models QuantizationCode2
Protein-to-genome alignment with miniprotCode2
Dialogue Learning With Human-In-The-LoopCode2
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis ConstraintsCode2
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and GenerationCode2
SEED: A Simple and Effective 3D DETR in Point CloudsCode2
ExBEHRT: Extended Transformer for Electronic Health Records to Predict Disease Subtypes & ProgressionsCode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-AssignmentCode2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series ForecastingCode2
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion ModelsCode2
Generalized Portrait Quality AssessmentCode2
Benchmarking Potential Based Rewards for Learning Humanoid LocomotionCode2
Show:102550
← PrevPage 185 of 13232Next →