SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1050110550 of 661570 papers

TitleStatusHype
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP ModelsCode2
Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with PerspectivesCode2
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language ModelsCode2
Jailbreaking Attack against Multimodal Large Language ModelCode2
IFRNet: Intermediate Feature Refine Network for Efficient Frame InterpolationCode2
AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker SimulationCode2
PromptIR: Prompting for All-in-One Blind Image RestorationCode2
Convolutional Neural Operators for robust and accurate learning of PDEsCode2
Grappa -- A Machine Learned Molecular Mechanics Force FieldCode2
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language VariantsCode2
A Machine Learning Approach That Beats Large Rubik's CubesCode2
Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized MappingCode2
CAPO: Cost-Aware Prompt OptimizationCode2
BakedAvatar: Baking Neural Fields for Real-Time Head Avatar SynthesisCode2
MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular BackbonesCode2
Artificial Intelligence of Things: A SurveyCode2
BianCang: A Traditional Chinese Medicine Large Language ModelCode2
Fast Dynamic Radiance Fields with Time-Aware Neural VoxelsCode2
Automatically Bounding the Taylor Remainder Series: Tighter Bounds and New ApplicationsCode2
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural AnnotationsCode2
Fraud Dataset Benchmark and ApplicationsCode2
Video-STaR: Self-Training Enables Video Instruction Tuning with Any SupervisionCode2
DeBERTa: Decoding-enhanced BERT with Disentangled AttentionCode2
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language ModelsCode2
Streaming Active Learning with Deep Neural NetworksCode2
StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing TranslationCode2
Frequency-Adaptive Dilated Convolution for Semantic SegmentationCode2
LIR-LIVO: A Lightweight,Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep FeaturesCode2
On Meta-PromptingCode2
Reducing Hallucinations in Vision-Language Models via Latent Space SteeringCode2
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D ScenesCode2
Mapping Global Floods with 10 Years of Satellite Radar DataCode2
EchoTracker: Advancing Myocardial Point Tracking in EchocardiographyCode2
Getting it Right: Improving Spatial Consistency in Text-to-Image ModelsCode2
Compression Represents Intelligence LinearlyCode2
Image Inversion: A Survey from GANs to Diffusion and BeyondCode2
CoSER: Coordinating LLM-Based Persona Simulation of Established RolesCode2
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene UnderstandingCode2
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete LatentsCode2
PatternRank: Leveraging Pretrained Language Models and Part of Speech for Unsupervised Keyphrase ExtractionCode2
Snuffy: Efficient Whole Slide Image ClassifierCode2
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation ModelsCode2
Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation LearningCode2
TreeRL: LLM Reinforcement Learning with On-Policy Tree SearchCode2
CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text LabelsCode2
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse AutoencodersCode2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature KernelCode2
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question AnsweringCode2
SOTOPIA-π: Interactive Learning of Socially Intelligent Language AgentsCode2
Towards Localized Fine-Grained Control for Facial Expression GenerationCode2
Show:102550
← PrevPage 211 of 13232Next →