SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 56515700 of 177340 papers

TitleStatusHype
BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 LanguagesCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
Source-free Subject Adaptation for EEG-based Visual RecognitionCode2
HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden StatesCode2
Training-Free Adaptive Diffusion with Bounded Difference Approximation StrategyCode2
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image GenerationCode2
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense PredictionCode2
Order Constraints in Optimal TransportCode2
Real-time Scene Text Detection with Differentiable BinarizationCode2
An Image is Worth 16x16 Words: Transformers for Image Recognition at ScaleCode2
VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo AlignmentCode2
Hopular: Modern Hopfield Networks for Tabular DataCode2
TOD3Cap: Towards 3D Dense Captioning in Outdoor ScenesCode2
Improving the Training of Rectified FlowsCode2
A Systematic Study of Joint Representation Learning on Protein Sequences and StructuresCode2
Evaluating the Performance of Large Language Models on GAOKAO BenchmarkCode2
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language modelsCode2
On the Origin of Llamas: Model Tree Heritage RecoveryCode2
GPT-NER: Named Entity Recognition via Large Language ModelsCode2
Interpretable and Generalizable Graph Learning via Stochastic Attention MechanismCode2
AST-T5: Structure-Aware Pretraining for Code Generation and UnderstandingCode2
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and LocalizationCode2
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering SupervisionCode2
MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMsCode2
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?Code2
PDE-Transformer: Efficient and Versatile Transformers for Physics SimulationsCode2
Box-supervised Instance Segmentation with Level Set EvolutionCode2
AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image ClassificationCode2
Deep PCB To COCO ConvertorCode2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-InterestCode2
Scaling Video-Language Models to 10K Frames via Hierarchical Differential DistillationCode2
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision GeneralistsCode2
TTS-GAN: A Transformer-based Time-Series Generative Adversarial NetworkCode2
Complex Embeddings for Simple Link PredictionCode2
Diffusion-based Generation, Optimization, and Planning in 3D ScenesCode2
Federated Learning with New Knowledge: Fundamentals, Advances, and FuturesCode2
ZnTrack -- Data as CodeCode2
Scaling Relationship on Learning Mathematical Reasoning with Large Language ModelsCode2
One-Step Diffusion Distillation through Score Implicit MatchingCode2
StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street ViewsCode2
AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot MovementsCode2
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMsCode2
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image SynthesisCode2
mAIstro: an open-source multi-agentic system for automated end-to-end development of radiomics and deep learning models for medical imagingCode2
H-vmunet: High-order Vision Mamba UNet for Medical Image SegmentationCode2
LUCY: Linguistic Understanding and Control Yielding Early Stage of HerCode2
EDTER: Edge Detection with TransformerCode2
Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact SimulationCode2
6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face GeometryCode2
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIPCode2
Show:102550
← PrevPage 114 of 3547Next →