SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 58515900 of 177340 papers

TitleStatusHype
A Unified Model for Multi-class Anomaly DetectionCode2
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time AdaptationCode2
Spurious Forgetting in Continual Learning of Language ModelsCode2
NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the WildCode2
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier TransformCode2
Joint Spatio-Temporal Modeling for the Semantic Change Detection in Remote Sensing ImagesCode2
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)Code2
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video CaptioningCode2
DiffIR: Efficient Diffusion Model for Image RestorationCode2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
Grounding Language Models to Images for Multimodal Inputs and OutputsCode2
Agent models: Internalizing Chain-of-Action Generation into Reasoning modelsCode2
Diffusion Models in Vision: A SurveyCode2
Exploring Visual Prompts for Adapting Large-Scale ModelsCode2
Power Bundle Adjustment for Large-Scale 3D ReconstructionCode2
TrafficVLM: A Controllable Visual Language Model for Traffic Video CaptioningCode2
Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A ReviewCode2
Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation LibraryCode2
Multi-Fidelity Active Learning with GFlowNetsCode2
Full Parameter Fine-tuning for Large Language Models with Limited ResourcesCode2
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of LanguagesCode2
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofingCode2
VRP-SAM: SAM with Visual Reference PromptCode2
BoW3D: Bag of Words for Real-Time Loop Closing in 3D LiDAR SLAMCode2
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text GenerationCode2
How far are today's time-series models from real-world weather forecasting applications?Code2
Point Cloud Mamba: Point Cloud Learning via State Space ModelCode2
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation SchemeCode2
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and DatasetsCode2
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual GroundingCode2
D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive SecurityCode2
Learning Diffusion Priors from Observations by Expectation MaximizationCode2
Emotionally Enhanced Talking Face GenerationCode2
Vision Transformer with Quadrangle AttentionCode2
DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D ImagesCode2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text GenerationCode2
AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three WeeksCode2
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction TuningCode2
Towards Metrical Reconstruction of Human FacesCode2
Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic CorpusCode2
TorchAudio: Building Blocks for Audio and Speech ProcessingCode2
Deep Learning Accelerated Quantum Transport Simulations in Nanoelectronics: From Break Junctions to Field-Effect TransistorsCode2
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene ClassificationCode2
Software package for simulations using the coarse-grained CALVADOS modelCode2
Multi-CPR: A Multi Domain Chinese Dataset for Passage RetrievalCode2
FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph PerspectiveCode2
Interactive4D: Interactive 4D LiDAR SegmentationCode2
Prototypical Networks for Few-shot LearningCode2
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal StereoCode2
Language is All a Graph NeedsCode2
Show:102550
← PrevPage 118 of 3547Next →