SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 76517700 of 177340 papers

TitleStatusHype
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set RelationshipsCode2
GREEN: a lightweight architecture using learnable wavelets and Riemannian geometry for biomarker explorationCode2
Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations ModelingCode2
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken ConversationsCode2
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction TuningCode2
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic ManipulationCode2
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept CompositionCode2
Towards Multi-spatiotemporal-scale Generalized PDE ModelingCode2
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase PartitionCode2
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset GenerationCode2
Deep Homography Estimation for Visual Place RecognitionCode2
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward EncodingsCode2
CARZero: Cross-Attention Alignment for Radiology Zero-Shot ClassificationCode2
UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei ImagesCode2
Contextualized Diffusion Models for Text-Guided Image and Video GenerationCode2
Retrieval is Accurate GenerationCode2
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective RewardsCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
A Survey on Remote Sensing Foundation Models: From Vision to MultimodalityCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
How do Large Language Models Handle Multilingualism?Code2
NARUTO: Neural Active Reconstruction from Uncertain Target ObservationsCode2
Deep learning for 3D human pose estimation and mesh recovery: A surveyCode2
Global and Local Prompts Cooperation via Optimal Transport for Federated LearningCode2
VNLP: Turkish NLP PackageCode2
TempCompass: Do Video LLMs Really Understand Videos?Code2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature FusionCode2
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement LearningCode2
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPTCode2
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QACode2
xT: Nested Tokenization for Larger Context in Large ImagesCode2
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target DetectionCode2
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model AgentsCode2
HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image SegmentationCode2
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B PeopleCode2
Learning to Decode Collaboratively with Multiple Language ModelsCode2
Q-DiT: Accurate Post-Training Quantization for Diffusion TransformersCode2
QAQ: Quality Adaptive Quantization for LLM KV CacheCode2
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion ModelsCode2
IsolateGPT: An Execution Isolation Architecture for LLM-Based Agentic SystemsCode2
Tracking Meets LoRA: Faster Training, Larger Model, Stronger PerformanceCode2
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion ModelsCode2
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionCode2
RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation ModelCode2
The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023Code2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor EnvironmentsCode2
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
A Comprehensive Study of Multimodal Large Language Models for Image Quality AssessmentCode2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationCode2
Show:102550
← PrevPage 154 of 3547Next →