SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 89519000 of 661570 papers

TitleStatusHype
Paint by Inpaint: Learning to Add Image Objects by Removing Them FirstCode2
WorldGPT: Empowering LLM as Multimodal World ModelCode2
Efficient Remote Sensing with Harmonized Transfer Learning and Modality AlignmentCode2
S^2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image ClassificationCode2
FRAME: A Modular Framework for Autonomous Map Merging: Advancements in the FieldCode2
LLMParser: An Exploratory Study on Using Large Language Models for Log ParsingCode2
Generative Diffusion-based Downscaling for ClimateCode2
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action RepresentationsCode2
Embedded FPGA Developments in 130nm and 28nm CMOS for Machine Learning in Particle Detector ReadoutCode2
UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter TuningCode2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesCode2
OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest SearchCode2
REBEL: Reinforcement Learning via Regressing Relative RewardsCode2
CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather ConditionsCode2
Learning Visuotactile Skills with Two Multifingered HandsCode2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM AgentsCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Commonsense Prototype for Outdoor Unsupervised 3D Object DetectionCode2
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic LanguagesCode2
EEG-Deformer: A Dense Convolutional Transformer for Brain-computer InterfacesCode2
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMsCode2
DAVE -- A Detect-and-Verify Paradigm for Low-Shot CountingCode2
Multimodal Information Interaction for Medical Image SegmentationCode2
Weak-to-Strong Extrapolation Expedites AlignmentCode2
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion ModelsCode2
Multi-Scale Representations by Varying Window Attention for Semantic SegmentationCode2
Latent Modulated Function for Computational Optimal Continuous Image RepresentationCode2
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language ModelsCode2
Gradformer: Graph Transformer with Exponential DecayCode2
From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language ModelsCode2
A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-ResolutionCode2
MaGGIe: Masked Guided Gradual Human Instance MattingCode2
Let's Think Dot by Dot: Hidden Computation in Transformer Language ModelsCode2
Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for TelecommunicationsCode2
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and ChallengesCode2
zkLLM: Zero Knowledge Proofs for Large Language ModelsCode2
Facilitating Advanced Sentinel-2 Analysis Through a Simplified Computation of Nadir BRDF Adjusted ReflectanceCode2
Multi-Session SLAM with Differentiable Wide-Baseline Pose OptimizationCode2
From Parts to Whole: A Unified Reference Framework for Controllable Human Image GenerationCode2
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space ModelCode2
Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question AnsweringCode2
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose EstimationCode2
An empirical study of LLaMA3 quantization: from LLMs to MLLMsCode2
Graphic Design with Large Multimodal ModelCode2
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMsCode2
Deep Learning-Based Point Cloud Registration: A Comprehensive Survey and TaxonomyCode2
SpaceByte: Towards Deleting Tokenization from Large Language ModelingCode2
SwinFuSR: an image fusion-inspired model for RGB-guided thermal image super-resolutionCode2
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic UnderstandingCode2
Show:102550
← PrevPage 180 of 13232Next →