SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 95519600 of 661570 papers

TitleStatusHype
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language ModelsCode2
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic GraspingCode2
Caltech Aerial RGB-Thermal Dataset in the WildCode2
MonoOcc: Digging into Monocular Semantic Occupancy PredictionCode2
Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM EraCode2
Envision3D: One Image to 3D with Anchor Views InterpolationCode2
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban EnvironmentsCode2
AcademiaOS: Automating Grounded Theory Development in Qualitative Research with Large Language ModelsCode2
SOTOPIA-π: Interactive Learning of Socially Intelligent Language AgentsCode2
A Decade's Battle on Dataset Bias: Are We There Yet?Code2
Tackling the Singularities at the Endpoints of Time Intervals in Diffusion ModelsCode2
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at ScaleCode2
Knowledge Conflicts for LLMs: A SurveyCode2
Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion ModelCode2
Scattered Mixture-of-Experts ImplementationCode2
Language models scale reliably with over-training and on downstream tasksCode2
JAXbind: Bind any function to JAXCode2
Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case StudyCode2
Pairwise Comparisons Are All You NeedCode2
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistencyCode2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language ModelCode2
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting EditingCode2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
CleanAgent: Automating Data Standardization with LLM-based AgentsCode2
FastMAC: Stochastic Spectral Sampling of Correspondence GraphCode2
Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluationCode2
Motion Mamba: Efficient and Long Sequence Motion GenerationCode2
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic ArchitectureCode2
CMax-SLAM: Event-based Rotational-Motion Bundle Adjustment and SLAM System using Contrast MaximizationCode2
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled ReasoningCode2
VLKEB: A Large Vision-Language Model Knowledge Editing BenchmarkCode2
SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAMCode2
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training FrameworkCode2
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous DrivingCode2
Harder Tasks Need More Experts: Dynamic Routing in MoE ModelsCode2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image AnalysisCode2
Open-World Semantic Segmentation Including Class SimilarityCode2
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-TuningCode2
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image SegmentationCode2
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionCode2
Characterization of Large Language Model Development in the DatacenterCode2
Ensembling Prioritized Hybrid Policies for Multi-agent PathfindingCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code CompletionCode2
RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation ModelCode2
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space LearningCode2
Robust Synthetic-to-Real Transfer for Stereo MatchingCode2
Scalable Spatiotemporal Prediction with Bayesian Neural FieldsCode2
Show:102550
← PrevPage 192 of 13232Next →