SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 69266950 of 474278 papers

TitleStatusHype
DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance SegmentationCode0
X-ReID: Multi-granularity Information Interaction for Video-Based Visible-Infrared Person Re-IdentificationCode0
Provably Outlier-resistant Semi-parametric Regression for Transferable Calibration of Low-cost Air-quality SensorsCode0
Distilling Cross-Modal Knowledge via Feature DisentanglementCode0
Multi-Context Fusion Transformer for Pedestrian Crossing Intention Prediction in Urban EnvironmentsCode0
iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and LocalizationCode0
EM2LDL: A Multilingual Speech Corpus for Mixed Emotion Recognition through Label Distribution LearningCode0
Learning Subgroups with Maximum Treatment Effects without Causal HeuristicsCode0
Zoo3D: Zero-Shot 3D Object Detection at Scene LevelCode0
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal ModelsCode0
LiveVectorLake: A Real-Time Versioned Knowledge Base Architecture for Streaming Vector Updates and Temporal RetrievalCode0
Deep Research: A Systematic Survey0
PropensityBench: Evaluating Latent Safety Risks in Large Language Models via an Agentic ApproachCode0
LLMs for Low-Resource Dialect Translation Using Context-Aware Prompting: A Case Study on SylhetiCode0
Masked Autoencoder Joint Learning for Robust Spitzoid Tumor ClassificationCode0
BackdoorVLM: A Benchmark for Backdoor Attacks on Vision-Language ModelsCode0
MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis in Chest X-RayCode0
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective0
SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token PruningCode0
Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning0
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?0
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks0
Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling0
Cognitive Foundations for Reasoning and Their Manifestation in LLMs0
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT0
Show:102550
← PrevPage 278 of 18972Next →