SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 3351-3400 of 659983 papers

Title | Status | Hype
Bayesian Inference of Psychometric Variables From Brain and Behavior in Implicit Association Tests |  | 0
Fanar 2.0: Arabic Generative AI Stack |  | 0
TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation |  | 0
CFM: Language-aligned Concept Foundation Model for Vision | Code | 0
TennisExpert: Towards Expert-Level Analytical Sports Video Understanding | Code | 0
PashtoCorp: A 1.25-Billion-Word Corpus, Evaluation Suite, and Reproducible Pipeline for Low-Resource Language Development | Code | 0
FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation | Code | 0
BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection | Code | 0
Retrieving Counterfactuals Improves Visual In-Context Learning | Code | 0
DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture | Code | 0
Improving Low-Resource Machine Translation via Round-Trip Reinforcement Learning | Code | 0
Lyapunov Constrained Soft Actor-Critic (LC-SAC) using Koopman Operator Theory for Quadrotor Trajectory Tracking | Code | 0
Are a Thousand Words Better Than a Single Picture? Beyond Images -- A Framework for Multi-Modal Knowledge Graph Dataset Enrichment | Code | 0
Self-Conditioned Denoising for Atomistic Representation Learning | Code | 0
ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery | Code | 0
ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models | Code | 0
Understanding Cell Fate Decisions with Temporal Attention | Code | 0
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant | Code | 0
Political Alignment in Large Language Models: A Multidimensional Audit of Psychometric Identity and Behavioral Bias | Code | 0
Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure | Code | 0
Cross-modal learning for plankton recognition | Code | 0
DISCOVER: A Solver for Distributional Counterfactual Explanations | Code | 0
Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation | Code | 0
HeBA: Heterogeneous Bottleneck Adapters for Robust Vision-Language Models | Code | 0
Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models | Code | 0
Learning to Present: Inverse Specification Rewards for Agentic Slide Generation | Code | 0
Draft and Refine with Visual Experts | Code | 0
SO-Bench: A Structural Output Evaluation of Multimodal LLMs | Code | 0
DesertFormer: Transformer-Based Semantic Segmentation for Off-Road Desert Terrain Classification in Autonomous Navigation Systems | Code | 0
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation | Code | 0
HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement | Code | 0
VLOD-TTA: Test-Time Adaptation of Vision-Language Object Detectors | Code | 0
GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization | Code | 0
REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models | Code | 0
Demystifing Video Reasoning |  | 1
InCoder-32B: Code Foundation Model for Industrial Scenarios |  | 1
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models |  | 2
M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM |  | 1
SegviGen: Repurposing 3D Generative Model for Part Segmentation |  | 2
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models |  | 2
LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting |  | 0
interwhen: A Generalizable Framework for Verifiable Reasoning with Test-time Monitors | Code | 0
SA-CycleGAN-2.5D: Self-Attention CycleGAN with Tri-Planar Context for Multi-Site MRI Harmonization |  | 0
Bootstrapping Embeddings for Low Resource Languages |  | 0
NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics |  | 0
VorTEX: Various overlap ratio for Target speech EXtraction |  | 0
CLAIM: Camera-LiDAR Alignment with Intensity and Monodepth | Code | 0
OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder | Code | 0
A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems |  | 0
LibraGen: Playing a Balance Game in Subject-Driven Video Generation |  | 0
Page 68 of 13200