SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7551-7600 of 661,570 papers

Title | Status | Hype
Developing Foundation Models for Universal Segmentation from 3D Whole-Body Positron Emission Tomography | | 0
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge | | 0
Text-only adaptation in LLM-based ASR through text denoising | | 0
An Enhanced Projection Pursuit Tree Classifier with Visual Methods for Assessing Algorithmic Improvements | | 0
CRAFT: A Tendon-Driven Hand with Hybrid Hard-Soft Compliance | | 0
HELM: Hierarchical and Explicit Label Modeling with Graph Learning for Multi-Label Image Classification | | 0
Context-dependent manifold learning: A neuromodulated constrained autoencoder approach | | 0
See, Symbolize, Act: Grounding VLMs with Spatial Representations for Better Gameplay | | 0
Topologically Stable Hough Transform | | 0
Semantic-Aware Reconstruction Error for Detecting AI-Generated Images | | 0
Normative Common Ground Replication (NormCoRe): Replication-by-Translation for Studying Norms in Multi-agent AI | | 0
BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs | | 0
EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection | | 0
Pyramidal Patchification Flow for Visual Generation | Code | 0
Logics-Parsing-Omni Technical Report | Code | 0
TianQuan-S2S: A Subseasonal-to-Seasonal Global Weather Model via Incorporate Climatology State | Code | 0
SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition | Code | 0
Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction | Code | 0
High-Fidelity Medical Shape Generation via Skeletal Latent Diffusion | Code | 0
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment | Code | 0
Geometric Autoencoder for Diffusion Models | Code | 0
Historical Consensus: Preventing Posterior Collapse via Iterative Selection of Gaussian Mixture Priors | Code | 0
Sharpness-Aware Minimization for Generalized Embedding Learning in Federated Recommendation | Code | 0
Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents | Code | 0
CEI-3D: Collaborative Explicit-Implicit 3D Reconstruction for Realistic and Fine-Grained Object Editing | Code | 0
SommBench: Assessing Sommelier Expertise of Language Models | Code | 0
TopoBench: Benchmarking LLMs on Hard Topological Reasoning | Code | 0
O3N: Omnidirectional Open-Vocabulary Occupancy Prediction | Code | 0
The Latent Color Subspace: Emergent Order in High-Dimensional Chaos | Code | 0
Verifying LLM Inference to Detect Model Weight Exfiltration | Code | 0
Overcoming the Curvature Bottleneck in MeanFlow | Code | 0
Towards Contextual Sensitive Data Detection | Code | 0
RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference | Code | 0
ReHARK: Refined Hybrid Adaptive RBF Kernels for Robust One-Shot Vision-Language Adaptation | Code | 0
GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows | Code | 0
CodeEvolve: an open source evolutionary coding agent for algorithmic discovery and optimization | Code | 0
Evaluating Generative Models via One-Dimensional Code Distributions | Code | 0
Efficient Construction of Implicit Surface Models From a Single Image for Motion Generation | Code | 0
GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models | Code | 0
Evolving Beyond Snapshots: Harmonizing Structure and Sequence via Entity State Tuning for Temporal Knowledge Graph Forecasting | Code | 0
[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic | Code | 0
Multi-Paradigm Collaborative Adversarial Attack Against Multi-Modal Large Language Models | Code | 0
BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion | Code | 0
ZTab: Domain-based Zero-shot Annotation for Table Columns | Code | 0
Bridging Discrete Marks and Continuous Dynamics: Dual-Path Cross-Interaction for Marked Temporal Point Processes | Code | 0
TornadoNet: Real-Time Building Damage Detection with Ordinal Supervision | Code | 0
The Density of Cross-Persistence Diagrams and Its Applications | Code | 0
MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation | Code | 0
Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning | Code | 0
FL-MedSegBench: A Comprehensive Benchmark for Federated Learning on Medical Image Segmentation | Code | 0