SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1685116900 of 474278 papers

TitleStatusHype
Detecting Harmful Memes with Decoupled Understanding and Guided CoT ReasoningCode0
Princeton365: A Diverse Dataset with Accurate Camera PoseCode1
Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field SamplingCode0
FEDTAIL: Federated Long-Tailed Domain Generalization with Sharpness-Guided Gradient MatchingCode0
Transforming Expert Knowledge into Scalable Ontology via Large Language Models0
Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe MicroscopyCode0
Low-resource domain adaptation while minimizing energy and hardware resource consumption0
TACTIC: Translation Agents with Cognitive-Theoretic Interactive CollaborationCode1
Diversity-Guided MLP Reduction for Efficient Large Vision TransformersCode1
Vuyko Mistral: Adapting LLMs for Low-Resource Dialectal TranslationCode0
KokushiMD-10: Benchmark for Evaluating Large Language Models on Ten Japanese National Healthcare Licensing Examinations0
SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence GenerationCode1
Automatic Depression Assessment using Machine Learning: A Comprehensive Survey0
Variational Supervised Contrastive Learning0
Segment Any Architectural Facades (SAAF):An automatic segmentation model for building facades, walls and windows based on multimodal semantics guidance0
DEBATE: A Dataset for Disentangling Textual Ambiguity in Mandarin Through SpeechCode0
C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image SegmentationCode1
HAELT: A Hybrid Attentive Ensemble Learning Transformer Framework for High-Frequency Stock Price Forecasting0
Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech0
Knowledge Compression via Question Generation: Enhancing Multihop Document Retrieval without Fine-tuning0
WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-CodeCode0
A Hybrid GA LLM Framework for Structured Task OptimizationCode0
Recommendations and Reporting Checklist for Rigorous & Transparent Human Baselines in Model EvaluationsCode0
Hidden Bias in the Machine: Stereotypes in Text-to-Image Models0
Double Low-Rank 4D Tensor Decomposition for Circular RIS-Aided mmWave MIMO-NOMA System Channel Estimation in Mobility Scenarios0
Computation Capacity Maximization for Pinching Antennas-Assisted Wireless Powered MEC Systems0
Multipath Component-Enhanced Signal Processing for Integrated Sensing and Communication Systems0
Stability of Mean-Field Variational Inference0
Automating Exploratory Multiomics Research via Language Models0
Refusal-Feature-guided Teacher for Safe Finetuning via Data Filtering and Alignment Distillation0
The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine LearningCode0
Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholdingCode0
Robust Transceiver Design for RIS Enhanced Dual-Functional Radar-Communication with Movable Antenna0
Diffusion Sequence Models for Enhanced Protein Representation and GenerationCode1
Conditional Local Independence Testing with Application to Dynamic Causal Discovery0
CommSense: A Rapid and Accurate ISAC Paradigm0
Real-Time Execution of Action Chunking Flow PoliciesCode3
Prompt to Protection: A Comparative Study of Multimodal LLMs in Construction Hazard Recognition0
SILK: Smooth InterpoLation frameworK for motion in-betweening A Simplified Computational Approach0
When Style Breaks Safety: Defending Language Models Against Superficial Style AlignmentCode0
STREAMINGGS: Voxel-Based Streaming 3D Gaussian Splatting with Memory Optimization and Architectural Support0
Towards a Small Language Model Lifecycle Framework0
SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark0
Are Trees Really Green? A Detection Approach of IoT Malware Attacks0
HyColor: An Efficient Heuristic Algorithm for Graph Coloring0
Diffusion of Responsibility in Collective Decision Making0
Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks0
Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation0
Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition0
Slow and Fast Neurons Cooperate in Contextual Working Memory through Timescale Diversity0
Show:102550
← PrevPage 338 of 9486Next →