SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 90769100 of 474278 papers

TitleStatusHype
DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair ParadigmCode0
TASP: Topology-aware Sequence ParallelismCode0
Multilingual Knowledge Graph Completion via Efficient Multilingual Knowledge SharingCode0
An LLM-Powered Cooperative Framework for Large-Scale Multi-Vehicle NavigationCode0
MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During GenerationCode0
IsoSignVid2Aud: Sign Language Video to Audio Conversion without Text IntermediariesCode0
Understanding DeepResearch via ReportsCode0
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?Code0
RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI ScansCode0
Towards Real-World Deepfake Detection: A Diverse In-the-wild Dataset of Forgery FacesCode0
Mitigating Judgment Preference Bias in Large Language Models through Group-Based PollingCode0
GCPO: When Contrast Fails, Go GoldCode0
Dynamic Factor Analysis of Price Movements in the Philippine Stock Exchange0
Interpretable Robot Control via Structured Behavior Trees and Large Language ModelsCode0
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS0
ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn JailbreaksCode0
Autonomy-Aware Clustering: When Local Decisions Supersede Global PrescriptionsCode0
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from ScratchCode0
Rethinking Inter-LoRA Orthogonality in Adapter Merging: Insights from Orthogonal Monte Carlo Dropout0
TFM Dataset: A Novel Multi-task Dataset and Integrated Pipeline for Automated Tear Film Break-Up SegmentationCode0
HARP-NeXt: High-Speed and Accurate Range-Point Fusion Network for 3D LiDAR Semantic SegmentationCode0
Unified Unsupervised Anomaly Detection via Matching Cost FilteringCode0
POME: Post Optimization Model Edit via Muon-style ProjectionCode0
Angular Constraint Embedding via SpherePair Loss for Constrained ClusteringCode0
Robot Learning from Any Images0
Show:102550
← PrevPage 364 of 18972Next →