SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1475114800 of 474278 papers

TitleStatusHype
Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases0
Automatic Speech Recognition Biases in Newcastle English: an Error Analysis0
Relational Deep Learning: Challenges, Foundations and Next-Generation Architectures0
Bayesian Epistemology with Weighted Authority: A Formal Architecture for Truth-Promoting Autonomous Scientific Reasoning0
FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models0
Solving Zero-Sum Convex Markov Games0
Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues0
The Role of Explanation Styles and Perceived Accuracy on Decision Making in Predictive Process Monitoring0
Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support0
SEP-GCN: Leveraging Similar Edge Pairs with Temporal and Spatial Contexts for Location-Based Recommender Systems0
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View0
Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning0
Fine-grained Image Retrieval via Dual-Vision Adaptation0
Reimagination with Test-time Observation Interventions: Distractor-Robust World Model Predictions for Visual Model Predictive Control0
CapsDT: Diffusion-Transformer for Capsule Robot Manipulation0
Spatially-Aware Evaluation of Segmentation Uncertainty0
Multi-use LLM Watermarking and the False Detection Problem0
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement0
Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons LearnedCode0
Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot NavigationCode1
Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clusteringCode0
Beyond Audio and Pose: A General-Purpose Framework for Video SynchronizationCode0
Dense 3D Displacement Estimation for Landslide Monitoring via Fusion of TLS Point Clouds and Embedded RGB ImagesCode1
TrainVerify: Equivalence-Based Verification for Distributed LLM Training0
Double Entendre: Robust Audio-Based AI-Generated Lyrics Detection via Multi-View FusionCode0
On using AI for EEG-based BCI applications: problems, current challenges and future trendsCode1
PBFT-Backed Semantic Voting for Multi-Agent Memory PruningCode0
LLMs in Coding and their Impact on the Commercial Software Engineering Landscape0
Floating-Point Neural Networks Are Provably Robust Universal ApproximatorsCode0
Beyond Prediction -- Structuring Epistemic Integrity in Artificial Reasoning Systems0
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation0
Noise Fusion-based Distillation Learning for Anomaly Detection in Complex Industrial Environments0
Improved Intelligibility of Dysarthric Speech using Conditional Flow Matching0
BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios0
Knee-Deep in C-RASP: A Transformer Depth HierarchyCode0
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse0
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models0
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech SystemsCode1
Malware Classification Leveraging NLP & Machine Learning for Enhanced AccuracyCode0
Enhanced Dermatology Image Quality Assessment via Cross-Domain Training0
Spotting tell-tale visual artifacts in face swapping videos: strengths and pitfalls of CNN detectors0
On the Performance of Cyber-Biomedical Features for Intrusion Detection in Healthcare 5.00
CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity0
Weight Factorization and Centralization for Continual Learning in Speech Recognition0
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training0
Probing the Robustness of Large Language Models Safety to Latent PerturbationsCode1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory AnalysisCode0
Data-Agnostic Cardinality Learning from Imperfect WorkloadsCode0
Empowering Graph-based Approximate Nearest Neighbor Search with Adaptive Awareness Capabilities0
Show:102550
← PrevPage 296 of 9486Next →