SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1860118650 of 474278 papers

TitleStatusHype
LASPA: Language Agnostic Speaker Disentanglement with Prefix-Tuned Cross-Attention0
The Impact of Software Testing with Quantum Optimization Meets Machine Learning0
unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary ReasoningCode0
Comparison of spectrogram scaling in multi-label Music Genre Recognition0
Near-Optimal Clustering in Mixture of Markov Chains0
FinRobot: Generative Business Process AI Agents for Enterprise Resource Planning in Finance0
Zero-Shot Text-to-Speech for Vietnamese0
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient RoboticsCode12
Towards Machine Unlearning for Paralinguistic Speech Processing0
Cocktail-Party Audio-Visual Speech Recognition0
Continual Speech Learning with Fused Speech Features0
Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation CapabilityCode0
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head GenerationCode1
Polishing Every Facet of the GEM: Testing Linguistic Competence of LLMs and Humans in KoreanCode1
Red Teaming AI Policy: A Taxonomy of Avoision and the EU AI Act0
SALAD: Systematic Assessment of Machine Unlearing on LLM-Aided Hardware Design0
LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation0
Enhancing Speech Emotion Recognition with Graph-Based Multimodal Fusion and Prosodic Features for the Speech Emotion Recognition in Naturalistic Conditions Challenge at Interspeech 20250
Through a Steerable Lens: Magnifying Neural Network Interpretability via Phase-Based Extrapolation0
LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification0
COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents0
Fingerprinting Deep Learning Models via Network Traffic Patterns in Federated Learning0
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data0
Trojan Horse Hunt in Time Series Forecasting for Space Operations0
From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models0
Explainable AI Systems Must Be Contestable: Here's How to Make It Happen0
Selecting for Less Discriminatory Algorithms: A Relational Search Framework for Navigating Fairness-Accuracy Trade-offs in Practice0
AI Data Development: A Scorecard for the System Card Framework0
Retrieval-Augmented Generation of Ontologies from Relational Databases0
Feature-aware Hypergraph Generation via Next-Scale Prediction0
Image Generation from Contextually-Contradictory Prompts0
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented DialogueCode1
Automatic Stage Lighting Control: Is it a Rule-Driven Process or Generative Task?Code0
scDataset: Scalable Data Loading for Deep Learning on Large-Scale Single-Cell OmicsCode1
GLoSS: Generative Language Models with Semantic Search for Sequential RecommendationCode1
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentationCode0
TimeGraph: Synthetic Benchmark Datasets for Robust Time-Series Causal DiscoveryCode1
TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species GenerationCode0
Ridgeformer: Mutli-Stage Contrastive Training For Fine-grained Cross-Domain Fingerprint RecognitionCode0
ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute RecognitionCode0
ReGA: Representation-Guided Abstraction for Model-based Safeguarding of LLMsCode0
Mitigating Disparate Impact of Differentially Private Learning through Bounded Adaptive Clipping0
Gradient-Based Model Fingerprinting for LLM Similarity Detection and Family Classification0
A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering0
Align is not Enough: Multimodal Universal Jailbreak Attack against Multimodal Large Language Models0
AIMSCheck: Leveraging LLMs for AI-Assisted Review of Modern Slavery Statements Across JurisdictionsCode1
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-TuningCode5
IF-GUIDE: Influence Function-Guided Detoxification of LLMsCode1
VirnyFlow: A Design Space for Responsible Model DevelopmentCode0
Constrained Sliced Wasserstein EmbeddingCode0
Show:102550
← PrevPage 373 of 9486Next →