SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1980119850 of 474278 papers

TitleStatusHype
Community Forensics: Using Thousands of Generators to Train Fake Image DetectorsCode1
MEG: Medical Knowledge-Augmented Large Language Models for Question AnsweringCode1
The Recurrent Sticky Hierarchical Dirichlet Process Hidden Markov ModelCode1
Beyond Model Adaptation at Test Time: A SurveyCode1
Learning Generalizable Policy for Obstacle-Aware Autonomous Drone RacingCode1
Energy-based physics-informed neural network for frictionless contact problems under large deformationCode1
PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose ClothingCode1
Number Cookbook: Number Understanding of Language Models and How to Improve ItCode1
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language ModelsCode1
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image GenerationCode1
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for MambaCode1
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object DetectionCode1
Time-Causal VAE: Robust Financial Time Series GeneratorCode1
MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMsCode1
Membership Inference Attacks against Large Vision-Language ModelsCode1
Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money LaunderingCode1
Generative Artificial Intelligence Meets Synthetic Aperture Radar: A SurveyCode1
SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-AgentsCode1
Adversarial multi-task underwater acoustic target recognition: towards robustness against various influential factorsCode1
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation ModelsCode1
Inference Optimal VLMs Need Fewer Visual Tokens and More ParametersCode1
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object DetectionCode1
PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic DevicesCode1
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression PerspectiveCode1
Grounding Natural Language to SQL Translation with Data-Based Self-ExplanationsCode1
Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution DetectionCode1
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Label Critic: Design Data Before ModelsCode1
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity DatasetCode1
Breaking the Reclustering Barrier in Centroid-based Deep ClusteringCode1
Improving Steering Vectors by Targeting Sparse Autoencoder FeaturesCode1
QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression RecognitionCode1
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease KnowledgeCode1
Multi-Transmotion: Pre-trained Model for Human Motion PredictionCode1
MILU: A Multi-task Indic Language Understanding BenchmarkCode1
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for NetworkCode1
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and TextCode1
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant UnitsCode1
On Targeted Manipulation and Deception when Optimizing LLMs for User FeedbackCode1
Learning to Assist Humans without Inferring RewardsCode1
Not Just Object, But State: Compositional Incremental Learning without ForgettingCode1
Sparsing Law: Towards Large Language Models with Greater Activation SparsityCode1
Continual LLaVA: Continual Instruction Tuning in Large Vision-Language ModelsCode1
Bridge-IF: Learning Inverse Protein Folding with Markov BridgesCode1
Benchmarking Vision, Language, & Action Models on Robotic Learning TasksCode1
GraphXAIN: Narratives to Explain Graph Neural NetworksCode1
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence ChallengeCode1
Expanding Sparse Tuning for Low Memory UsageCode1
Can Language Models Learn to Skip Steps?Code1
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsCode1
Show:102550
← PrevPage 397 of 9486Next →