SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1975119800 of 474278 papers

TitleStatusHype
GlocalCLIP: Object-agnostic Global-Local Prompt Learning for Zero-shot Anomaly DetectionCode1
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance RepresentationCode1
Training objective drives the consistency of representational similarity across datasetsCode1
Inversion-based Latent Bayesian OptimizationCode1
SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query BenchmarkCode1
LLMs as Method Actors: A Model for Prompt Engineering and ArchitectureCode1
Aioli: A Unified Optimization Framework for Language Model Data MixingCode1
Aligning Large Language Models and Geometric Deep Models for Protein RepresentationCode1
From Transparent to Opaque: Rethinking Neural Implicit Surfaces with α-NeuSCode1
MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling QuantizationCode1
HeartBERT: A Self-Supervised ECG Embedding Model for Efficient and Effective Medical Signal AnalysisCode1
BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential EquationsCode1
Learning the rules of peptide self-assembly through data mining with large language modelsCode1
Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity RecognitionCode1
Why These Documents? Explainable Generative Retrieval with Hierarchical Category PathsCode1
Tell What You Hear From What You See -- Video to Audio Generation Through TextCode1
CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature PropagationCode1
FineTuneBench: How well do commercial fine-tuning APIs infuse knowledge into LLMs?Code1
NeuroFly: A framework for whole-brain single neuron reconstructionCode1
Generating Highly Designable Proteins with Geometric Algebra Flow MatchingCode1
Peri-midFormer: Periodic Pyramid Transformer for Time Series AnalysisCode1
Towards Competitive Search Relevance For Inference-Free Learned Sparse RetrieversCode1
OneProt: Towards Multi-Modal Protein Foundation ModelsCode1
Enabling LLM Knowledge Analysis via Extensive MaterializationCode1
Image Understanding Makes for A Good Tokenizer for Image GenerationCode1
Cross- and Intra-image Prototypical Learning for Multi-label Disease Diagnosis and InterpretationCode1
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous DrivingCode1
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion InversionCode1
Variational Low-Rank Adaptation Using IVONCode1
The State and Fate of Summarization DatasetsCode1
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian LanguagesCode1
Distributed Attack-Resilient Platooning Against False Data InjectionCode1
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark DatasetCode1
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein EngineeringCode1
wav2sleep: A Unified Multi-Modal Approach to Sleep Stage Classification from Physiological SignalsCode1
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and ModalitiesCode1
Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement LearningCode1
DELIFT: Data Efficient Language model Instruction Fine TuningCode1
Energy-based physics-informed neural network for frictionless contact problems under large deformationCode1
MEG: Medical Knowledge-Augmented Large Language Models for Question AnsweringCode1
Learning Generalizable Policy for Obstacle-Aware Autonomous Drone RacingCode1
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data ContaminationCode1
Reconsidering the Performance of GAE in Link PredictionCode1
PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose ClothingCode1
Beyond Model Adaptation at Test Time: A SurveyCode1
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language ModelsCode1
The Recurrent Sticky Hierarchical Dirichlet Process Hidden Markov ModelCode1
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequencesCode1
Polynomial Composition Activations: Unleashing the Dynamics of Large Language ModelsCode1
Community Forensics: Using Thousands of Generators to Train Fake Image DetectorsCode1
Show:102550
← PrevPage 396 of 9486Next →