SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1795118000 of 474278 papers

TitleStatusHype
SAMGPT: Text-free Graph Foundation Model for Multi-domain Pre-training and Cross-domain AdaptationCode1
OntoTune: Ontology-Driven Self-training for Aligning Large Language ModelsCode1
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution ShiftsCode1
Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT ReconstructionCode1
A Comprehensive Review of Protein Language ModelsCode1
QuantumDNA: A Python Package for Analyzing Quantum Charge Dynamics in DNA and Exploring Its Biological RelevanceCode1
UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and UnderstandingCode1
Graph Neural Networks for Efficient AC Power Flow Prediction in Power GridsCode1
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel EncodingCode1
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation LearningCode1
ParaSurf: A Surface-Based Deep Learning Approach for Paratope-Antigen Interaction PredictionCode1
ProofWala: Multilingual Proof Data Synthesis and Theorem-ProvingCode1
SurGen: 1020 H&E-stained Whole Slide Images With Survival and Genetic MarkersCode1
Swin-MSTP: Swin transformer with multi-scale temporal perception for continuous sign language recognitionCode1
Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 ChallengeCode1
Gemstones: A Model Suite for Multi-Faceted Scaling LawsCode1
Cached Multi-Lora Composition for Multi-Concept Image GenerationCode1
3DMolFormer: A Dual-channel Framework for Structure-based Drug DiscoveryCode1
Position-aware Automatic Circuit DiscoveryCode1
EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and InferenceCode1
Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and BeyondCode1
Tolerance-Aware Deep OpticsCode1
LLM-Supported Natural Language to Bash TranslationCode1
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI AgentsCode1
Oracular Programming: A Modular Foundation for Building LLM-Enabled SoftwareCode1
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific SubspacesCode1
SSMLoRA: Enhancing Low-Rank Adaptation with State Space ModelCode1
FlightForge: Advancing UAV Research with Procedural Generation of High-Fidelity Simulation and Integrated AutonomyCode1
Otter: Generating Tests from Issues to Validate SWE PatchesCode1
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM GuardrailsCode1
Chest X-ray Foundation Model with Global and Local Representations IntegrationCode1
Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and EvaluationCode1
nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent WorkflowCode1
pytopicgram: A library for data extraction and topic modeling from Telegram channelsCode1
M-IFEval: Multilingual Instruction-Following EvaluationCode1
An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative TasksCode1
Distillation and Pruning for Scalable Self-Supervised Representation-Based Speech Quality AssessmentCode1
Wavelet-Assisted Multi-Frequency Attention Network for PansharpeningCode1
Mitigating Unintended Memorization with LoRA in Federated Learning for LLMsCode1
Decoding Human Attentive States from Spatial-temporal EEG Patches Using TransformersCode1
Great Models Think Alike and this Undermines AI OversightCode1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption RefinementCode1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation ModelsCode1
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz ConstantsCode1
Towards Unified Music Emotion Recognition across Dimensional and Categorical ModelsCode1
Generative Autoregressive Transformers for Model-Agnostic Federated MRI ReconstructionCode1
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among InstancesCode1
Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient On-Device AgentsCode1
PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models?Code1
Variational Control for Guidance in Diffusion ModelsCode1
Show:102550
← PrevPage 360 of 9486Next →