SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2015120200 of 474278 papers

TitleStatusHype
Cross-model Control: Improving Multiple Large Language Models in One-time TrainingCode1
Value Residual Learning For Alleviating Attention Concentration In TransformersCode1
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language TuningCode1
PlantCamo: Plant Camouflage DetectionCode1
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-TuningCode1
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language ModelsCode1
Entity-based Reinforcement Learning for Autonomous Cyber DefenceCode1
Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked DataCode1
Graphusion: A RAG Framework for Knowledge Graph Construction with a Global PerspectiveCode1
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context PromptingCode1
Vehicle Dynamics Parameter Estimation Methodology for Virtual Automated Driving TestingCode1
PyTSC: A Unified Platform for Multi-Agent Reinforcement Learning in Traffic Signal ControlCode1
SpeakGer: A meta-data enriched speech corpus of German state and federal parliamentsCode1
Neural Cover Selection for Image SteganographyCode1
CLEAR: Character Unlearning in Textual and Visual ModalitiesCode1
Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence GuaranteesCode1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent CollaborationCode1
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web InteractionsCode1
Multi-scale feature reconstruction network for industrial anomaly detectionCode1
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic EnvironmentsCode1
Gaze-Assisted Medical Image SegmentationCode1
Att2CPC: Attention-Guided Lossy Attribute Compression of Point CloudsCode1
DisenGCD: A Meta Multigraph-assisted Disentangled Graph Learning Framework for Cognitive DiagnosisCode1
Leveraging Skills from Unlabeled Prior Data for Efficient Online ExplorationCode1
Scalable Random Feature Latent Variable ModelsCode1
Diffusion Priors for Variational Likelihood Estimation and Image DenoisingCode1
Spiking Graph Neural Network on Riemannian ManifoldsCode1
Fire and Smoke Detection with Burning Intensity RepresentationCode1
LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series ForecastingCode1
Benchmarking Multi-Scene Fire and Smoke DetectionCode1
Progressive Compositionality In Text-to-Image Generative ModelsCode1
GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain AdaptationCode1
Publishing Neural Networks in Drug Discovery Might Compromise Training Data PrivacyCode1
Scalable Influence and Fact Tracing for Large Language Model PretrainingCode1
EEG-DIF: Early Warning of Epileptic Seizures through Generative Diffusion Model-based Multi-channel EEG Signals ForecastingCode1
SpikMamba: When SNN meets Mamba in Event-based Human Action RecognitionCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
Emphasizing Discriminative Features for Dataset Distillation in Complex ScenariosCode1
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement LearningCode1
Aligning Large Language Models via Self-Steering OptimizationCode1
TopoDiffusionNet: A Topology-aware Diffusion ModelCode1
Automated Spinal MRI Labelling from Reports Using a Large Language ModelCode1
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under AmbiguitiesCode1
Joint Point Cloud Upsampling and Cleaning with Octree-based CNNsCode1
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information CoverageCode1
Multi-Layer Gaussian Splatting for Immersive Anatomy VisualizationCode1
Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and ImprovementsCode1
LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model MergingCode1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output GenerationCode1
Non-myopic Generation of Language Models for Reasoning and PlanningCode1
Show:102550
← PrevPage 404 of 9486Next →