SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2050120550 of 474278 papers

TitleStatusHype
MTL-LoRA: Low-Rank Adaptation for Multi-Task LearningCode1
Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning TasksCode1
Rethinking Data Selection at Scale: Random Selection is Almost All You NeedCode1
LogLM: From Task-based to Instruction-based Automated Log AnalysisCode1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing AttentionCode1
OpenCity: A Scalable Platform to Simulate Urban Activities with Massive LLM AgentsCode1
Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT PromptingCode1
Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive ArchitectureCode1
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image SegmentationCode1
AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention ManipulationCode1
Distillation of Discrete Diffusion through Dimensional CorrelationsCode1
Refusal-Trained LLMs Are Easily Jailbroken As Browser AgentsCode1
Retraining-Free Merging of Sparse MoE via Hierarchical ClusteringCode1
Parameter-Efficient Fine-Tuning of State Space ModelsCode1
Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image RestorersCode1
SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language ModelsCode1
Unintentional Unalignment: Likelihood Displacement in Direct Preference OptimizationCode1
DiffPO: A causal diffusion model for learning distributions of potential outcomesCode1
E-Motion: Future Motion Simulation via Event Sequence DiffusionCode1
Zeroth-Order Fine-Tuning of LLMs in Random SubspacesCode1
Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language ModelsCode1
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand AvatarsCode1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
Do Unlearning Methods Remove Information from Language Model Weights?Code1
KinDEL: DNA-Encoded Library Dataset for Kinase InhibitorsCode1
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video UnderstandingCode1
Language Imbalance Driven Rewarding for Multilingual Self-improvingCode1
MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge DevicesCode1
Low-complexity Attention-based Unsupervised Anomalous Sound Detection exploiting Separable Convolutions and Angular LossCode1
When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs LearningCode1
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement LearningCode1
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language BootstrappingCode1
Batched Energy-Entropy acquisition for Bayesian OptimizationCode1
MiRAGeNews: Multimodal Realistic AI-Generated News DetectionCode1
Mentor-KD: Making Small Language Models Better Multi-step ReasonersCode1
Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics ModelsCode1
Zero-Shot Offline Imitation Learning via Optimal TransportCode1
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter EfficientCode1
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model AgentsCode1
PoisonBench: Assessing Large Language Model Vulnerability to Data PoisoningCode1
A foundation model for generalizable disease diagnosis in chest X-ray imagesCode1
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion PredictionCode1
Recovering complex ecological dynamics from time series using state-space universal dynamic equationsCode1
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object DetectionCode1
CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack SegmentationCode1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningCode1
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language ModelsCode1
Executing Arithmetic: Fine-Tuning Large Language Models as Turing MachinesCode1
Multi-Agent Collaborative Data Selection for Efficient LLM PretrainingCode1
SPA: 3D Spatial-Awareness Enables Effective Embodied RepresentationCode1
Show:102550
← PrevPage 411 of 9486Next →