SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 99019925 of 474278 papers

TitleStatusHype
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A BenchmarkCode2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge ComputingCode2
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object DetectionCode2
Aligning Modalities in Vision Large Language Models via Preference Fine-tuningCode2
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data VisualizationCode2
Neighborhood-Enhanced Supervised Contrastive Learning for Collaborative FilteringCode2
3D Point Cloud Compression with Recurrent Neural Network and Image Compression MethodsCode2
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal ReasoningCode2
Continual Learning on Graphs: Challenges, Solutions, and OpportunitiesCode2
Centroid-Based Efficient Minimum Bayes Risk DecodingCode2
Optimizing tiny colorless feedback delay networksCode2
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based AgentsCode2
Beyond Generalization: A Survey of Out-Of-Distribution Adaptation on GraphsCode2
PEDANTS: Cheap but Effective and Interpretable Answer EquivalenceCode2
CoLLaVO: Crayon Large Language and Vision mOdelCode2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked InputsCode2
Do Llamas Work in English? On the Latent Language of Multilingual TransformersCode2
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation ModelsCode2
Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMsCode2
Incremental Sequence Labeling: A Tale of Two ShiftsCode2
ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity AlignmentCode2
Distillation Enhanced Generative RetrievalCode2
An end-to-end attention-based approach for learning on graphsCode2
When is Tree Search Useful for LLM Planning? It Depends on the DiscriminatorCode2
Large Language Models as Zero-shot Dialogue State Tracker through Function CallingCode2
Show:102550
← PrevPage 397 of 18972Next →