SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7901-7925 of 474,278 papers

Title | Status | Hype
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems | - | 0
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity | - | 0
Kimi Linear: An Expressive, Efficient Attention Architecture | - | 0
Reject Only Critical Tokens: Pivot-Aware Speculative Decoding | Code | 0
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models | - | 0
GDROS: A Geometry-Guided Dense Registration Framework for Optical-SAR Images under Large Geometric Transformations | Code | 0
Chain of Retrieval: Multi-Aspect Iterative Search Expansion and Post-Order Search Aggregation for Full Paper Retrieval | Code | 0
RL Fine-Tuning Heals OOD Forgetting in SFT | Code | 0
Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling | Code | 0
Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack | Code | 0
Enhancing Heavy Rain Nowcasting with Multimodal Data: Integrating Radar and Satellite Observations | Code | 0
Emotion Detection in Speech Using Lightweight and Transformer-Based Models: A Comparative and Ablation Study | Code | 0
MIFO: Learning and Synthesizing Multi-Instance from One Image | Code | 0
MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools | Code | 0
OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data | Code | 0
PADBen: A Comprehensive Benchmark for Evaluating AI Text Detectors Against Paraphrase Attacks | Code | 0
ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training | Code | 0
Why Federated Optimization Fails to Achieve Perfect Fitting? A Theoretical Perspective on Client-Side Optima | Code | 0
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts | Code | 0
ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models | Code | 0
Three-dimensional narrow volume reconstruction method with unconditional stability based on a phase-field Lagrange multiplier approach | Code | 0
Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges | Code | 0
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models | - | 0
ConnectomeBench: Can LLMs Proofread the Connectome? | Code | 0
H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models | Code | 0
Page 317 of 18,972