SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 72017250 of 661570 papers

TitleStatusHype
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language ModelsCode2
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)Code2
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-ExpertsCode2
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to AutomationCode2
Q-VLM: Post-training Quantization for Large Vision-Language ModelsCode2
Window Function-less DFT with Reduced Noise and Latency for Real-Time Music AnalysisCode2
Interactive4D: Interactive 4D LiDAR SegmentationCode2
Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian SplattingCode2
Enhancing Soccer Camera Calibration Through Keypoint ExploitationCode2
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and TrainingCode2
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win RatesCode2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and BeyondCode2
Compositional Entailment Learning for Hyperbolic Vision-Language ModelsCode2
CursorCore: Assist Programming through Aligning AnythingCode2
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration RateCode2
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language ModelsCode2
Towards Interpreting Visual Information Processing in Vision-Language ModelsCode2
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference AccelerationCode2
LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor ExtractionCode2
An Undetectable Watermark for Generative Image ModelsCode2
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient AttentionsCode2
Sylber: Syllabic Embedding Representation of Speech from Raw AudioCode2
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision TransformersCode2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesCode2
Towards Natural Image Matting in the Wild via Real-Scenario PriorCode2
MatMamba: A Matryoshka State Space ModelCode2
ReFIR: Grounding Large Restoration Models with Retrieval AugmentationCode2
FedGraph: A Research Library and Benchmark for Federated Graph LearningCode2
Think While You Generate: Discrete Diffusion with Planned DenoisingCode2
TRACE: Temporal Grounding Video LLM via Causal Event ModelingCode2
DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic StatesCode2
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation DataCode2
LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge GraphsCode2
Large Continual Instruction AssistantCode2
Motion Forecasting in Continuous DrivingCode2
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal ModelCode2
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing ImagesCode2
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video GenerationCode2
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-ThoughtCode2
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to SeeCode2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains MoreCode2
LeanAgent: Lifelong Learning for Formal Theorem ProvingCode2
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile ManipulationCode2
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse SamplingCode2
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View SynthesisCode2
Causal Context Adjustment Loss for Learned Image CompressionCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian SplattingCode2
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNetCode2
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse AttentionCode2
Show:102550
← PrevPage 145 of 13232Next →