SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6451-6475 of 474,278 papers

Title | Status | Hype
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Code | 2
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models | Code | 2
Edicho: Consistent Image Editing in the Wild | Code | 2
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition | Code | 2
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks | Code | 2
Natural Language Fine-Tuning | Code | 2
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking | Code | 2
From Generalist to Specialist: A Survey of Large Language Models for Chemistry | Code | 2
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System | Code | 2
GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting | Code | 2
DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis | Code | 2
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration | Code | 2
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Code | 2
MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Code | 2
RecLM: Recommendation Instruction Tuning | Code | 2
SUTrack: Towards Simple and Unified Single Object Tracking | Code | 2
ETTA: Elucidating the Design Space of Text-to-Audio Models | Code | 2
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment | Code | 2
CGCOD: Class-Guided Camouflaged Object Detection | Code | 2
Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras with Human Semantics | Code | 2
WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting | Code | 2
Token-Budget-Aware LLM Reasoning | Code | 2
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Code | 2
Long-Form Speech Generation with Spoken Language Models | Code | 2
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Code | 2