SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,984 papers248,105 code links4,818 tasks

Papers

Showing 24012450 of 177340 papers

TitleStatusHype
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesCode3
Bidirectional Multi-Scale Implicit Neural Representations for Image DerainingCode3
Accelerating Transformer Inference for Translation via Parallel DecodingCode3
DiM: Diffusion Mamba for Efficient High-Resolution Image SynthesisCode3
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and EditingCode3
ConceptAttention: Diffusion Transformers Learn Highly Interpretable FeaturesCode3
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
TAPIP3D: Tracking Any Point in Persistent 3D GeometryCode3
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent EvaluationCode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
LLMmap: Fingerprinting For Large Language ModelsCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic ThinkingCode3
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution FeedbackCode3
MagicPIG: LSH Sampling for Efficient LLM GenerationCode3
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMsCode3
Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-TaskCode3
What Language Model to Train if You Have One Million GPU Hours?Code3
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion ModelCode3
An Evolved Universal Transformer MemoryCode3
Instruct-IPT: All-in-One Image Processing Transformer via Weight ModulationCode3
DFormerv2: Geometry Self-Attention for RGBD Semantic SegmentationCode3
SegFormer3D: an Efficient Transformer for 3D Medical Image SegmentationCode3
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-ResolutionCode3
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few LabelsCode3
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by StepCode3
Diffusion Feedback Helps CLIP See BetterCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
CAX: Cellular Automata Accelerated in JAXCode3
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?Code3
Anything-3D: Towards Single-view Anything Reconstruction in the WildCode3
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking PortraitCode3
Simplifying Deep Temporal Difference LearningCode3
GFM-RAG: Graph Foundation Model for Retrieval Augmented GenerationCode3
XAttention: Block Sparse Attention with Antidiagonal ScoringCode3
4M: Massively Multimodal Masked ModelingCode3
Unifying Flow, Stereo and Depth EstimationCode3
EgoLife: Towards Egocentric Life AssistantCode3
AlpacaFarm: A Simulation Framework for Methods that Learn from Human FeedbackCode3
Planning with Diffusion for Flexible Behavior SynthesisCode3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUsCode3
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMsCode3
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation ModelsCode3
BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and AlignmentCode3
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
Data Engineering for Scaling Language Models to 128K ContextCode3
A Multiscale Visualization of Attention in the Transformer ModelCode3
Beyond A*: Better Planning with Transformers via Search Dynamics BootstrappingCode3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object DetectionCode3
Streaming Deep Reinforcement Learning Finally WorksCode3
Show:102550
← PrevPage 49 of 3547Next →