SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 251–300 of 474,278 papers

| Title | Status | Hype |
| --- | --- | --- |
| Robust Inverse Graphics via Probabilistic Inference | Code | 7 |
| LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Code | 7 |
| From Bytes to Ideas: Language Modeling with Autoregressive U-Nets | Code | 7 |
| Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Code | 7 |
| PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Code | 7 |
| 2D Gaussian Splatting for Geometrically Accurate Radiance Fields | Code | 7 |
| MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | Code | 7 |
| MoE-LLaVA: Mixture of Experts for Large Vision-Language Models | Code | 7 |
| LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds | Code | 7 |
| Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Code | 7 |
| Dynamic data sampler for cross-language transfer learning in large language models | Code | 7 |
| CALE: Continuous Arcade Learning Environment | Code | 7 |
| LLaMA: Open and Efficient Foundation Language Models | Code | 7 |
| FourierKAN outperforms MLP on Text Classification Head Fine-tuning | Code | 7 |
| Prometheus: Inducing Fine-grained Evaluation Capability in Language Models | Code | 7 |
| Domain Expansion of Image Generators | Code | 7 |
| OmniGen: Unified Image Generation | Code | 7 |
| Fast Timing-Conditioned Latent Audio Diffusion | Code | 7 |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Code | 7 |
| PuLID: Pure and Lightning ID Customization via Contrastive Alignment | Code | 7 |
| Byte Latent Transformer: Patches Scale Better Than Tokens | Code | 7 |
| EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation | Code | 7 |
| OmniGen2: Exploration to Advanced Multimodal Generation | Code | 7 |
| xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism | Code | 7 |
| Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Code | 7 |
| GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning | Code | 7 |
| Gravity-aligned Rotation Averaging with Circular Regression | Code | 7 |
| Full Scaling Automation for Sustainable Development of Green Data Centers | Code | 7 |
| Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning | Code | 7 |
| HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer | Code | 7 |
| LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Code | 7 |
| Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Code | 7 |
| LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset | Code | 7 |
| Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning | Code | 7 |
| MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains | Code | 7 |
| xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference | Code | 7 |
| SageAttention2++: A More Efficient Implementation of SageAttention2 | Code | 7 |
| NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Code | 7 |
| Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Code | 7 |
| PowerPM: Foundation Model for Power Systems | Code | 7 |
| SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | Code | 7 |
| Open Deep Search: Democratizing Search with Open-source Reasoning Agents | Code | 7 |
| TextGrad: Automatic "Differentiation" via Text | Code | 7 |
| X-MeshGraphNet: Scalable Multi-Scale Graph Neural Networks for Physics Simulation | Code | 7 |
| ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval | Code | 7 |
| CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding Preferences | Code | 7 |
| VMamba: Visual State Space Model | Code | 7 |
| In-Context LoRA for Diffusion Transformers | Code | 7 |
| Rethinking the Sample Relations for Few-Shot Classification | Code | 7 |
| xLSTM: Extended Long Short-Term Memory | Code | 7 |
Page 6 of 9,486