SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 42514300 of 661570 papers

TitleStatusHype
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud LearningCode3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian SplattingCode3
GraphStorm: all-in-one graph machine learning framework for industry applicationsCode3
TokenPacker: Efficient Visual Projector for Multimodal LLMCode3
WeatherMesh-3: Fast and accurate operational global weather forecastingCode3
NdLinear Is All You Need for Representation LearningCode3
Bake off redux: a review and experimental evaluation of recent time series classification algorithmsCode3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic RepresentationCode3
CameraHMR: Aligning People with PerspectiveCode3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Rainbow: Combining Improvements in Deep Reinforcement LearningCode3
Mambular: A Sequential Model for Tabular Deep LearningCode3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference OptimizationCode3
WHAC: World-grounded Humans and CamerasCode3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic EvaluationsCode3
Generative AI Act II: Test Time Scaling Drives Cognition EngineeringCode3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language ModelsCode3
Cognify: Supercharging Gen-AI Workflows With Hierarchical AutotuningCode3
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AICode3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI AgentsCode3
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language ModelsCode3
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation ModelsCode3
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language ModelsCode3
Chain of Draft: Thinking Faster by Writing LessCode3
Data Augmentation for Sequential Recommendation: A SurveyCode3
Programming Every Example: Lifting Pre-training Data Quality like Experts at ScaleCode3
MLVU: Benchmarking Multi-task Long Video UnderstandingCode3
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image RecognitionCode3
ECON: Explicit Clothed humans Optimized via Normal integrationCode3
Partially Rewriting a Transformer in Natural LanguageCode3
A Clean Slate for Offline Reinforcement LearningCode3
MarioGPT: Open-Ended Text2Level Generation through Large Language ModelsCode3
PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural MapCode3
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language ModelsCode3
OS-ATLAS: A Foundation Action Model for Generalist GUI AgentsCode3
HadaCore: Tensor Core Accelerated Hadamard Transform KernelCode3
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token RecyclingCode3
Description Boosting for Zero-Shot Entity and Relation ClassificationCode3
LibCity: A Unified Library Towards Efficient and Comprehensive Urban Spatial-Temporal PredictionCode3
Bird-Eye Transformers for Text Generation ModelsCode3
Lightplane: Highly-Scalable Components for Neural 3D FieldsCode3
Apollo: Band-sequence Modeling for High-Quality Audio RestorationCode3
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement LearningCode3
Image Quality Assessment for Magnetic Resonance ImagingCode3
RoadBEV: Road Surface Reconstruction in Bird's Eye ViewCode3
MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the MetaverseCode3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning TasksCode3
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory ModelCode3
Show:102550
← PrevPage 86 of 13232Next →