SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 42014250 of 661570 papers

TitleStatusHype
Unified Data Management and Comprehensive Performance Evaluation for Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark]Code3
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMsCode3
StableVideo: Text-driven Consistency-aware Diffusion Video EditingCode3
OctoPack: Instruction Tuning Code Large Language ModelsCode3
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language ModelsCode3
ModelScope Text-to-Video Technical ReportCode3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map ConstructionCode3
Separate Anything You DescribeCode3
On the use of deep learning for phase recoveryCode3
Causal-learn: Causal Discovery in PythonCode3
Evaluating Large Language Models for Radiology Natural Language ProcessingCode3
WebArena: A Realistic Web Environment for Building Autonomous AgentsCode3
3D-LLM: Injecting the 3D World into Large Language ModelsCode3
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual ShiftingCode3
Meta-Transformer: A Unified Framework for Multimodal LearningCode3
TokenFlow: Consistent Diffusion Features for Consistent Video EditingCode3
Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait SynthesisCode3
RepViT: Revisiting Mobile CNN From ViT PerspectiveCode3
Retentive Network: A Successor to Transformer for Large Language ModelsCode3
Secrets of RLHF in Large Language Models Part I: PPOCode3
Objaverse-XL: A Universe of 10M+ 3D ObjectsCode3
Emu: Generative Pretraining in MultimodalityCode3
SVIT: Scaling up Visual Instruction TuningCode3
Focused Transformer: Contrastive Training for Context ScalingCode3
A Survey on Evaluation of Large Language ModelsCode3
OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained ModelsCode3
DeepfakeBench: A Comprehensive Benchmark of Deepfake DetectionCode3
Segment Anything Meets Point TrackingCode3
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal ReasoningCode3
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion PriorsCode3
DisCo: Disentangled Control for Realistic Human Dance GenerationCode3
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape OptimizationCode3
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image EditingCode3
MotionGPT: Human Motion as a Foreign LanguageCode3
ViNT: A Foundation Model for Visual NavigationCode3
Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputsCode3
Opportunities and Risks of LLMs for Scalable Deliberation with PolisCode3
GlyphNet: Homoglyph domains dataset and detection using attention-based Convolutional Neural NetworksCode3
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text IntegrationCode3
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human PreferencesCode3
Data-Copilot: Bridging Billions of Data and Humans with Autonomous WorkflowCode3
High-Fidelity Audio Compression with Improved RVQGANCode3
Interpretable Differencing of Machine Learning ModelsCode3
How Can Recommender Systems Benefit from Large Language Models: A SurveyCode3
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language ModelsCode3
Designing a Better Asymmetric VQGAN for StableDiffusionCode3
SAM3D: Segment Anything in 3D ScenesCode3
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot LearningCode3
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D EnvironmentsCode3
Show:102550
← PrevPage 85 of 13232Next →