SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 11351-11400 of 661570 papers

Title | Status | Hype
Text-Guided Synthesis of Eulerian Cinemagraphs | Code | 2
Lost in the Middle: How Language Models Use Long Contexts | Code | 2
FITS: Modeling Time Series with 10k Parameters | Code | 2
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Code | 2
Building Cooperative Embodied Agents Modularly with Large Language Models | Code | 2
NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023 | Code | 2
Evaluating AI systems under uncertain ground truth: a case study in dermatology | Code | 2
tsdownsample: high-performance time series downsampling for scalable visualization | Code | 2
EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models | Code | 2
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? | Code | 2
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation | Code | 2
Empirical Sample Complexity of Neural Network Mixed State Reconstruction | Code | 2
Spike-driven Transformer | Code | 2
ClimateLearn: Benchmarking Machine Learning for Weather and Climate Modeling | Code | 2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation | Code | 2
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis | Code | 2
Temporal Graph Benchmark for Machine Learning on Temporal Graphs | Code | 2
SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions | Code | 2
Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset | Code | 2
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion | Code | 2
Hierarchical Open-vocabulary Universal Image Segmentation | Code | 2
JourneyDB: A Benchmark for Generative Image Understanding | Code | 2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Code | 2
Numerical Association Rule Mining: A Systematic Literature Review | Code | 2
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer | Code | 2
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation | Code | 2
Provable Robust Watermarking for AI-Generated Text | Code | 2
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying | Code | 2
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation | Code | 2
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset | Code | 2
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals | Code | 2
Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train | Code | 2
Towards Zero-Shot Scale-Aware Monocular Depth Estimation | Code | 2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Code | 2
BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Code | 2
SkiROS2: A skill-based Robot Control Platform for ROS | Code | 2
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding | Code | 2
Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio | Code | 2
RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model | Code | 2
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language | Code | 2
Towards Open Vocabulary Learning: A Survey | Code | 2
BayesFlow: Amortized Bayesian Workflows With Neural Networks | Code | 2
cuSLINK: Single-linkage Agglomerative Clustering on the GPU | Code | 2
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning | Code | 2
PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Code | 2
Detector-Free Structure from Motion | Code | 2
Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic | Code | 2
CellViT: Vision Transformers for Precise Cell Segmentation and Classification | Code | 2
HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution | Code | 2
Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System | Code | 2