SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8901–8925 of 474278 papers

Title | Status | Hype
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models | Code | 2
Automating the Enterprise with Foundation Models | Code | 2
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Code | 2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints | Code | 2
Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Code | 2
MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Code | 2
Multi-Space Alignments Towards Universal LiDAR Segmentation | Code | 2
Benchmarking Representations for Speech, Music, and Acoustic Events | Code | 2
SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising | Code | 2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | Code | 2
EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Code | 2
SATO: Stable Text-to-Motion Framework | Code | 2
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network Potentials | Code | 2
LocInv: Localization-aware Inversion for Text-Guided Image Editing | Code | 2
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Code | 2
ASAM: Boosting Segment Anything Model with Adversarial Tuning | Code | 2
TFPred: Learning Discriminative Representations from Unlabeled Data for Few-Label Rotating Machinery Fault Diagnosis | Code | 2
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey | Code | 2
Causal Evaluation of Language Models | Code | 2
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting | Code | 2
Toward Unified Practices in Trajectory Prediction Research on Bird's-Eye-View Datasets | Code | 2
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | Code | 2
GraCo: Granularity-Controllable Interactive Segmentation | Code | 2
Spectrally Pruned Gaussian Fields with Neural Compensation | Code | 2
Page 357 of 18972