SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1140111450 of 661570 papers

TitleStatusHype
To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning AccelerationCode2
CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \10,000 Budget; An Extra \4,000 Unlocks 81.8% AccuracyCode2
CellViT: Vision Transformers for Precise Cell Segmentation and ClassificationCode2
PMaF: Deep Declarative Layers for Principal Matrix FeaturesCode2
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species GenomeCode2
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion ModelsCode2
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction TuningCode2
RVT: Robotic View Transformer for 3D Object ManipulationCode2
MedLSAM: Localize and Segment Anything Model for 3D CT ImagesCode2
InterCode: Standardizing and Benchmarking Interactive Coding with Execution FeedbackCode2
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language ModelsCode2
ToolQA: A Dataset for LLM Question Answering with External ToolsCode2
OpenMask3D: Open-Vocabulary 3D Instance SegmentationCode2
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language ModelsCode2
3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentationCode2
Maintaining Plasticity in Deep Continual LearningCode2
3D Reconstruction of Spherical Images based on Incremental Structure from MotionCode2
From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of ThoughtCode2
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration ModelCode2
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph TransformerCode2
PyKoopman: A Python Package for Data-Driven Approximation of the Koopman OperatorCode2
PromptIR: Prompting for All-in-One Blind Image RestorationCode2
Visual Adversarial Examples Jailbreak Aligned Large Language ModelsCode2
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text DocumentsCode2
EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree RepresentationsCode2
SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense ReasoningCode2
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPTCode2
RoMe: Towards Large Scale Road Surface Reconstruction via Mesh RepresentationCode2
PyRCA: A Library for Metric-based Root Cause AnalysisCode2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote SensingCode2
Multi-Fidelity Active Learning with GFlowNetsCode2
A Simple and Effective Pruning Approach for Large Language ModelsCode2
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph MatchingCode2
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
SGFormer: Simplifying and Empowering Transformers for Large-Graph RepresentationsCode2
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language ModelsCode2
RemoteCLIP: A Vision Language Foundation Model for Remote SensingCode2
OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender SystemsCode2
Guiding Language Models of Code with Global Context using MonitorsCode2
QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory PredictionCode2
MachMap: End-to-End Vectorized Solution for Compact HD-Map ConstructionCode2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly DetectionCode2
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image EditingCode2
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image ClassificationCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAXCode2
Full Parameter Fine-tuning for Large Language Models with Limited ResourcesCode2
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and ProspectsCode2
The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving ChallengeCode2
RED^ FM: a Filtered and Multilingual Relation Extraction DatasetCode2
Show:102550
← PrevPage 229 of 13232Next →