SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 97519800 of 177340 papers

TitleStatusHype
Autoregressive Visual TrackingCode2
OpenCOLE: Towards Reproducible Automatic Graphic Design GenerationCode2
COSMIC: COmmonSense knowledge for eMotion Identification in ConversationsCode2
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical InformationCode2
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language ModelsCode2
Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel FusionCode2
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future DirectionsCode2
Lossless Image Compression through Super-ResolutionCode2
DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based RefinementCode2
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion ModelsCode2
CLIP-CLOP: CLIP-Guided Collage and PhotomontageCode2
A Survey on In-context LearningCode2
Learn to Reason Efficiently with Adaptive Length-based Reward ShapingCode2
Using Large Language Models to Tackle Fundamental Challenges in Graph Learning: A Comprehensive SurveyCode2
Spiking Transformers Need High Frequency InformationCode2
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization ChallengesCode2
ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image ExplorationCode2
COVID-19 Image Data CollectionCode2
Against The Achilles' Heel: A Survey on Red Teaming for Generative ModelsCode2
OmniCaptioner: One Captioner to Rule Them AllCode2
DeepDTA: Deep Drug-Target Binding Affinity PredictionCode2
Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality AssessmentCode2
Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series ForecastingCode2
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion ModelsCode2
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level RecognitionCode2
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-TrainingCode2
Generative Diffusion-based Downscaling for ClimateCode2
MambaVC: Learned Visual Compression with Selective State SpacesCode2
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter AircraftsCode2
Matte Anything: Interactive Natural Image Matting with Segment Anything ModelsCode2
AmadeusGPT: a natural language interface for interactive animal behavioral analysisCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared TaskCode2
The Neural Hype and Comparisons Against Weak BaselinesCode2
Residual Quantization with Implicit Neural CodebooksCode2
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement TasksCode2
AIM 2020 Challenge on Efficient Super-Resolution: Methods and ResultsCode2
User Behavior Simulation with Large Language Model based AgentsCode2
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and FutureCode2
Nemo: First Glimpse of a New Rule EngineCode2
Softpick: No Attention Sink, No Massive Activations with Rectified SoftmaxCode2
Interpretability at Scale: Identifying Causal Mechanisms in AlpacaCode2
BEVCar: Camera-Radar Fusion for BEV Map and Object SegmentationCode2
Point Segment and Count: A Generalized Framework for Object CountingCode2
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language ModelsCode2
QQQ: Quality Quattuor-Bit Quantization for Large Language ModelsCode2
CV-Cities: Advancing Cross-View Geo-Localization in Global CitiesCode2
A Unified Framework for 3D Scene UnderstandingCode2
Differentiable Reward Optimization for LLM based TTS systemCode2
SF-V: Single Forward Video Generation ModelCode2
Show:102550
← PrevPage 196 of 3547Next →