SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1125111300 of 661570 papers

TitleStatusHype
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-trainingCode2
An Unforgeable Publicly Verifiable Watermark for Large Language ModelsCode2
UnIVAL: Unified Model for Image, Video, Audio and Language TasksCode2
SEED-Bench: Benchmarking Multimodal LLMs with Generative ComprehensionCode2
Implicit Neural Representation in Medical Imaging: A Comparative SurveyCode2
XMem++: Production-level Video Segmentation From Few Annotated FramesCode2
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object TrackingCode2
Equivariance and partial observations in Koopman operator theory for partial differential equationsCode2
Scaling Data Generation in Vision-and-Language NavigationCode2
TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-ExpertsCode2
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlCode2
Widespread Flaws in Offline Evaluation of Recommender SystemsCode2
PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point TrackingCode2
The Effect of Third Party Implementations on ReproducibilityCode2
IML-ViT: Benchmarking Image Manipulation Localization by Vision TransformerCode2
Solving Data Quality Problems with Desbordante: a DemoCode2
Distilled Feature Fields Enable Few-Shot Language-Guided ManipulationCode2
Med-Flamingo: a Multimodal Medical Few-shot LearnerCode2
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth EstimationCode2
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous DrivingCode2
TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormerCode2
Generative AI for Medical Imaging: extending the MONAI FrameworkCode2
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object DetectionCode2
Three Bricks to Consolidate Watermarks for Large Language ModelsCode2
Hypergraph Isomorphism ComputationCode2
trajdata: A Unified Interface to Multiple Human Trajectory DatasetsCode2
Tracking Anything in High QualityCode2
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023Code2
WavJourney: Compositional Audio Creation with Large Language ModelsCode2
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA CompositionCode2
QuIP: 2-Bit Quantization of Large Language Models With GuaranteesCode2
Zshot: An Open-source Framework for Zero-Shot Named Entity Recognition and Relation ExtractionCode2
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain ScenariosCode2
Foundational Models Defining a New Era in Vision: A Survey and OutlookCode2
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image CompositionCode2
Aligning Large Language Models with Human: A SurveyCode2
Getting pwn'd by AI: Penetration Testing with Large Language ModelsCode2
COCO-O: A Benchmark for Object Detectors under Natural Distribution ShiftsCode2
A Systematic Survey of Prompt Engineering on Vision-Language Foundation ModelsCode2
Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPGCode2
A Simple and Model-Free Path Filtering Algorithm for Smoothing and AccuracyCode2
Pyramid Semantic Graph-based Global Point Cloud Registration with Low OverlapCode2
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural NetworksCode2
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series ForecastingCode2
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuningCode2
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained DiffusionCode2
CNOS: A Strong Baseline for CAD-based Novel Object SegmentationCode2
BlendFace: Re-designing Identity Encoders for Face-SwappingCode2
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill SetsCode2
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric RenderingCode2
Show:102550
← PrevPage 226 of 13232Next →