SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1110111150 of 661570 papers

TitleStatusHype
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language VariantsCode2
Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image SegmentationCode2
Accurate Computation of Quantum Excited States with Neural NetworksCode2
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed DiffusionCode2
PointLLM: Empowering Large Language Models to Understand Point CloudsCode2
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 LanguagesCode2
MVDream: Multi-view Diffusion for 3D GenerationCode2
PivotNet: Vectorized Pivot Learning for End-to-end HD Map ConstructionCode2
GREC: Generalized Referring Expression ComprehensionCode2
DTrOCR: Decoder-only Transformer for Optical Character RecognitionCode2
LLaSM: Large Language and Speech ModelCode2
Nemo: First Glimpse of a New Rule EngineCode2
WeatherBench 2: A benchmark for the next generation of data-driven global weather modelsCode2
AutoDroid: LLM-powered Task Automation in AndroidCode2
When Do Program-of-Thoughts Work for Reasoning?Code2
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUsCode2
Text-to-SQL Empowered by Large Language Models: A Benchmark EvaluationCode2
Fast Feedforward NetworksCode2
Graph Meets LLMs: Towards Large Graph ModelsCode2
DISC-MedLLM: Bridging General Large Language Models and Real-World Medical ConsultationCode2
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing NetCode2
The DiffuseStyleGesture+ entry to the GENEA Challenge 2023Code2
Residual Denoising Diffusion ModelsCode2
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMsCode2
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language ModelsCode2
DARWIN Series: Domain Specific Large Language Models for Natural ScienceCode2
The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settingsCode2
FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRICode2
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image TranslationCode2
NeO 360: Neural Fields for Sparse View Synthesis of Outdoor ScenesCode2
Dense Text-to-Image Generation with Attention ModulationCode2
BridgeData V2: A Dataset for Robot Learning at ScaleCode2
Motion In-Betweening with Phase ManifoldsCode2
WavMark: Watermarking for Audio GenerationCode2
StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map ConstructionCode2
Topical-Chat: Towards Knowledge-Grounded Open-Domain ConversationsCode2
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction TuningCode2
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable DiffusionCode2
Knowledge Graph Prompting for Multi-Document Question AnsweringCode2
IT3D: Improved Text-to-3D Generation with Explicit View SynthesisCode2
G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid ModelCode2
SONAR: Sentence-Level Multimodal and Language-Agnostic RepresentationsCode2
VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly DetectionCode2
SeamlessM4T: Massively Multilingual & Multimodal Machine TranslationCode2
TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse ScenesCode2
ScanNet++: A High-Fidelity Dataset of 3D Indoor ScenesCode2
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and CaptioningCode2
Giraffe: Adventures in Expanding Context Lengths in LLMsCode2
Turning a CLIP Model into a Scene Text SpotterCode2
Show:102550
← PrevPage 223 of 13232Next →