SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 91519200 of 661570 papers

TitleStatusHype
Nemo: First Glimpse of a New Rule EngineCode2
Softpick: No Attention Sink, No Massive Activations with Rectified SoftmaxCode2
Interpretability at Scale: Identifying Causal Mechanisms in AlpacaCode2
BEVCar: Camera-Radar Fusion for BEV Map and Object SegmentationCode2
Point Segment and Count: A Generalized Framework for Object CountingCode2
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language ModelsCode2
QQQ: Quality Quattuor-Bit Quantization for Large Language ModelsCode2
CV-Cities: Advancing Cross-View Geo-Localization in Global CitiesCode2
A Unified Framework for 3D Scene UnderstandingCode2
Differentiable Reward Optimization for LLM based TTS systemCode2
SF-V: Single Forward Video Generation ModelCode2
Graph Neural Networks in TensorFlow and Keras with SpektralCode2
ArtGS: Building Interactable Replicas of Complex Articulated Objects via Gaussian SplattingCode2
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training with Auto-ParallelismCode2
Rank-based Non-dominated SortingCode2
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video AnomalyCode2
Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product NetworksCode2
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point CloudsCode2
Torch-Struct: Deep Structured Prediction LibraryCode2
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face SwappingCode2
Curriculum Learning for ab initio Deep Learned Refractive OpticsCode2
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative TasksCode2
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase SpectraCode2
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation frameworkCode2
Revisiting Scene Text Recognition: A Data PerspectiveCode2
Spanish Pre-trained BERT Model and Evaluation DataCode2
Large Language Models for Information Retrieval: A SurveyCode2
Simplifying Paragraph-level Question Generation via Transformer Language ModelsCode2
Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing ImageryCode2
OGNI-DC: Robust Depth Completion with Optimization-Guided Neural IterationsCode2
Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet LearntCode2
Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D ConvolutionsCode2
Scalable 3D Captioning with Pretrained ModelsCode2
Open High-Resolution Satellite Imagery: The WorldStrat Dataset -- With Application to Super-ResolutionCode2
Mastering Atari, Go, Chess and Shogi by Planning with a Learned ModelCode2
YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information ExtractionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning EngineeringCode2
Rényi Differential Privacy of the Sampled Gaussian MechanismCode2
AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMsCode2
A Temporal Kolmogorov-Arnold Transformer for Time Series ForecastingCode2
Block Transformer: Global-to-Local Language Modeling for Fast InferenceCode2
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object DetectionCode2
YUAN 2.0: A Large Language Model with Localized Filtering-based AttentionCode2
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and TrainingCode2
Decomposition Betters Tracking Everything EverywhereCode2
Kani: A Lightweight and Highly Hackable Framework for Building Language Model ApplicationsCode2
Multimodal Prototyping for cancer survival predictionCode2
WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question AnsweringCode2
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLPCode2
Show:102550
← PrevPage 184 of 13232Next →