SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11051–11100 of 661570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite | Code | 2 |
| Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context | Code | 2 |
| Optimization of Rank Losses for Image Retrieval | Code | 2 |
| PromptASR for contextualized ASR with controllable style | Code | 2 |
| FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec | Code | 2 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Code | 2 |
| VerilogEval: Evaluating Large Language Models for Verilog Code Generation | Code | 2 |
| Generative Image Dynamics | Code | 2 |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Code | 2 |
| Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting | Code | 2 |
| PILOT: A Pre-Trained Model-Based Continual Learning Toolbox | Code | 2 |
| SafetyBench: Evaluating the Safety of Large Language Models | Code | 2 |
| CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics | Code | 2 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Code | 2 |
| Commands as AI Conversations | Code | 2 |
| Temporal Action Localization with Enhanced Instant Discriminability | Code | 2 |
| Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Code | 2 |
| ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation | Code | 2 |
| MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning | Code | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Code | 2 |
| UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | Code | 2 |
| A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand prediction | Code | 2 |
| Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation | Code | 2 |
| VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching | Code | 2 |
| Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Code | 2 |
| InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Code | 2 |
| A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation | Code | 2 |
| XGen-7B Technical Report | Code | 2 |
| DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Code | 2 |
| PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips | Code | 2 |
| Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models | Code | 2 |
| GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models | Code | 2 |
| BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network | Code | 2 |
| Automated Bioinformatics Analysis via AutoBA | Code | 2 |
| GPT Can Solve Mathematical Problems Without a Calculator | Code | 2 |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Code | 2 |
| Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network Analysis | Code | 2 |
| GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction | Code | 2 |
| Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning | Code | 2 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Code | 2 |
| Relay Diffusion: Unifying diffusion process across resolutions for image synthesis | Code | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Code | 2 |
| NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Code | 2 |
| Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images | Code | 2 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Code | 2 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Code | 2 |
| RevColV2: Exploring Disentangled Representations in Masked Image Modeling | Code | 2 |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Code | 2 |
| OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation | Code | 2 |
| Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Code | 2 |
Page 222 of 13232