SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11151–11200 of 661,570 papers

Title | Status | Hype
WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models | Code | 2
SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression | Code | 2
Giraffe: Adventures in Expanding Context Lengths in LLMs | Code | 2
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding | Code | 2
Texture Generation on 3D Meshes with Point-UV Diffusion | Code | 2
Turning a CLIP Model into a Scene Text Spotter | Code | 2
STAEformer: Spatio-Temporal Adaptive Embedding Makes Vanilla Transformer SOTA for Traffic Forecasting | Code | 2
Towards Real-World Visual Tracking with Temporal Contexts | Code | 2
ExpeL: LLM Agents Are Experiential Learners | Code | 2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders | Code | 2
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions | Code | 2
DiffusionTrack: Diffusion Model For Multi-Object Tracking | Code | 2
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models | Code | 2
SwinJSCC: Taming Swin Transformer for Deep Joint Source-Channel Coding | Code | 2
Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey | Code | 2
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization | Code | 2
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos | Code | 2
LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis | Code | 2
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement | Code | 2
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes | Code | 2
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Code | 2
CMB: A Comprehensive Medical Benchmark in Chinese | Code | 2
DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Code | 2
DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory | Code | 2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Code | 2
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Code | 2
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification | Code | 2
ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection | Code | 2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation | Code | 2
RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs | Code | 2
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate | Code | 2
S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields | Code | 2
Global Features are All You Need for Image Retrieval and Reranking | Code | 2
Platypus: Quick, Cheap, and Powerful Refinement of LLMs | Code | 2
Bayesian Flow Networks | Code | 2
Machine Unlearning: Solutions and Challenges | Code | 2
Large Language Models for Information Retrieval: A Survey | Code | 2
The Sound Demixing Challenge 2023 – Music Demixing Track | Code | 2
Language is All a Graph Needs | Code | 2
EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce | Code | 2
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models | Code | 2
AerialVLN: Vision-and-Language Navigation for UAVs | Code | 2
Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks | Code | 2
A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Code | 2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Code | 2
Tiny and Efficient Model for the Edge Detection Generalization | Code | 2
Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow | Code | 2
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents | Code | 2
Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion | Code | 2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Code | 2
Page 224 of 13232