SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1070110750 of 661570 papers

TitleStatusHype
A Survey on Multimodal Large Language Models for Autonomous DrivingCode2
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose EstimationCode2
Sparse4D v3: Advancing End-to-End 3D Detection and TrackingCode2
System 2 Attention (is something you might need too)Code2
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language AgentsCode2
GPQA: A Graduate-Level Google-Proof Q&A BenchmarkCode2
Fast Inner-Product Algorithms and Architectures for Deep Neural Network AcceleratorsCode2
Meta Prompting for AI SystemsCode2
LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score MatchingCode2
Open-Vocabulary Camouflaged Object SegmentationCode2
Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-makingCode2
An Embodied Generalist Agent in 3D WorldCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
SUQL: Conversational Search over Structured and Unstructured Data with Large Language ModelsCode2
The Chosen One: Consistent Characters in Text-to-Image Diffusion ModelsCode2
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level CodeCode2
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix MultiplicationCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
MedAgents: Large Language Models as Collaborators for Zero-shot Medical ReasoningCode2
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMsCode2
GEO: Generative Engine OptimizationCode2
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation SystemsCode2
Exponentially Faster Language ModellingCode2
FastBlend: a Powerful Model-Free Toolkit Making Video Stylization EasierCode2
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction TuningCode2
Striped Attention: Faster Ring Attention for Causal TransformersCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
Adapting Large Language Models by Integrating Collaborative Semantics for RecommendationCode2
Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional TrainingCode2
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional TransformerCode2
Mustango: Toward Controllable Text-to-Music GenerationCode2
REST: Retrieval-Based Speculative DecodingCode2
MeLo: Low-rank Adaptation is Better than Fine-tuning for Medical Image DiagnosisCode2
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame InterpolationCode2
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video UnderstandingCode2
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers FasterCode2
Learning to Filter Context for Retrieval-Augmented GenerationCode2
To See is to Believe: Prompting GPT-4V for Better Visual Instruction TuningCode2
Neural General Circulation Models for Weather and ClimateCode2
SpectralGPT: Spectral Remote Sensing Foundation ModelCode2
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation ModelsCode2
LayoutPrompter: Awaken the Design Ability of Large Language ModelsCode2
Tamil-Llama: A New Tamil Language Model Based on Llama 2Code2
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor CoresCode2
FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph PerspectiveCode2
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt CalibrationCode2
Frequency-domain MLPs are More Effective Learners in Time Series ForecastingCode2
High-dimensional mixed-categorical Gaussian processes with application to multidisciplinary design optimization for a green aircraftCode2
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction ModelCode2
EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road AutonomyCode2
Show:102550
← PrevPage 215 of 13232Next →