SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 38513900 of 659983 papers

TitleStatusHype
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
Simple linear attention language models balance the recall-throughput tradeoffCode3
Training-Free Long-Context Scaling of Large Language ModelsCode3
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence GenerationCode3
Explicit Interaction for Fusion-Based Place RecognitionCode3
VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image AnalysisCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
ShapeLLM: Universal 3D Object Understanding for Embodied InteractionCode3
Leveraging Enhanced Queries of Point Sets for Vectorized Map ConstructionCode3
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based ReasoningCode3
VastGaussian: Vast 3D Gaussians for Large Scene ReconstructionCode3
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell ModelingCode3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement LearningCode3
TOTEM: TOkenized Time Series EMbeddings for General Time Series AnalysisCode3
A Survey on Data Selection for Language ModelsCode3
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety DetectorsCode3
Why Transformers Need Adam: A Hessian PerspectiveCode3
ChatMusician: Understanding and Generating Music Intrinsically with LLMCode3
UrbanGPT: Spatio-Temporal Large Language ModelsCode3
Exploring gene content with pangene graphsCode3
Seamless Human Motion Composition with Blended Positional EncodingsCode3
State Space Models for Event CamerasCode3
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech ProcessingCode3
Genie: Generative Interactive EnvironmentsCode3
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene UnderstandingCode3
IEPile: Unearthing Large-Scale Schema-Based Information Extraction CorpusCode3
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single ShotCode3
OmniPred: Language Models as Universal RegressorsCode3
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein EmbeddingCode3
Cleaner Pretraining Corpus Curation with Neural Web ScrapingCode3
Towards Seamless Adaptation of Pre-trained Models for Visual Place RecognitionCode3
Beyond A*: Better Planning with Transformers via Search Dynamics BootstrappingCode3
Towards Building Multilingual Language Model for MedicineCode3
LongRoPE: Extending LLM Context Window Beyond 2 Million TokensCode3
Bench: Extending Long Context Evaluation Beyond 100K TokensCode3
Visual Style Prompting with Swapping Self-AttentionCode3
Video ReCap: Recursive Captioning of Hour-Long VideosCode3
TorchCP: A Python Library for Conformal PredictionCode3
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-PositiveCode3
Codec-SUPERB: An In-Depth Analysis of Sound Codec ModelsCode3
FiT: Flexible Vision Transformer for Diffusion ModelCode3
A Chinese Dataset for Evaluating the Safeguards in Large Language ModelsCode3
UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal PredictionCode3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image GenerationCode3
Language-Codec: Bridging Discrete Codec Representations and Speech Language ModelsCode3
Sequoia: Scalable, Robust, and Hardware-aware Speculative DecodingCode3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart ReasoningCode3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic EvaluationsCode3
Major TOM: Expandable Datasets for Earth ObservationCode3
Query-Based Adversarial Prompt GenerationCode3
Show:102550
← PrevPage 78 of 13200Next →