SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 25512600 of 659983 papers

TitleStatusHype
Accelerating Neural Network Training: An Analysis of the AlgoPerf CompetitionCode3
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological CounselingCode3
U^2-Net: Going Deeper with Nested U-Structure for Salient Object DetectionCode3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-ServeCode3
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding ModelsCode3
Robust High-Resolution Video Matting with Temporal GuidanceCode3
DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingCode3
A Practical Review of Mechanistic Interpretability for Transformer-Based Language ModelsCode3
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model AgentsCode3
TristouNet: Triplet Loss for Speaker Turn EmbeddingCode3
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The WildCode3
LION: Linear Group RNN for 3D Object Detection in Point CloudsCode3
Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture GenerationCode3
Robust and Accurate Object Detection via Adversarial LearningCode3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid ArchitectureCode3
MultiModal-GPT: A Vision and Language Model for Dialogue with HumansCode3
Behavior Generation with Latent ActionsCode3
OmniAudio: Generating Spatial Audio from 360-Degree VideoCode3
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSONCode3
Ludwig: a type-based declarative deep learning toolboxCode3
Comparison of Syntactic and Semantic Representations of Programs in Neural EmbeddingsCode3
WantWords: An Open-source Online Reverse Dictionary SystemCode3
Implicit Style-Content Separation using B-LoRACode3
EfficientQAT: Efficient Quantization-Aware Training for Large Language ModelsCode3
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM AgentsCode3
EfficientDet: Scalable and Efficient Object DetectionCode3
Auto-Sklearn 2.0: Hands-free AutoML via Meta-LearningCode3
LongRoPE2: Near-Lossless LLM Context Window ScalingCode3
A Novel Non-population-based Meta-heuristic Optimizer Inspired by the Philosophy of Yi JingCode3
CarDreamer: Open-Source Learning Platform for World Model based Autonomous DrivingCode3
Practical Video Object Detection via Feature Selection and AggregationCode3
LIMR: Less is More for RL ScalingCode3
WebCanvas: Benchmarking Web Agents in Online EnvironmentsCode3
ptwt - The PyTorch Wavelet ToolboxCode3
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingCode3
DNA Family: Boosting Weight-Sharing NAS with Block-Wise SupervisionsCode3
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free ResolutionCode3
MTVCrafter: 4D Motion Tokenization for Open-World Human Image AnimationCode3
mlpack 3: a fast, flexible machine learning libraryCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
Rethinking Histology Slide Digitization Workflows for Low-Resource SettingsCode3
Allo: A Programming Model for Composable Accelerator DesignCode3
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of ManipulationsCode3
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextCode3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM InferenceCode3
PyTorch Metric LearningCode3
ReasonIR: Training Retrievers for Reasoning TasksCode3
OCR-free Document Understanding TransformerCode3
Show:102550
← PrevPage 52 of 13200Next →