SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1750117550 of 474278 papers

TitleStatusHype
A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations0
GS4: Generalizable Sparse Splatting Semantic SLAM0
Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments0
Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning0
Edge-Enabled Collaborative Object Detection for Real-Time Multi-Vehicle PerceptionCode0
NeurNCD: Novel Class Discovery via Implicit Neural Representation0
Generating Long Semantic IDs in Parallel for RecommendationCode2
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World ModelCode1
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric VisionCode0
EqCollide: Equivariant and Collision-Aware Deformable Objects Neural Simulator0
FlowOE: Imitation Learning with Flow Policy from Ensemble RL Experts for Optimal Execution under Heston Volatility and Concave Market Impacts0
The World of AI: A Novel Approach to AI Literacy for First-year Engineering Students0
BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning0
BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly0
Neural-Augmented Kelvinlet: Real-Time Soft Tissue Deformation with Multiple GraspersCode0
Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning FrameworkCode0
TimeWak: Temporal Chained-Hashing Watermark for Time Series DataCode0
Voice Impression Control in Zero-Shot TTS0
InstantFT: An FPGA-Based Runtime Subsecond Fine-tuning of CNN Models0
CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents0
Exploring Microstructural Dynamics in Cryptocurrency Limit Order Books: Better Inputs Matter More Than Stacking Another Hidden Layer0
Pruning Spurious Subgraphs for Graph Out-of-Distribtuion GeneralizationCode0
An Optimized Franz-Parisi Criterion and its Equivalence with SQ Lower Bounds0
FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and ChallengingCode1
Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph PropertiesCode1
MLOps with Microservices: A Case Study on the Maritime Domain0
Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions0
Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models0
CodeContests+: High-Quality Test Case Generation for Competitive Programming0
CP-Bench: Evaluating Large Language Models for Constraint Modelling0
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based AttacksCode0
TissUnet: Improved Extracranial Tissue and Cranium Segmentation for Children through AdulthoodCode0
SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction0
Statistical Guarantees in Data-Driven Nonlinear Control: Conformal Robustness for Stability and Safety0
Eigenspectrum Analysis of Neural Networks without Aspect Ratio BiasCode1
Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated ImageryCode0
MLLM-CL: Continual Learning for Multimodal Large Language Models0
PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling0
Pseudo-Siamese Blind-Spot Transformers for Self-Supervised Real-World DenoisingCode0
Customizing Speech Recognition Model with Large Language Model Feedback0
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM0
Intelligibility of Text-to-Speech Systems for Mathematical Expressions0
Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving0
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal ReasoningCode1
MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs0
Dynamic Context Tuning for Retrieval-Augmented Generation: Enhancing Multi-Turn Planning and Tool Adaptation0
Deep histological synthesis from mass spectrometry imaging for multimodal registrationCode0
U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation0
Massive MIMO with 1-Bit DACs: Data Detection for Quantized Linear Precoding with Dithering0
Spectral Efficiency Maximization for mmWave MIMO-Aided Integrated Sensing and Communication Under Practical Constraints0
Show:102550
← PrevPage 351 of 9486Next →