SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3201-3225 of 661,570 papers

Title | Status | Hype
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models | Code | 3
BoostTrack++: using tracklet information to detect more objects in multiple object tracking | Code | 3
Frequency-aware Feature Fusion for Dense Image Prediction | Code | 3
Recent Advances on Machine Learning for Computational Fluid Dynamics: A Survey | Code | 3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Code | 3
Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning | Code | 3
Controllable Text Generation for Large Language Models: A Survey | Code | 3
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation | Code | 3
A Survey of Embodied Learning for Object-Centric Robotic Manipulation | Code | 3
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting | Code | 3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image Segmentation | Code | 3
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Code | 3
AnyGraph: Graph Foundation Model in the Wild | Code | 3
Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation | Code | 3
Accelerating Goal-Conditioned RL Algorithms and Research | Code | 3
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices | Code | 3
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Code | 3
LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Code | 3
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Code | 3
ALS-HAR: Harnessing Wearable Ambient Light Sensors to Enhance IMU-based Human Activity Recogntion | Code | 3
The First Competition on Resource-Limited Infrared Small Target Detection Challenge: Methods and Results | Code | 3
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Code | 3
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling | Code | 3
RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction | Code | 3
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Code | 3
Page 129 of 26463