SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 32013250 of 659983 papers

TitleStatusHype
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language ModelsCode3
Frequency-aware Feature Fusion for Dense Image PredictionCode3
BoostTrack++: using tracklet information to detect more objects in multiple object trackingCode3
Controllable Text Generation for Large Language Models: A SurveyCode3
Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless PositioningCode3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF FusionCode3
Recent Advances on Machine Learning for Computational Fluid Dynamics: A SurveyCode3
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented GenerationCode3
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian SplattingCode3
A Survey of Embodied Learning for Object-Centric Robotic ManipulationCode3
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal ModelCode3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image SegmentationCode3
AnyGraph: Graph Foundation Model in the WildCode3
Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code GenerationCode3
Accelerating Goal-Conditioned RL Algorithms and ResearchCode3
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge DevicesCode3
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video GenerationCode3
LoopSplat: Loop Closure by Registering 3D Gaussian SplatsCode3
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation ModelsCode3
ALS-HAR: Harnessing Wearable Ambient Light Sensors to Enhance IMU-based Human Activity RecogntionCode3
The First Competition on Resource-Limited Infrared Small Target Detection Challenge: Methods and ResultsCode3
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing CommunityCode3
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token RecyclingCode3
RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map ConstructionCode3
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language ModelsCode3
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image SegmentationCode3
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language ModelsCode3
Graph Retrieval-Augmented Generation: A SurveyCode3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition TasksCode3
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution RenderingCode3
Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching OptimizationCode3
Aquila2 Technical ReportCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform GenerationCode3
OpenResearcher: Unleashing AI for Accelerated Scientific ResearchCode3
Imagen 3Code3
BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical SearchCode3
FruitNeRF: A Unified Neural Radiance Field based Fruit Counting FrameworkCode3
SkillMimic: Learning Basketball Interaction Skills from DemonstrationsCode3
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image PersonalizationCode3
Music2Latent: Consistency Autoencoders for Latent Audio CompressionCode3
Mambular: A Sequential Model for Tabular Deep LearningCode3
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation AgentsCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
MooER: LLM-based Speech Recognition and Translation Models from Moore ThreadsCode3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2Code3
BoFire: Bayesian Optimization Framework Intended for Real ExperimentsCode3
Hyper-YOLO: When Visual Object Detection Meets Hypergraph ComputationCode3
ECG-FM: An Open Electrocardiogram Foundation ModelCode3
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingCode3
Show:102550
← PrevPage 65 of 13200Next →