SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1830118350 of 474278 papers

TitleStatusHype
SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts AggregationCode1
Technical Report for the Forgotten-by-Design Project: Targeted Obfuscation for Machine LearningCode1
A Survey of World Models for Autonomous DrivingCode1
UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital CollaborationCode1
Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image ClassificationCode1
Curiosity-Driven Reinforcement Learning from Human FeedbackCode1
Automatic Labelling & Semantic Segmentation with 4D Radar TensorsCode1
MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought ThinkingCode1
Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP DocumentsCode1
MedicoSAM: Towards foundation models for medical image segmentationCode1
Glinthawk: A Two-Tiered Architecture for Offline LLM InferenceCode1
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual ExplanationCode1
PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth CuesCode1
Control LLM: Controlled Evolution for Intelligence Retention in LLMCode1
Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition DataCode1
InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language ModelsCode1
ChaosEater: Fully Automating Chaos Engineering with Large Language ModelsCode1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language ModelCode1
Tell me about yourself: LLMs are aware of their learned behaviorsCode1
BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-ResolutionCode1
GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. HumanCode1
A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial DifferencesCode1
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMsCode1
Simultaneous Computation with Multiple Prioritizations in Multi-Agent Motion PlanningCode1
Graph Coloring to Reduce Computation Time in Prioritized PlanningCode1
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight DetectionCode1
Dynamic Trend Fusion Module for Traffic Flow PredictionCode1
Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student AttentionCode1
MedFILIP: Medical Fine-grained Language-Image Pre-trainingCode1
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar EditorCode1
Evaluation and Efficiency Comparison of Evolutionary Algorithms for Service Placement Optimization in Fog ArchitecturesCode1
GenSC-6G: A Prototype Testbed for Integrated Generative AI, Quantum, and Semantic CommunicationCode1
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural NetworksCode1
MSTS: A Multimodal Safety Test Suite for Vision-Language ModelsCode1
When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysisCode1
Agent-as-Judge for Factual Summarization of Long NarrativesCode1
The R-Vessel-X ProjectCode1
Aneumo: A Large-Scale Comprehensive Synthetic Dataset of Aneurysm HemodynamicsCode1
PandaSkill -- Player Performance and Skill Rating in Esports: Application to League of LegendsCode1
landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D ImagesCode1
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified RepresentationsCode1
Surrogate-based multiscale analysis of experiments on thermoplastic composites under off-axis loadingCode1
FaceXBench: Evaluating Multimodal LLMs on Face UnderstandingCode1
MechIR: A Mechanistic Interpretability Framework for Information RetrievalCode1
A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal RegressionCode1
OpticFusion: Multi-Modal Neural Implicit 3D Reconstruction of Microstructures by Fusing White Light Interferometry and Optical MicroscopyCode1
HSPFormer: Hierarchical Spatial Perception Transformer for Semantic SegmentationCode1
DSTIGCN: Deformable Spatial-Temporal Interaction Graph Convolution Network for Pedestrian Trajectory PredictionCode1
Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model MergingCode1
Lossy Compression with Pretrained Diffusion ModelsCode1
Show:102550
← PrevPage 367 of 9486Next →