SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8901-8950 of 661,570 papers

Title | Status | Hype
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Code | 2
FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space | Code | 2
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models | Code | 2
SCIMAP: A Python Toolkit for Integrated Spatial Analysis of Multiplexed Imaging Data | Code | 2
Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Code | 2
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network Potentials | Code | 2
Multi-Space Alignments Towards Universal LiDAR Segmentation | Code | 2
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Code | 2
EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Code | 2
SATO: Stable Text-to-Motion Framework | Code | 2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | Code | 2
Benchmarking Representations for Speech, Music, and Acoustic Events | Code | 2
LocInv: Localization-aware Inversion for Text-Guided Image Editing | Code | 2
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints | Code | 2
SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising | Code | 2
MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Code | 2
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey | Code | 2
HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond | Code | 2
Toward Unified Practices in Trajectory Prediction Research on Bird's-Eye-View Datasets | Code | 2
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | Code | 2
Spectrally Pruned Gaussian Fields with Neural Compensation | Code | 2
ASAM: Boosting Segment Anything Model with Adversarial Tuning | Code | 2
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting | Code | 2
GraCo: Granularity-Controllable Interactive Segmentation | Code | 2
Causal Evaluation of Language Models | Code | 2
TFPred: Learning Discriminative Representations from Unlabeled Data for Few-Label Rotating Machinery Fault Diagnosis | Code | 2
Training-free Graph Neural Networks and the Power of Labels as Features | Code | 2
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation | Code | 2
Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging | Code | 2
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization | Code | 2
Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation | Code | 2
MicroDreamer: Efficient 3D Generation in 20 Seconds by Score-based Iterative Reconstruction | Code | 2
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly | Code | 2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation | Code | 2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Code | 2
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Code | 2
4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations | Code | 2
Benchmarking Benchmark Leakage in Large Language Models | Code | 2
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Code | 2
Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras | Code | 2
3D Gaussian Splatting with Deferred Reflection | Code | 2
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting | Code | 2
Joint Signal Detection and Automatic Modulation Classification via Deep Learning | Code | 2
Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations | Code | 2
How secure is AI-generated Code: A Large-Scale Comparison of Large Language Models | Code | 2
RSCaMa: Remote Sensing Image Change Captioning with State Space Model | Code | 2
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval | Code | 2
SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection Methods | Code | 2
OpenStreetView-5M: The Many Roads to Global Visual Geolocation | Code | 2
OAEI Machine Learning Dataset for Online Model Generation | Code | 2
Page 179 of 13232