SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11451–11500 of 177340 papers

Title | Status | Hype
BatteryML: An Open-source platform for Machine Learning on Battery Degradation | Code | 2
LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Code | 2
N-Dimensional Gaussians for Fitting of High Dimensional Functions | Code | 2
Diffusion Models in Recommendation Systems: A Survey | Code | 2
Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer | Code | 2
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback | Code | 2
CNN-based Density Estimation and Crowd Counting: A Survey | Code | 2
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Code | 2
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization | Code | 2
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation | Code | 2
Fine-grained Image Captioning with CLIP Reward | Code | 2
Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking | Code | 2
emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography | Code | 2
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration | Code | 2
reStructured Pre-training | Code | 2
Mercury: A Code Efficiency Benchmark for Code Large Language Models | Code | 2
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval | Code | 2
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Code | 2
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis | Code | 2
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | Code | 2
Aligning language models with human preferences | Code | 2
Towards Universal Sequence Representation Learning for Recommender Systems | Code | 2
Agent Planning with World Knowledge Model | Code | 2
HAKE: A Knowledge Engine Foundation for Human Activity Understanding | Code | 2
SfM-Free 3D Gaussian Splatting via Hierarchical Training | Code | 2
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations | Code | 2
CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Code | 2
Fast Feedforward Networks | Code | 2
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration | Code | 2
GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Code | 2
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding | Code | 2
Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples | Code | 2
FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation | Code | 2
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges | Code | 2
Generative replay with feedback connections as a general strategy for continual learning | Code | 2
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Code | 2
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models | Code | 2
UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design | Code | 2
Cluster and Predict Latents Patches for Improved Masked Image Modeling | Code | 2
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results | Code | 2
Parting with Misconceptions about Learning-based Vehicle Motion Planning | Code | 2
CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction | Code | 2
An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate Characterization of Tumor Immune Microenvironment | Code | 2
A New Frontier of AI: On-Device AI Training and Personalization | Code | 2
MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing | Code | 2
Towards Understanding and Boosting Adversarial Transferability from a Distribution Perspective | Code | 2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Code | 2
Self-playing Adversarial Language Game Enhances LLM Reasoning | Code | 2
A Pytorch Reproduction of Masked Generative Image Transformer | Code | 2
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought | Code | 2
Page 230 of 3547