SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1895119000 of 474278 papers

TitleStatusHype
GraspNeRF: Multiview-based 6-DoF Grasp Detection for Transparent and Specular Objects Using Generalizable NeRFCode1
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal VerificationCode1
Rotation Invariant Transformer for Recognizing Object in UAVsCode1
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language ModelsCode1
LIMO: Latent Inceptionism for Targeted Molecule GenerationCode1
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue ResolutionCode1
Audio-Visual Class-Incremental LearningCode1
ACPO: AI-Enabled Compiler FrameworkCode1
ORKG-Leaderboards: A Systematic Workflow for Mining Leaderboards as a Knowledge GraphCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
Latent Variable Sequential Set Transformers For Joint Multi-Agent Motion PredictionCode1
AssistQ: Affordance-centric Question-driven Task Completion for Egocentric AssistantCode1
FragNet: A Graph Neural Network for Molecular Property Prediction with Four Levels of InterpretabilityCode1
Improving Generalization in Federated Learning by Seeking Flat MinimaCode1
LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge AugmentationCode1
BERT-ATTACK: Adversarial Attack Against BERT Using BERTCode1
Social Bot-Aware Graph Neural Network for Early Rumor DetectionCode1
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in ConversationsCode1
Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological ImagesCode1
A New Dataset and A Baseline Model for Breast Lesion Detection in Ultrasound VideosCode1
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of PathologyCode1
AWAC: Accelerating Online Reinforcement Learning with Offline DatasetsCode1
MarkushGrapher: Joint Visual and Textual Recognition of Markush StructuresCode1
Shapley Values-enabled Progressive Pseudo Bag Augmentation for Whole Slide Image ClassificationCode1
AI4COVID-19: AI Enabled Preliminary Diagnosis for COVID-19 from Cough Samples via an AppCode1
Recovering complex ecological dynamics from time series using state-space universal dynamic equationsCode1
Self-Labeling the Job Shop Scheduling ProblemCode1
Efficient Training of Audio Transformers with PatchoutCode1
On Positional and Structural Node Features for Graph Neural Networks on Non-attributed GraphsCode1
Variational Deep Embedding: An Unsupervised and Generative Approach to ClusteringCode1
Low-Rank Similarity Mining for Multimodal Dataset DistillationCode1
Token Cropr: Faster ViTs for Quite a Few TasksCode1
SC2EGSet: StarCraft II Esport Replay and Game-state DatasetCode1
LSDNet: Trainable Modification of LSD Algorithm for Real-Time Line Segment DetectionCode1
ReCU: Reviving the Dead Weights in Binary Neural NetworksCode1
Learning multi-scale local conditional probability models of imagesCode1
A Challenging Benchmark of Anime Style RecognitionCode1
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward ModelsCode1
Causal thinking for decision making on Electronic Health Records: why and howCode1
Few-Shot Class-Incremental Learning from an Open-Set PerspectiveCode1
MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence FrontiersCode1
Instrumental Variables in Causal Inference and Machine Learning: A SurveyCode1
Reachability Constrained Reinforcement LearningCode1
StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span MemoryCode1
Interior Attention-Aware Network for Infrared Small Target DetectionCode1
Q-Probe: A Lightweight Approach to Reward Maximization for Language ModelsCode1
Model-Based Transfer Learning for Contextual Reinforcement LearningCode1
High Dynamic Range Image Reconstruction via Deep Explicit Polynomial Curve EstimationCode1
Sample- and Parameter-Efficient Auto-Regressive Image ModelsCode1
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image SynthesisCode1
Show:102550
← PrevPage 380 of 9486Next →