SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 21101-21150 of 474278 papers

Title | Status | Hype
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders | Code | 1
PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects | Code | 1
TabGraphs: A Benchmark and Strong Baselines for Learning on Graphs with Tabular Node Features | Code | 1
Lidar Panoptic Segmentation in an Open World | Code | 1
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models | Code | 1
MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Code | 1
What Are They Doing? Joint Audio-Speech Co-Reasoning | Code | 1
UU-Mamba: Uncertainty-aware U-Mamba for Cardiovascular Segmentation | Code | 1
BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | Code | 1
Instruction Following without Instruction Tuning | Code | 1
MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule | Code | 1
StateAct: State Tracking and Reasoning for Acting and Planning with Large Language Models | Code | 1
GAInS: Gradient Anomaly-aware Biomedical Instance Segmentation | Code | 1
Content-aware Tile Generation using Exterior Boundary Inpainting | Code | 1
Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning | Code | 1
BRep Boundary and Junction Detection for CAD Reverse Engineering | Code | 1
ChronoGAN: Supervised and Embedded Generative Adversarial Networks for Time Series Generation | Code | 1
PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization | Code | 1
ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models | Code | 1
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Code | 1
FracGM: A Fast Fractional Programming Technique for Geman-McClure Robust Estimator | Code | 1
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench | Code | 1
FAIR GPT: A virtual consultant for research data management in ChatGPT | Code | 1
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling | Code | 1
Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Code | 1
"I Never Said That": A dataset, taxonomy and baselines on response clarity classification | Code | 1
Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models | Code | 1
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models | Code | 1
Temporally Aligned Audio for Video with Autoregression | Code | 1
AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity | Code | 1
Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis | Code | 1
Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks | Code | 1
OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping | Code | 1
Federated Learning with Label-Masking Distillation | Code | 1
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Code | 1
Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval | Code | 1
A preliminary study on continual learning in computer vision using Kolmogorov-Arnold Networks | Code | 1
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey | Code | 1
PlainUSR: Chasing Faster ConvNet for Efficient Super-Resolution | Code | 1
Intrinsic Single-Image HDR Reconstruction | Code | 1
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model | Code | 1
A Personalised 3D+t Mesh Generative Model for Unveiling Normal Heart Dynamics | Code | 1
Multiscale Encoder and Omni-Dimensional Dynamic Convolution Enrichment in nnU-Net for Brain Tumor Segmentation | Code | 1
Cross-Domain Knowledge Transfer for Underwater Acoustic Classification Using Pre-trained Models | Code | 1
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition | Code | 1
Prithvi WxC: Foundation Model for Weather and Climate | Code | 1
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources | Code | 1
Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks | Code | 1
Exploring Text-Queried Sound Event Detection with Audio Source Separation | Code | 1
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation | Code | 1