SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 19201–19250 of 474278 papers

Title | Status | Hype
Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention | Code | 1
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Code | 1
Training-Free Bayesianization for Low-Rank Adapters of Large Language Models | Code | 1
M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Code | 1
Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising | Code | 1
Finite Element Neural Network Interpolation. Part I: Interpretable and Adaptive Discretization for Solving PDEs | Code | 1
CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences | Code | 1
Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning | Code | 1
TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models | Code | 1
PrivAgent: Agentic-based Red-teaming for LLM Privacy Leakage | Code | 1
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds | Code | 1
Fragmented Layer Grouping in GUI Designs Through Graph Learning Based on Multimodal Information | Code | 1
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts | Code | 1
Slicing Vision Transformer for Flexible Inference | Code | 1
PyTerrier-GenRank: The PyTerrier Plugin for Reranking with Large Language Models | Code | 1
Learning to Translate Noise for Robust Image Denoising | Code | 1
DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling | Code | 1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Code | 1
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Code | 1
Two stages domain invariant representation learners solve the large co-variate shift in unsupervised domain adaptation with two dimensional data domains | Code | 1
COOOL: Challenge Of Out-Of-Label A Novel Benchmark for Autonomous Driving | Code | 1
Extrapolated Urban View Synthesis Benchmark | Code | 1
Machine Learning-Based mmWave MIMO Beam Tracking in V2I Scenarios: Algorithms and Datasets | Code | 1
Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens | Code | 1
Explingo: Explaining AI Predictions using Large Language Models | Code | 1
Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Code | 1
Transformers Can Navigate Mazes With Multi-Step Prediction | Code | 1
NLP-ADBench: NLP Anomaly Detection Benchmark | Code | 1
Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications | Code | 1
Neural Representation for Wireless Radiation Field Reconstruction: A 3D Gaussian Splatting Approach | Code | 1
DrIFT: Autonomous Drone Dataset with Integrated Real and Synthetic Data, Flexible Views, and Transformed Domains | Code | 1
Customized Generation Reimagined: Fidelity and Editability Harmonized | Code | 1
TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Code | 1
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Code | 1
SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models | Code | 1
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale | Code | 1
KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Code | 1
One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Code | 1
SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization | Code | 1
Smoothie: Label Free Language Model Routing | Code | 1
SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Code | 1
Training MLPs on Graphs without Supervision | Code | 1
PDG2Seq: Periodic Dynamic Graph to Sequence Model for Traffic Flow Prediction | Code | 1
Does your model understand genes? A benchmark of gene properties for biological and text models | Code | 1
ProtBoost: protein function prediction with Py-Boost and Graph Neural Networks -- CAFA5 top2 solution | Code | 1
GRAM: Generalization in Deep RL with a Robust Adaptation Module | Code | 1
Grounding Descriptions in Images informs Zero-Shot Visual Recognition | Code | 1
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Code | 1
3D Part Segmentation via Geometric Aggregation of 2D Visual Features | Code | 1
HyperMARL: Adaptive Hypernetworks for Multi-Agent RL | Code | 1
Page 385 of 9486