SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1900119050 of 474278 papers

TitleStatusHype
LAiW: A Chinese Legal Large Language Models BenchmarkCode1
Verilog-to-PyG -- A Framework for Graph Learning and Augmentation on RTL DesignsCode1
Pareto Set Learning for Expensive Multi-Objective OptimizationCode1
Class-Balancing Diffusion ModelsCode1
RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k RecommendationCode1
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-SpeechCode1
A Comprehensive Study on Knowledge Graph Embedding over Relational Patterns Based on Rule LearningCode1
LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT PluginsCode1
Exploring Attention-Aware Network Resource Allocation for Customized Metaverse ServicesCode1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain FeedbackCode1
Evaluating the Robustness of Off-Policy EvaluationCode1
Recurrent Bilinear Optimization for Binary Neural NetworksCode1
Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge DistillationCode1
Real-Time Neural Character Rendering with Pose-Guided Multiplane ImagesCode1
Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention FormulationCode1
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured RepresentationsCode1
PRIMUS: Pretraining IMU Encoders with Multimodal Self-SupervisionCode1
HSIMamba: Hyperpsectral Imaging Efficient Feature Learning with Bidirectional State Space for ClassificationCode1
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame RateCode1
ACTION: Augmentation and Computation Toolbox for Brain Network Analysis with Functional MRICode1
LaDCast: A Latent Diffusion Model for Medium-Range Ensemble Weather ForecastingCode1
Measuring and Mitigating Bias for Tabular Datasets with Multiple Protected AttributesCode1
LatentEditor: Text Driven Local Editing of 3D ScenesCode1
Sampling-Based Accuracy Testing of Posterior Estimators for General InferenceCode1
Retro-fallback: retrosynthetic planning in an uncertain worldCode1
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking FacesCode1
BehAVE: Behaviour Alignment of Video Game EncodingsCode1
Cooperation and Fairness in Multi-Agent Reinforcement LearningCode1
Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian SplattingCode1
Image Inpainting via Iteratively Decoupled Probabilistic ModelingCode1
OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object InteractionCode1
Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale ExtractionCode1
Capturing Smile Dynamics with the Quintic Volatility Model: SPX, Skew-Stickiness Ratio and VIXCode1
Language Models as Hierarchy EncodersCode1
SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2Code1
Automating MedSAM by Learning Prompts with Weak Few-Shot SupervisionCode1
A Dual-Space Framework for General Knowledge Distillation of Large Language ModelsCode1
SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight CompressionCode1
Spatio-channel Attention Blocks for Cross-modal Crowd CountingCode1
Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AICode1
THUIR@COLIEE 2023: More Parameters and Legal Knowledge for Legal Case EntailmentCode1
Im2Oil: Stroke-Based Oil Painting Rendering with Linearly Controllable Fineness Via Adaptive SamplingCode1
Efficient and Accurate Physics-aware Multiplex Graph Neural Networks for 3D Small Molecules and Macromolecule ComplexesCode1
BEATS: An Open-Source, High-Precision, Multi-Channel EEG Acquisition Tool SystemCode1
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation ModelsCode1
Extremal Domain Translation with Neural Optimal TransportCode1
Segmenting Moving Objects via an Object-Centric Layered RepresentationCode1
FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning SimulationsCode1
Epidemiological Agent-Based Modelling Software (Epiabm)Code1
Is GPT-4 a Good Data Analyst?Code1
Show:102550
← PrevPage 381 of 9486Next →