SOTAVerified

valid

Papers

Showing 12011250 of 3589 papers

TitleStatusHype
CREST: A Joint Framework for Rationalization and Counterfactual Text GenerationCode0
STL: Surprisingly Tricky Logic (for System Validation)0
DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion ModelsCode1
Learning and Leveraging Verifiers to Improve Planning Capabilities of Pre-trained Language Models0
Evaluation of Question Generation Needs More References0
Superpixelwise Low-Rank Approximation-Based Partial Label Learning for Hyperspectral Image ClassificationCode0
On the Robustness of Segment Anything0
End-to-End Meta-Bayesian Optimisation with Transformer Neural ProcessesCode0
Exponential Smoothing for Off-Policy Learning0
Flocks of Stochastic Parrots: Differentially Private Prompt Learning for Large Language Models0
Increasing Probability Mass on Answer Choices Does Not Always Improve AccuracyCode0
Short and Straight: Geodesics on Differentiable Manifolds0
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement TheoryCode1
On Degrees of Freedom in Defining and Testing Natural Language Understanding0
Annotation Imputation to Individualize Predictions: Initial Studies on Distribution Dynamics and Model PredictionsCode0
Uncertainty Quantification over Graph with Conformalized Graph Neural NetworksCode1
Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models0
Enhanced Fine-grained Motion Diffusion for Text-driven Human Motion Synthesis0
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs0
Grammar-Constrained Decoding for Structured NLP Tasks without FinetuningCode2
Tight conditions for when the NTK approximation is valid0
Many or Few Samples? Comparing Transfer, Contrastive and Meta-Learning in Encrypted Traffic Classification0
A parametric distribution for exact post-selection inference with data carvingCode0
Logic-Based Benders Decomposition in Answer Set Programming for Chronic Outpatients Scheduling0
Robust Counterfactual Explanations for Neural Networks With Probabilistic GuaranteesCode0
Generalized Multiple Intent Conditioned Slot Filling0
Wavefield Networked Sensing: Principles, Algorithms and Applications0
Generation of 3D Molecules in Pockets via Language Model0
Complementary Classifier Induced Partial Label LearningCode0
Finding an ε-close Variation of Parameters in Bayesian Networks0
Score Operator Newton transport0
Learning Linear Embeddings for Non-Linear Network Dynamics with Koopman Message Passing0
How Expressive are Spectral-Temporal Graph Neural Networks for Time Series Forecasting?0
SMATCH++: Standardized and Extended Evaluation of Semantic GraphsCode1
Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense KnowledgeCode0
Testing for OverfittingCode0
Language models can generate molecules, materials, and protein binding sites directly in three dimensions as XYZ, CIF, and PDB files0
Comparing Foundation Models using Data Kernels0
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge0
Algebra Error Classification with Large Language ModelsCode0
Non-Autoregressive Math Word Problem Solver with Unified Tree StructureCode1
A nation-wide experiment, part II: the introduction of a 49-Euro-per-month travel pass in Germany -- An empirical study on this fare innovation0
Faithful Question Answering with Monte-Carlo PlanningCode0
ReMask: A Robust Information-Masking Approach for Domain Counterfactual GenerationCode0
CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing0
Doubly Robust Uniform Confidence Bands for Group-Time Conditional Average Treatment Effects in Difference-in-Differences0
Geometric Latent Diffusion Models for 3D Molecule GenerationCode2
Large Linguistic Models: Investigating LLMs' metalinguistic abilities0
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language modelCode1
POET: A Self-learning Framework for PROFINET Industrial Operations Behaviour0
Show:102550
← PrevPage 25 of 72Next →

No leaderboard results yet.