SOTAVerified

valid

Papers

Showing 451500 of 3589 papers

TitleStatusHype
Statistically Valid Information Bottleneck via Multiple Hypothesis Testing0
Improving Conditional Level Generation using Automated Validation in Match-3 Games0
NSP: A Neuro-Symbolic Natural Language Navigational Planner0
Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and CorrespondencesCode0
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement0
The Surprising Robustness of Partial Least Squares0
Inference for Large Scale Regression Models with Dependent Errors0
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models0
Leveraging Machine Learning for Official Statistics: A Statistical Manifesto0
FuzzCoder: Byte-level Fuzzing Test via Large Language ModelCode1
Federated Prediction-Powered Inference from Decentralized DataCode0
An essay on the history of DSGE models0
Stochastic Monotonicity and Random Utility Models: The Good and The Ugly0
"Is This It?": Towards Ecologically Valid Benchmarks for Situated Collaboration0
The creative psychometric item generator: a framework for item generation and validation using large language models0
Continual learning with the neural tangent ensemble0
Self-supervised learning for crystal property prediction via denoising0
Can Unconfident LLM Annotations Be Used for Confident Conclusions?Code1
Double/Debiased CoCoLASSO of Treatment Effects with Mismeasured High-Dimensional Control Variables0
EVINCE: Optimizing Multi-LLM Dialogues Using Conditional Statistics and Information Theory0
Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent0
RoCP-GNN: Robust Conformal Prediction for Graph Neural Networks in Node-Classification0
Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity0
Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning0
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and ResultsCode1
Learning Deep Dissipative DynamicsCode0
A Markovian Model for Learning-to-Optimize0
Optical ISAC: Fundamental Performance Limits and Transceiver Design0
Safety-Critical Stabilization of Force-Controlled Nonholonomic Mobile Robots0
Inference with Many Weak Instruments and Heterogeneity0
Conformalized Interval Arithmetic with Symmetric CalibrationCode0
On Learning Action Costs from Input Plans0
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action MaskingCode0
Uncertainty Quantification of Surrogate Models using Conformal PredictionCode1
Importance Weighting Can Help Large Language Models Self-ImproveCode0
Data-driven Conditional Instrumental Variables for Debiasing Recommender Systems0
Generating Automatically Print/Scan Textures for Morphing Attack Detection ApplicationsCode0
Anytime-Valid Inference for Double/Debiased Machine Learning of Causal Parameters0
GraphSPNs: Sum-Product Networks Benefit From Canonical OrderingsCode0
Externally Valid Selection of Experimental Sites via the k-Median Problem0
A Confidence Interval for the _2 Expected Calibration ErrorCode0
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models0
An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut ProblemCode0
Evaluating the Validity of Word-level Adversarial Attacks with Large Language ModelsCode0
QirK: Question Answering via Intermediate Representation on Knowledge Graphs0
Defining and Measuring Disentanglement for non-Independent Factors of Variation0
Design Proteins Using Large Language Models: Enhancements and Comparative AnalysesCode0
Approximating Discrimination Within Models When Faced With Several Non-Binary Sensitive AttributesCode0
People over trust AI-generated medical responses and view them to be as valid as doctors, despite low accuracy0
Show:102550
← PrevPage 10 of 72Next →

No leaderboard results yet.