SOTAVerified

Model Selection

Given a set of candidate models, the goal of model selection is to choose the model that best approximates the observed data and captures its underlying regularities. Model selection criteria are defined so that they strike a balance between goodness of fit and the generalizability or complexity of the models.

Source: Kernel-based Information Criterion
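
As a rough illustration of how a selection criterion trades goodness of fit against complexity, the sketch below scores two candidate regression models with AIC and BIC. These two standard criteria are used here only as familiar stand-ins; they are not the kernel-based criterion from the source. The toy data, the function names, and the choice to count the noise variance as a parameter are all assumptions made for the example.

```python
import math

def gaussian_log_likelihood(residuals):
    """Maximized Gaussian log-likelihood, with the variance set to its MLE."""
    n = len(residuals)
    sigma2 = sum(r * r for r in residuals) / n
    return -0.5 * n * (math.log(2 * math.pi * sigma2) + 1)

def aic(log_lik, k):
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * k - 2 * log_lik

def bic(log_lik, k, n):
    """Bayesian information criterion: k ln n - 2 ln L (lower is better)."""
    return k * math.log(n) - 2 * log_lik

def fit_constant(xs, ys):
    """Least-squares constant model; returns the residuals."""
    mean = sum(ys) / len(ys)
    return [y - mean for y in ys]

def fit_line(xs, ys):
    """Least-squares straight line; returns the residuals."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]

# Hypothetical toy data: a roughly linear trend with small perturbations.
xs = [0, 1, 2, 3, 4, 5, 6, 7]
ys = [0.1, 1.9, 4.2, 5.8, 8.1, 9.9, 12.2, 13.8]

# Each candidate: (fitting function, number of free parameters k).
# k includes the noise variance, which is an assumption of this sketch.
candidates = {"constant": (fit_constant, 2), "linear": (fit_line, 3)}

scores = {}
for name, (fit, k) in candidates.items():
    log_lik = gaussian_log_likelihood(fit(xs, ys))
    scores[name] = {"aic": aic(log_lik, k), "bic": bic(log_lik, k, len(xs))}

# Select the candidate with the lowest BIC.
best = min(scores, key=lambda m: scores[m]["bic"])
print(best, scores)
```

Both criteria add a penalty that grows with the parameter count, so a more complex model is preferred only when its improvement in likelihood outweighs that penalty.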

Papers

Showing 1851–1900 of 2050 papers

Title | Status | Hype
Improved Group Robustness via Classifier Retraining on Independent Splits | Code | 0
Nearest Neighbour Equilibrium Clustering | Code | 0
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions | Code | 0
BiasBed -- Rigorous Texture Bias Evaluation | Code | 0
Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Code | 0
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | Code | 0
Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift | Code | 0
Improving Session Recommendation with Recurrent Neural Networks by Exploiting Dwell Time | Code | 0
Improving Subseasonal Forecasting in the Western U.S. with Machine Learning | Code | 0
Agreement-on-the-Line: Predicting the Performance of Neural Networks under Distribution Shift | Code | 0
In all LikelihoodS: How to Reliably Select Pseudo-Labeled Data for Self-Training in Semi-Supervised Learning | Code | 0
IncomeSCM: From tabular data set to time-series simulator and causal estimation benchmark | Code | 0
A survey of probabilistic generative frameworks for molecular simulations | Code | 0
Increasing certainty in systems biology models using Bayesian multimodel inference | Code | 0
Referenced Thermodynamic Integration for Bayesian Model Selection: Application to COVID-19 Model Selection | Code | 0
Towards Measuring Representational Similarity of Large Language Models | Code | 0
Indian Buffet process for model selection in convolved multiple-output Gaussian processes | Code | 0
Individualized Prediction of COVID-19 Adverse outcomes with MLHO | Code | 0
Neural Architecture Search with Bayesian Optimisation and Optimal Transport | Code | 0
Neural Bayes inference for complex bivariate extremal dependence models | Code | 0
Deep Learning and Linear Programming for Automated Ensemble Forecasting and Interpretation | Code | 0
INFaaS: A Model-less and Managed Inference Serving System | Code | 0
Neural Vector Spaces for Unsupervised Information Retrieval | Code | 0
Reliable Time Prediction in the Markov Stochastic Block Model | Code | 0
Comparative Evaluation of Learning Models for Bionic Robots: Non-Linear Transfer Function Identifications | Code | 0
Inferring Convolutional Neural Networks' accuracies from their architectural characterizations | Code | 0
Framework for Inferring Following Strategies from Time Series of Movement Data | Code | 0
Comparative and Interpretative Analysis of CNN and Transformer Models in Predicting Wildfire Spread Using Remote Sensing Data | Code | 0
Combining UPerNet and ConvNeXt for Contrails Identification to reduce Global Warming | Code | 0
Infinite Action Contextual Bandits with Reusable Data Exhaust | Code | 0
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation | Code | 0
Diagnostic Tool for Out-of-Sample Model Evaluation | Code | 0
Automated Dependence Plots | Code | 0
Simultaneous Dimensionality and Complexity Model Selection for Spectral Graph Clustering | Code | 0
A general technique for the estimation of farm animal body part weights from CT scans and its applications in a rabbit breeding program | Code | 0
Quality Estimation for Image Captions Based on Large-scale Human Evaluations | Code | 0
Towards Model Selection using Learning Curve Cross-Validation | Code | 0
In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation | Code | 0
DGM-DR: Domain Generalization with Mutual Information Regularized Diabetic Retinopathy Classification | Code | 0
Beyond Benchmarks: Evaluating Embedding Model Similarity for Retrieval Augmented Generation Systems | Code | 0
Quantifying Robustness: A Benchmarking Framework for Deep Learning Forecasting in Cyber-Physical Systems | Code | 0
ANA at SemEval-2020 Task 4: mUlti-task learNIng for cOmmonsense reasoNing (UNION) | Code | 0
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards | Code | 0
Interpretability of Multivariate Brain Maps in Brain Decoding: Definition and Quantification | Code | 0
Understanding new tasks through the lens of training data via exponential tilting | Code | 0
ARDA: Automatic Relational Data Augmentation for Machine Learning | Code | 0
Deep Learning in a Generalized HJM-type Framework Through Arbitrage-Free Regularization | Code | 0
DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance | Code | 0
AALF: Almost Always Linear Forecasting | Code | 0
Degrees of Freedom and Model Selection for k-means Clustering | Code | 0

No leaderboard results yet.