SOTAVerified

Model Selection

Given a set of candidate models, the goal of Model Selection is to choose the model that best approximates the observed data and captures its underlying regularities. Model Selection criteria are designed to balance goodness of fit against the generalizability, or complexity, of the models.

Source: Kernel-based Information Criterion
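
The trade-off between goodness of fit and complexity described above can be illustrated with two classical criteria, AIC and BIC. The sketch below is a minimal illustration and is not the kernel-based criterion from the cited source. It assumes Gaussian noise, uses polynomial regression as the candidate family, and runs on synthetic data; the helper names `gaussian_aic_bic` and `select_degree` are hypothetical.

```python
import numpy as np


def gaussian_aic_bic(y, y_hat, n_params):
    """Return (AIC, BIC) for a fit with i.i.d. Gaussian residuals (assumed)."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    n = y.size
    rss = float(np.sum((y - y_hat) ** 2))
    # Maximized Gaussian log-likelihood with the MLE noise variance rss / n.
    log_lik = -0.5 * n * (np.log(2.0 * np.pi * rss / n) + 1.0)
    k = n_params + 1  # regression coefficients plus the noise variance
    aic = 2.0 * k - 2.0 * log_lik         # fit term plus a penalty of 2 per parameter
    bic = k * np.log(n) - 2.0 * log_lik   # penalty grows with the sample size
    return aic, bic


def select_degree(x, y, max_degree=6):
    """Score polynomial fits of degree 0..max_degree and pick the minimizers."""
    scores = {}
    for degree in range(max_degree + 1):
        coeffs = np.polyfit(x, y, degree)
        y_hat = np.polyval(coeffs, x)
        scores[degree] = gaussian_aic_bic(y, y_hat, n_params=degree + 1)
    best_aic = min(scores, key=lambda d: scores[d][0])
    best_bic = min(scores, key=lambda d: scores[d][1])
    return best_aic, best_bic, scores


# Synthetic example (assumption): a noisy quadratic signal.
rng = np.random.default_rng(0)
x = np.linspace(-1.0, 1.0, 200)
y = 1.0 - 2.0 * x + 3.0 * x**2 + rng.normal(scale=0.1, size=x.size)

best_aic, best_bic, scores = select_degree(x, y)
print("degree chosen by AIC:", best_aic)
print("degree chosen by BIC:", best_bic)
```

Lower scores are better for both criteria. Underfitting models (degree 0 or 1 here) are rejected because their residual error is large, while higher degrees must reduce the error enough to pay for each extra parameter.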

Papers

Showing 1-50 of 2050 papers

Title | Status | Hype
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | Code | 6
M-Prometheus: A Suite of Open Multilingual LLM Judges | Code | 5
aeon: a Python toolkit for learning from time series | Code | 5
MOSPAT: AutoML based Model Selection and Parameter Tuning for Time Series Anomaly Detection | Code | 5
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks | Code | 4
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing | Code | 4
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning | Code | 3
Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction | Code | 3
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Code | 2
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection | Code | 2
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting | Code | 2
Optimizing Model Selection for Compound AI Systems | Code | 2
Foundational Large Language Models for Materials Research | Code | 2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Code | 2
BSD: a Bayesian framework for parametric models of neural spectra | Code | 2
Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders | Code | 2
Source-Free Domain Adaptation for YOLO Object Detection | Code | 2
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks | Code | 2
The future of cosmological likelihood-based inference: accelerated high-dimensional parameter estimation and model comparison | Code | 2
The CAST package for training and assessment of spatial prediction models in R | Code | 2
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs | Code | 2
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Code | 2
Efficient and Effective Time-Series Forecasting with Spiking Neural Networks | Code | 2
Specializing Smaller Language Models towards Multi-Step Reasoning | Code | 2
Out-of-sample scoring and automatic selection of causal estimators | Code | 2
scikit-fda: A Python Package for Functional Data Analysis | Code | 2
IoT Data Analytics in Dynamic Environments: From An Automated Machine Learning Perspective | Code | 2
DeepDPM: Deep Clustering With an Unknown Number of Clusters | Code | 2
Tuning the Right Foundation Models is What you Need for Partial Label Learning | Code | 1
DeSocial: Blockchain-based Decentralized Social Networks | Code | 1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management | Code | 1
LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection | Code | 1
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification | Code | 1
Time Series Embedding Methods for Classification Tasks: A Review | Code | 1
Stochastic gradient descent estimation of generalized matrix factorization models with application to single-cell RNA sequencing data | Code | 1
Towards Unsupervised Model Selection for Domain Adaptive Object Detection | Code | 1
AD-LLM: Benchmarking Large Language Models for Anomaly Detection | Code | 1
NLP-ADBench: NLP Anomaly Detection Benchmark | Code | 1
Evaluating Language Models as Synthetic Data Generators | Code | 1
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Code | 1
Noether's razor: Learning Conserved Quantities | Code | 1
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment | Code | 1
Triple equivalence for the emergence of biological intelligence | Code | 1
Towards Autonomous Cybersecurity: An Intelligent AutoML Framework for Autonomous Intrusion Detection | Code | 1
Automated Machine Learning in Insurance | Code | 1
Hologram Reasoning for Solving Algebra Problems with Geometry Diagrams | Code | 1
ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction Tasks | Code | 1
Binary Bleed: Fast Distributed and Parallel Method for Automatic Model Selection | Code | 1
Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks | Code | 1
SKADA-Bench: Benchmarking Unsupervised Domain Adaptation Methods with Realistic Validation On Diverse Modalities | Code | 1

No leaderboard results yet.