| DPPA: Pruning Method for Large Language Model to Model Merging | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On Exploiting Hitting Sets for Model Reconciliation | Dec 16, 2020 | model | CodeCode Available | 0 |
| CEB Improves Model Robustness | Feb 13, 2020 | Adversarial RobustnessData Augmentation | CodeCode Available | 0 |
| Causal Inference for Human-Language Model Collaboration | Mar 30, 2024 | Causal Inferencecounterfactual | CodeCode Available | 0 |
| Hyperparameter Power Impact in Transformer Language Model Training | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Hyperbolic normal stochastic volatility model | Sep 11, 2018 | model | CodeCode Available | 0 |
| Hyperbolic Interaction Model For Hierarchical Multi-Label Classification | May 26, 2019 | ClassificationGeneral Classification | CodeCode Available | 0 |
| A Model of the Fed's View on Inflation | Jun 25, 2020 | modelTime Series | CodeCode Available | 0 |
| Hydra: Preserving Ensemble Diversity for Model Distillation | Jan 14, 2020 | Diversitymodel | CodeCode Available | 0 |
| Does Dataset Complexity Matters for Model Explainers? | Jul 6, 2021 | AttributeExplainable artificial intelligence | CodeCode Available | 0 |
| Towards Backdoor Stealthiness in Model Parameter Space | Jan 10, 2025 | backdoor defensemodel | CodeCode Available | 0 |
| hULMonA: The Universal Language Model in Arabic | Aug 1, 2019 | Arabic Sentiment AnalysisGeneral Classification | CodeCode Available | 0 |
| How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation? | Jun 6, 2023 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| Rethinking Robustness of Model Attributions | Dec 16, 2023 | Diversitymodel | CodeCode Available | 0 |
| Online Ensemble Model Compression using Knowledge Distillation | Nov 15, 2020 | Knowledge Distillationmodel | CodeCode Available | 0 |
| Spatial machine-learning model diagnostics: a model-agnostic distance-based approach | Nov 13, 2021 | BIG-bench Machine LearningDiagnostic | CodeCode Available | 0 |
| Online Influence Maximization under Decreasing Cascade Model | May 19, 2023 | model | CodeCode Available | 0 |
| Metabolic Model-based Ecological Modeling for Probiotic Design | Oct 6, 2022 | model | CodeCode Available | 0 |
| How consistent is my model with the data? Information-Theoretic Model Check | Dec 7, 2017 | model | CodeCode Available | 0 |
| Rethinking the CSC Model for Natural Images | Sep 12, 2019 | Color Image DenoisingDenoising | CodeCode Available | 0 |
| Rethinking Weight-Averaged Model-merging | Nov 14, 2024 | model | CodeCode Available | 0 |
| Causal Discovery using Model Invariance through Knockoff Interventions | Jul 8, 2022 | Causal Discoverymodel | CodeCode Available | 0 |
| Do deep reinforcement learning agents model intentions? | May 15, 2018 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Vanishing Feature: Diagnosing Model Merging and Beyond | Feb 5, 2024 | Linear Mode Connectivitymodel | CodeCode Available | 0 |
| A model for efficient dynamical ranking in networks | Jul 25, 2023 | model | CodeCode Available | 0 |
| HMM Model for Brain Tumor Detection and Classification | Jun 22, 2021 | Brain Tumor ClassificationBrain Tumor Segmentation | CodeCode Available | 0 |
| ArthModel: Enhance Arithmetic Skills to Large Language Model | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Online Probabilistic Model Identification using Adaptive Recursive MCMC | Oct 23, 2022 | modelparameter estimation | CodeCode Available | 0 |
| Bivariate Causal Discovery using Bayesian Model Selection | Jun 5, 2023 | Causal Discoverymodel | CodeCode Available | 0 |
| Metaphor Detection with Cross-Lingual Model Transfer | Jun 1, 2014 | Decision MakingMachine Translation | CodeCode Available | 0 |
| Online Simultaneous Semi-Parametric Dynamics Model Learning | Oct 9, 2019 | model | CodeCode Available | 0 |
| Effective Causal Discovery under Identifiable Heteroscedastic Noise Model | Dec 20, 2023 | Causal Discoverymodel | CodeCode Available | 0 |
| Meta-Uncertainty in Bayesian Model Comparison | Oct 13, 2022 | model | CodeCode Available | 0 |
| Divergence Triangle for Joint Training of Generator Model, Energy-based Model, and Inference Model | Dec 28, 2018 | model | CodeCode Available | 0 |
| HMM-LSTM Fusion Model for Economic Forecasting | Jan 1, 2025 | model | CodeCode Available | 0 |
| Revealing the structure of language model capabilities | Jun 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Revisiting Bellman Errors for Offline Model Selection | Jan 31, 2023 | Atari Gamesmodel | CodeCode Available | 0 |
| Toward Scalable Neural Dialogue State Tracking Model | Dec 3, 2018 | Dialogue State Trackingmodel | CodeCode Available | 0 |
| Cardinality-Regularized Hawkes-Granger Model | Aug 23, 2022 | Managementmodel | CodeCode Available | 0 |
| mHuBERT-147: A Compact Multilingual HuBERT Model | Jun 10, 2024 | Automatic Speech Recognition (ASR)Diversity | CodeCode Available | 0 |
| A Flexible Storage Model for Power Network Optimization | Apr 29, 2020 | modelScheduling | CodeCode Available | 0 |
| Achieving Model Robustness through Discrete Adversarial Training | Apr 11, 2021 | model | CodeCode Available | 0 |
| Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning | Jul 4, 2023 | Distributional Reinforcement Learningmodel | CodeCode Available | 0 |
| High-Resolution Speech Restoration with Latent Diffusion Model | Sep 17, 2024 | modelSpeech Enhancement | CodeCode Available | 0 |
| Speaker Adaptive Training using Model Agnostic Meta-Learning | Oct 23, 2019 | Meta-Learningmodel | CodeCode Available | 0 |
| Can recurrent neural networks learn process model structure? | Dec 13, 2022 | modelPredictive Process Monitoring | CodeCode Available | 0 |
| RanDeS: Randomized Delta Superposition for Multi-Model Compression | May 16, 2025 | modelModel Compression | CodeCode Available | 0 |
| Speaker Sensitive Response Evaluation Model | Jun 12, 2020 | modelResponse Generation | CodeCode Available | 0 |
| The Deep Promotion Time Cure Model | May 19, 2023 | Computational Efficiencymodel | CodeCode Available | 0 |
| MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers | Oct 12, 2022 | model | CodeCode Available | 0 |