| Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs | Aug 30, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Accommodating Audio Modality in CLIP for Multimodal Processing | Mar 12, 2023 | AudioCapsContrastive Learning | CodeCode Available | 0 |
| Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions | Mar 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HateBERT: Retraining BERT for Abusive Language Detection in English | Oct 23, 2020 | Abusive LanguageHate Speech Detection | CodeCode Available | 0 |
| HATE-ITA: New Baselines for Hate Speech Detection in Italian | Jul 1, 2022 | BenchmarkingHate Speech Detection | CodeCode Available | 0 |
| Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? | Nov 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Debiasing Pre-Trained Language Models via Efficient Fine-Tuning | May 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DATETIME: A new benchmark to measure LLM translation and reasoning capabilities | Apr 22, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction | Sep 17, 2024 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 0 |
| Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval | Dec 21, 2024 | Computational EfficiencyLanguage Modeling | CodeCode Available | 0 |
| Attention as a Guide for Simultaneous Speech Translation | Dec 15, 2022 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Attacks on Third-Party APIs of Large Language Models | Apr 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication | Jan 3, 2024 | ClassificationICU Admission | CodeCode Available | 0 |
| DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization | Aug 14, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| Is Your Large Language Model Knowledgeable or a Choices-Only Cheater? | Jul 2, 2024 | Graph MiningLanguage Modeling | CodeCode Available | 0 |
| Heaps' Law in GPT-Neo Large Language Model Emulated Corpora | Nov 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Transformer with Stack Attention | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data Similarity is Not Enough to Explain Language Model Performance | Nov 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task | Oct 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Oct 12, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| "It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation Systems | Sep 15, 2021 | Common Sense ReasoningConversational Recommendation | CodeCode Available | 0 |
| “It doesn’t look good for a date”: Transforming Critiques into Preferences for Conversational Recommendation Systems | Nov 1, 2021 | Common Sense ReasoningConversational Recommendation | CodeCode Available | 0 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Jul 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data Selection for Fine-tuning Large Language Models Using Transferred Shapley Values | Jun 16, 2023 | Data ValuationLanguage Modeling | CodeCode Available | 0 |
| Large Language Model Critics for Execution-Free Evaluation of Code Changes | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data Noising as Smoothing in Neural Network Language Models | Mar 7, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | AllLanguage Modeling | CodeCode Available | 0 |
| Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks | Nov 20, 2020 | GPULanguage Modeling | CodeCode Available | 0 |
| Learning to Maximize Mutual Information for Chain-of-Thought Distillation | Mar 5, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task | Oct 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Item-side Fairness of Large Language Model-based Recommendation System | Feb 23, 2024 | FairnessLanguage Modeling | CodeCode Available | 0 |
| A Toolkit for Efficient Learning of Lexical Units for Speech Recognition | May 1, 2014 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Learning to Plan for Language Modeling from Unlabeled Data | Mar 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL | Sep 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DATA: Differentiable ArchiTecture Approximation | Dec 1, 2019 | image-classificationImage Classification | CodeCode Available | 0 |
| DataChat: Prototyping a Conversational Agent for Dataset Search and Visualization | May 26, 2023 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| Can discrete information extraction prompts generalize across language models? | Feb 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data augmentation using prosody and false starts to recognize non-native children's speech | Aug 29, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | counterfactualData Augmentation | CodeCode Available | 0 |
| Data Augmentation for Biomedical Factoid Question Answering | Apr 10, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 |
| Linking Theories and Methods in Cognitive Sciences via Joint Embedding of the Scientific Literature: The Example of Cognitive Control | Mar 16, 2022 | Graph EmbeddingLanguage Modeling | CodeCode Available | 0 |
| Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Heterogeneous Subgraph Transformer for Fake News Detection | Apr 19, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 0 |
| AlgebraNets | Jun 12, 2020 | Computational Efficiencyimage-classification | CodeCode Available | 0 |
| Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models | Oct 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DarijaBanking: A New Resource for Overcoming Language Barriers in Banking Intent Detection for Moroccan Arabic Speakers | May 26, 2024 | intent-classificationIntent Classification | CodeCode Available | 0 |
| DALLMi: Domain Adaption for LLM-based Multi-label Classifier | May 3, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Large Language Model-Driven Curriculum Design for Mobile Networks | May 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Cynical Selection of Language Model Training Data | Sep 7, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 |