| ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation | Oct 17, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament | Oct 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle | Oct 17, 2023 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Learn Your Tokens: Word-Pooled Tokenization for Language Modeling | Oct 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Correction Focused Language Model Training for Speech Recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models | Oct 17, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Interactive Task Planning with Language Models | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contextual Data Augmentation for Task-Oriented Dialog Systems | Oct 16, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DavIR: Data Selection via Implicit Reward for Large Language Models | Oct 16, 2023 | Causal Language ModelingGSM8K | —Unverified | 0 |
| Improving Large Language Model Fine-tuning for Solving Math Problems | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Oct 16, 2023 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset | Oct 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Use of probabilistic phrases in a coordination game: human versus GPT-4 | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia | Oct 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Aware In-Context Learning for Code Generation | Oct 15, 2023 | Code GenerationContrastive Learning | —Unverified | 0 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 |
| Domain-Specific Language Model Post-Training for Indonesian Financial NLP | Oct 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ACES: Generating Diverse Programming Puzzles with with Autotelic Generative Models | Oct 15, 2023 | Code SearchDiversity | —Unverified | 0 |
| Large Vocabulary Spontaneous Speech Recognition for Tigrigna | Oct 15, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation | Oct 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Farzi Data: Autoregressive Data Distillation | Oct 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Segmentation: Road Network Generation with Multi-Modal LLMs | Oct 15, 2023 | Autonomous NavigationLanguage Modeling | —Unverified | 0 |
| A study of the impact of generative AI-based data augmentation on software metadata classification | Oct 14, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Enhancing Binary Code Comment Quality Classification: Integrating Generative AI for Improved Accuracy | Oct 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Legend at ArAIEval Shared Task: Persuasion Technique Detection using a Language-Agnostic Text Representation Model | Oct 14, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Software Metadata Classification based on Generative Artificial Intelligence | Oct 14, 2023 | ClassificationLanguage Modeling | —Unverified | 0 |
| PuoBERTa: Training and evaluation of a curated language model for Setswana | Oct 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Consensus Game: Language Model Generation via Equilibrium Search | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation | Oct 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A ML-LLM pairing for better code comment classification | Oct 13, 2023 | ClassificationInformation Retrieval | —Unverified | 0 |
| Human-in-the-loop Machine Translation with Large Language Model | Oct 13, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Case-Based Persistent Memory for a Large Language Model | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Incremental Object Detection with CLIP | Oct 13, 2023 | Class-Incremental Object DetectionIncremental Learning | —Unverified | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems | Oct 13, 2023 | Collaborative FilteringDecision Making | —Unverified | 0 |
| Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GameGPT: Multi-agent Collaborative Framework for Game Development | Oct 12, 2023 | Code GenerationHallucination | —Unverified | 0 |
| Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability | Oct 12, 2023 | Causal Language ModelingIn-Context Learning | CodeCode Available | 0 |
| Expanding the Vocabulary of BERT for Knowledge Base Construction | Oct 12, 2023 | Knowledge Base ConstructionKnowledge Base Population | CodeCode Available | 0 |
| Harnessing Large Language Models' Empathetic Response Generation Capabilities for Online Mental Health Counselling Support | Oct 12, 2023 | Empathetic Response GenerationLanguage Modeling | —Unverified | 0 |
| GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Oct 12, 2023 | Answer GenerationHallucination | CodeCode Available | 0 |
| Large language models can replicate cross-cultural differences in personality | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Oct 12, 2023 | Image CaptioningImage-text Retrieval | —Unverified | 0 |
| Toward Joint Language Modeling for Speech Units and Text | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multimodal Large Language Model for Visual Navigation | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring Feature Sparsity in Language Models | Oct 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |