| Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| GenDistiller: Distilling Pre-trained Language Models based on Generative Models | Oct 20, 2023 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models | Oct 20, 2023 | Diversity, Language Modeling | Unverified | 0 |
| Ask Language Model to Clean Your Noisy Translation Data | Oct 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing Zero-Shot Crypto Sentiment with Fine-tuned Language Model and Prompt Engineering | Oct 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| MoqaGPT: Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LASER: Linear Compression in Wireless Distributed Optimization | Oct 19, 2023 | Distributed Optimization, Language Modeling | Unverified | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Oct 19, 2023 | Hallucination, Language Modeling | Code Available | 0 |
| Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer | Oct 19, 2023 | 8k, Computational Efficiency | Unverified | 0 |
| Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks | Oct 19, 2023 | Few-Shot Learning, In-Context Learning | Unverified | 0 |
| Label-Aware Automatic Verbalizer for Few-Shot Text Classification | Oct 19, 2023 | Few-Shot Text Classification, Language Modeling | Unverified | 0 |
| Knowledge-Augmented Language Model Verification | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter | Oct 19, 2023 | Contrastive Learning, IUPAC Name Prediction | Code Available | 1 |
| Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing | Oct 19, 2023 | Decoder, Language Model Evaluation | Unverified | 0 |
| Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond | Oct 19, 2023 | Image Captioning, Language Modeling | Unverified | 0 |
| Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model | Oct 19, 2023 | Causal Discovery, Language Modeling | Code Available | 0 |
| Named Entity Recognition for Monitoring Plant Health Threats in Tweets: a ChouBERT Approach | Oct 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| ICU: Conquering Language Barriers in Vision-and-Language Modeling by Dividing the Tasks into Image Captioning and Language Understanding | Oct 19, 2023 | Image Captioning, Language Modeling | Code Available | 0 |
| GestureGPT: Toward Zero-Shot Free-Form Hand Gesture Understanding with Large Language Model Agents | Oct 19, 2023 | Common Sense Reasoning, Form | Code Available | 0 |
| A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems | Oct 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model for Multi-objective Evolutionary Optimization | Oct 19, 2023 | Evolutionary Algorithms, Language Modeling | Code Available | 1 |
| Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| CLAIR: Evaluating Image Captions with Large Language Models | Oct 19, 2023 | Diversity, Image Captioning | Unverified | 0 |
| Character-level Chinese Backpack Language Models | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems | Oct 19, 2023 | In-Context Learning, Language Modeling | Code Available | 0 |
| TabuLa: Harnessing Language Models for Tabular Data Synthesis | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Data Augmentations for Improved (Large) Language Model Generalization | Oct 19, 2023 | Attribute, counterfactual | Unverified | 0 |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Oct 18, 2023 | 4k, image-classification | Code Available | 2 |
| Solving the multiplication problem of a large language model system using a graph-based method | Oct 18, 2023 | Chatbot, Language Modeling | Unverified | 0 |
| Preference Optimization for Molecular Language Models | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Document-Level Language Models for Machine Translation | Oct 18, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Pseudointelligence: A Unifying Framework for Language Model Evaluation | Oct 18, 2023 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Solving Hard Analogy Questions with Relation Embedding Chains | Oct 18, 2023 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition | Oct 17, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-stage Large Language Model Correction for Speech Recognition | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| BitNet: Scaling 1-bit Transformers for Large Language Models | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle | Oct 17, 2023 | Anomaly Detection, Decision Making | Unverified | 0 |
| ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Correction Focused Language Model Training for Speech Recognition | Oct 17, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Learn Your Tokens: Word-Pooled Tokenization for Language Modeling | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Watermarking LLMs with Weight Quantization | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation | Oct 17, 2023 | Data Augmentation, Language Modeling | Unverified | 0 |
| Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue | Oct 17, 2023 | In-Context Learning, Language Modeling | Unverified | 0 |