| Named Entity Recognition for Monitoring Plant Health Threats in Tweets: a ChouBERT Approach | Oct 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| ICU: Conquering Language Barriers in Vision-and-Language Modeling by Dividing the Tasks into Image Captioning and Language Understanding | Oct 19, 2023 | Image Captioning, Language Modeling | Code Available | 0 |
| Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond | Oct 19, 2023 | Image Captioning, Language Modeling | Unverified | 0 |
| A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems | Oct 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| CLAIR: Evaluating Image Captions with Large Language Models | Oct 19, 2023 | Diversity, Image Captioning | Unverified | 0 |
| Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks | Oct 19, 2023 | Few-Shot Learning, In-Context Learning | Unverified | 0 |
| LASER: Linear Compression in Wireless Distributed Optimization | Oct 19, 2023 | Distributed Optimization, Language Modeling | Unverified | 0 |
| Data Augmentations for Improved (Large) Language Model Generalization | Oct 19, 2023 | Attribute, counterfactual | Unverified | 0 |
| Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing | Oct 19, 2023 | Decoder, Language Model Evaluation | Unverified | 0 |
| GestureGPT: Toward Zero-Shot Free-Form Hand Gesture Understanding with Large Language Model Agents | Oct 19, 2023 | Common Sense Reasoning, Form | Code Available | 0 |
| Label-Aware Automatic Verbalizer for Few-Shot Text Classification | Oct 19, 2023 | Few-Shot Text Classification, Language Modeling | Unverified | 0 |
| Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems | Oct 19, 2023 | In-Context Learning, Language Modeling | Code Available | 0 |
| Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer | Oct 19, 2023 | 8k, Computational Efficiency | Unverified | 0 |
| Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model | Oct 19, 2023 | Causal Discovery, Language Modeling | Code Available | 0 |
| Document-Level Language Models for Machine Translation | Oct 18, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Solving the multiplication problem of a large language model system using a graph-based method | Oct 18, 2023 | Chatbot, Language Modeling | Unverified | 0 |
| Preference Optimization for Molecular Language Models | Oct 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Pseudointelligence: A Unifying Framework for Language Model Evaluation | Oct 18, 2023 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Solving Hard Analogy Questions with Relation Embedding Chains | Oct 18, 2023 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue | Oct 17, 2023 | In-Context Learning, Language Modeling | Unverified | 0 |
| ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Multi-stage Large Language Model Correction for Speech Recognition | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0 |