| OptiMUS: Optimization Modeling Using MIP Solvers and large language models | Oct 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Transformers and Large Language Models for Chemistry and Drug Discovery | Oct 9, 2023 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training | Oct 9, 2023 | DecoderGPU | —Unverified | 0 |
| Rethinking Memory and Communication Cost for Efficient Large Language Model Training | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Factual and Personalized Recommendations using Language Models and Reinforcement Learning | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating Numbers without Regression | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Oct 9, 2023 | Action RecognitionImage Generation | CodeCode Available | 4 |
| Transcending the Attention Paradigm: Representation Learning from Geospatial Social Media Data | Oct 9, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Oct 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution | Oct 9, 2023 | AttributeLanguage Modeling | CodeCode Available | 0 |
| A Meta-Learning Perspective on Transformers for Causal Language Modeling | Oct 9, 2023 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |
| Guiding Language Model Reasoning with Planning Tokens | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GraphLLM: Boosting Graph Reasoning Ability of Large Language Model | Oct 9, 2023 | Graph LearningLanguage Modeling | CodeCode Available | 1 |
| Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformer Fusion with Optimal Transport | Oct 9, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| CCAE: A Corpus of Chinese-based Asian Englishes | Oct 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection | Oct 8, 2023 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| InstructDET: Diversifying Referring Object Detection with Generalized Instructions | Oct 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Synslator: An Interactive Machine Translation Tool with Online Learning | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model | Oct 8, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Spoken Language Model based on continuous word-sized audio tokens | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimizing Large Language Models to Expedite the Development of Smart Contracts | Oct 8, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge | Oct 8, 2023 | ARCLanguage Modeling | CodeCode Available | 1 |
| ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data | Oct 8, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling | Oct 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Question-focused Summarization by Decomposing Articles into Facts and Opinions and Retrieving Entities | Oct 7, 2023 | ArticlesDecision Making | —Unverified | 0 |
| ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations | Oct 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity | Oct 6, 2023 | Image GenerationLanguage Modeling | —Unverified | 0 |
| RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation | Oct 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Quantized Transformer Language Model Implementations on Edge Devices | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Functional Interpolation for Relative Positions Improves Long Context Transformers | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From task structures to world models: What do LLMs know? | Oct 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An In-Context Learning Agent for Formal Theorem-Proving | Oct 6, 2023 | Automated Theorem ProvingIn-Context Learning | CodeCode Available | 1 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| PepMLM: Target Sequence-Conditioned Generation of Therapeutic Peptide Binders via Span Masked Language Modeling | Oct 5, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction | Oct 5, 2023 | Document SummarizationInformativeness | —Unverified | 0 |
| Neural Language Model Pruning for Automatic Speech Recognition | Oct 5, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards | Oct 5, 2023 | Document SummarizationLanguage Modeling | —Unverified | 0 |
| GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Oct 5, 2023 | Event Argument ExtractionEvent Extraction | CodeCode Available | 2 |
| TRAM: Bridging Trust Regions and Sharpness Aware Minimization | Oct 5, 2023 | Cross-Lingual TransferDomain Generalization | CodeCode Available | 0 |
| DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Oct 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization | Oct 5, 2023 | AllLanguage Modeling | CodeCode Available | 1 |
| Deep Representations of First-person Pronouns for Prediction of Depression Symptom Severity | Oct 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A 5' UTR Language Model for Decoding Untranslated Regions of mRNA and Function Predictions | Oct 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning | Oct 4, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |