| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| LoRA ensembles for large language model fine-tuning | Sep 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| LLM-grounded Video Diffusion Models | Sep 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Large Language Model Alignment for Information Retrieval via Contrastive Feedback | Sep 29, 2023 | Data AugmentationInformation Retrieval | —Unverified | 0 |
| A Large Language Model Approach to Educational Survey Feedback Analysis | Sep 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Code-switching Speech Recognition with Interactive Language Biases | Sep 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Streaming Language Models with Attention Sinks | Sep 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Chatmap : Large Language Model Interaction with Cartographic Data | Sep 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Soft Ideologization via AI-Self-Consciousness | Sep 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language models in molecular discovery | Sep 28, 2023 | ChatbotComputational chemistry | —Unverified | 0 |
| Qwen Technical Report | Sep 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 |
| Unsupervised Pretraining for Fact Verification by Language Model Distillation | Sep 28, 2023 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR | Sep 28, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Augmenting LLMs with Knowledge: A survey on hallucination prevention | Sep 28, 2023 | HallucinationLanguage Modeling | —Unverified | 0 |
| MotionLM: Multi-Agent Motion Forecasting as Language Modeling | Sep 28, 2023 | Autonomous VehiclesLanguage Modeling | CodeCode Available | 1 |
| RLLTE: Long-Term Evolution Project of Reinforcement Learning | Sep 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Effective Long-Context Scaling of Foundation Models | Sep 27, 2023 | Continual PretrainingLanguage Modeling | CodeCode Available | 2 |
| AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model Routing with Benchmark Datasets | Sep 27, 2023 | Binary ClassificationLanguage Modeling | —Unverified | 0 |
| Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models | Sep 27, 2023 | HumanEvalLanguage Modeling | CodeCode Available | 0 |
| Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChatCounselor: A Large Language Models for Mental Health Support | Sep 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Graph Neural Prompting with Large Language Models | Sep 27, 2023 | Graph Neural NetworkKnowledge Graphs | CodeCode Available | 1 |
| MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation | Sep 26, 2023 | DecoderDialogue Generation | —Unverified | 0 |
| Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition | Sep 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Post-training Quantization with FP8 Formats | Sep 26, 2023 | image-classificationImage Classification | CodeCode Available | 4 |
| Large Language Model Alignment: A Survey | Sep 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Robustness of the Random Language Model | Sep 26, 2023 | Language AcquisitionLanguage Modeling | —Unverified | 0 |
| XGV-BERT: Leveraging Contextualized Language Model and Graph Neural Network for Efficient Software Vulnerability Detection | Sep 26, 2023 | Graph Neural NetworkLanguage Modeling | —Unverified | 0 |
| Speaker anonymization using neural audio codec language models | Sep 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LogGPT: Log Anomaly Detection via GPT | Sep 25, 2023 | Anomaly DetectionLanguage Modeling | CodeCode Available | 1 |
| Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Analyzing the Efficacy of an LLM-Only Approach for Image-based Document Question Answering | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Guess & Sketch: Language Model Guided Transpilation | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Connecting Speech Encoder and Large Language Model for ASR | Sep 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| pLMFPPred: a novel approach for accurate prediction of functional peptides integrating embedding from pre-trained protein language model and imbalanced learning | Sep 25, 2023 | feature selectionLanguage Modeling | —Unverified | 0 |
| Unsupervised Accent Adaptation Through Masked Language Model Correction Of Discrete Self-Supervised Speech Units | Sep 25, 2023 | Accented Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Cluster Language Model for Improved E-Commerce Retrieval and Ranking: Leveraging Query Similarity and Fine-Tuning for Personalized Results | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention | Sep 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Towards General-Purpose Text-Instruction-Guided Voice Conversion | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers | Sep 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Keeping in Time: Adding Temporal Context to Sentiment Analysis Models | Sep 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Substituting Data Annotation with Balanced Updates and Collective Loss in Multi-label Text Classification | Sep 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria | Sep 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Calibrating LLM-Based Evaluator | Sep 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| An In-depth Survey of Large Language Model-based Artificial Intelligence Agents | Sep 23, 2023 | AI AgentLanguage Modeling | —Unverified | 0 |
| BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Sep 23, 2023 | Code CompletionHallucination | CodeCode Available | 1 |
| From Text to Source: Results in Detecting Large Language Model-Generated Content | Sep 23, 2023 | AttributeLanguage Modeling | —Unverified | 0 |
| Resolving References in Visually-Grounded Dialogue via Text Generation | Sep 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 0 |