| Large Language Models as Generalizable Policies for Embodied Tasks | Oct 26, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Content-based Controls For Music Large Language Modeling | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CompeteAI: Understanding the Competition Dynamics in Large Language Model-based Agents | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Oct 26, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs | Oct 25, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| BOOST: Harnessing Black-Box Control to Boost Commonsense in LMs' Generation | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unraveling Feature Extraction Mechanisms in Neural Networks | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories | Oct 25, 2023 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation | Oct 25, 2023 | Conversational RecommendationData Augmentation | CodeCode Available | 0 |
| URL-BERT: Training Webpage Representations via Social Media Engagements | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Oct 25, 2023 | Data-to-Text GenerationHallucination | CodeCode Available | 0 |
| Controlled Decoding from Language Models | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformer-based Live Update Generation for Soccer Matches from Microblog Posts | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Conditionally Combining Robot Skills using Large Language Models | Oct 25, 2023 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 0 |
| General Point Model with Autoencoding and Autoregressive | Oct 25, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Zephyr: Direct Distillation of LM Alignment | Oct 25, 2023 | 2D Cyclist DetectionFew-Shot Learning | CodeCode Available | 5 |
| RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models | Oct 25, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | Oct 25, 2023 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| XFEVER: Exploring Fact Verification across Languages | Oct 25, 2023 | BenchmarkingFact Verification | CodeCode Available | 0 |
| SkyMath: Technical Report | Oct 25, 2023 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| Multiple Key-value Strategy in Recommendation Systems Incorporating Large Language Model | Oct 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning | Oct 25, 2023 | Federated LearningLanguage Modeling | —Unverified | 0 |
| Faithful Path Language Modeling for Explainable Recommendation over Knowledge Graph | Oct 25, 2023 | Explainable RecommendationKnowledge Graph Embeddings | —Unverified | 0 |
| DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery | Oct 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| BLP-2023 Task 2: Sentiment Analysis | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Locally Differentially Private Document Generation Using Zero Shot Prompting | Oct 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering | Oct 24, 2023 | Federated LearningLanguage Modeling | —Unverified | 0 |
| TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction | Oct 24, 2023 | Food recommendationIn-Context Learning | —Unverified | 0 |
| Prevalence and prevention of large language model use in crowd work | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PromptInfuser: How Tightly Coupling AI and UI Design Impacts Designers' Workflows | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning | Oct 24, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection | Oct 24, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 |
| CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model | Oct 24, 2023 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing | Oct 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To Word--Definition Alignment | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Language Model with Limited Memory Capacity Captures Interference in Human Sentence Processing | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature | Oct 24, 2023 | Abstractive Text SummarizationInformation Retrieval | CodeCode Available | 1 |
| Facilitating Self-Guided Mental Health Interventions Through Human-Language Model Interaction: A Case Study of Cognitive Restructuring | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TRAMS: Training-free Memory Selection for Long-range Language Modeling | Oct 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unnatural language processing: How do language models handle machine-generated prompts? | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WebWISE: Web Interface Control and Sequential Exploration with Large Language Models | Oct 24, 2023 | Imitation LearningIn-Context Learning | —Unverified | 0 |
| E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |