| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 1 |
| Is GPT-4 a reliable rater? Evaluating Consistency in GPT-4 Text Ratings | Aug 3, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Displays Emergent Ability to Interpret Novel Literary Metaphors | Aug 3, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| InterAct: Exploring the Potentials of ChatGPT as a Cooperative Agent | Aug 3, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Specious Sites: Tracking the Spread and Sway of Spurious News Stories at Scale | Aug 3, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Evaluating ChatGPT text-mining of clinical records for obesity monitoring | Aug 3, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Knowledge-aware Collaborative Filtering with Pre-trained Language Model for Personalized Review-based Rating Prediction | Aug 2, 2023 | Collaborative Filtering, Language Modeling | Code Available | 0 |
| A Practical Deep Learning-Based Acoustic Side Channel Attack on Keyboards | Aug 2, 2023 | Deep Learning, Language Modeling | Code Available | 1 |
| Teaching Smaller Language Models To Generalise To Unseen Compositional Questions | Aug 2, 2023 | ARC, Information Retrieval | Code Available | 0 |
| Do Multilingual Language Models Think Better in English? | Aug 2, 2023 | Common Sense Reasoning, Cross-Lingual Natural Language Inference | Code Available | 1 |
| Arithmetic with Language Models: from Memorization to Computation | Aug 2, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| ChatMOF: An Autonomous AI System for Predicting and Generating Metal-Organic Frameworks | Aug 1, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Detecting Cloud Presence in Satellite Images Using the RGB-based CLIP Vision-Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| JIANG: Chinese Open Foundation Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Advancing Beyond Identification: Multi-bit Watermark for Large Language Models | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code | Aug 1, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards Effective Ancient Chinese Translation: Dataset, Model, and Evaluation | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 4 |
| An Effective Data Creation Pipeline to Generate High-quality Financial Instruction Data for Large Language Model | Jul 31, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| HouYi: An open-source large language model specially designed for renewable energy and carbon neutrality field | Jul 31, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LP-MusicCaps: LLM-Based Pseudo Music Captioning | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Generative Models as a Complex Systems Science: How can we make sense of large language model behavior? | Jul 31, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Camoscio: an Italian Instruction-tuned LLaMA | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning | Jul 29, 2023 | Few-Shot Learning, Language Modeling | Code Available | 0 |
| VeriGen: A Large Language Model for Verilog Code Generation | Jul 28, 2023 | Code Generation, Language Modeling | Unverified | 0 |
| The Hydra Effect: Emergent Self-repair in Language Model Computations | Jul 28, 2023 | Form, Language Modeling | Unverified | 0 |
| TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety | Jul 28, 2023 | 2k, Language Modeling | Code Available | 1 |
| RSGPT: A Remote Sensing Vision Language Model and Benchmark | Jul 28, 2023 | Image Captioning, Language Modeling | Code Available | 1 |
| Multilingual Tourist Assistance using ChatGPT: Comparing Capabilities in Hindi, Telugu, and Kannada | Jul 28, 2023 | General Knowledge, Language Modeling | Unverified | 0 |
| Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding | Jul 28, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Robust Distortion-free Watermarks for Language Models | Jul 28, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation | Jul 28, 2023 | Articles, Language Modeling | Unverified | 0 |
| A Critical Review of Large Language Models: Sensitivity, Bias, and the Path Toward Specialized AI | Jul 28, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation | Jul 27, 2023 | 3D geometry, Few-Shot Learning | Code Available | 2 |
| TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer | Jul 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| ArcGPT: A Large Language Model Tailored for Real-world Archival Applications | Jul 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs | Jul 27, 2023 | Document Classification, Knowledge Distillation | Unverified | 0 |
| A Transformer-based Approach for Arabic Offline Handwritten Text Recognition | Jul 27, 2023 | Handwriting Recognition, Handwritten Text Recognition | Unverified | 0 |
| SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark | Jul 27, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| A Geometric Notion of Causal Probing | Jul 27, 2023 | counterfactual, Language Modeling | Code Available | 0 |
| How User Language Affects Conflict Fatality Estimates in ChatGPT | Jul 26, 2023 | Information Retrieval, Language Modeling | Unverified | 0 |
| Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models | Jul 26, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Data Augmentation for Neural Machine Translation using Generative Language Model | Jul 26, 2023 | Data Augmentation, Diversity | Unverified | 0 |
| Utilizing Large Language Models for Natural Interface to Pharmacology Databases | Jul 26, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| CliniDigest: A Case Study in Large Language Model Based Large-Scale Summarization of Clinical Trial Descriptions | Jul 26, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| FinTree: Financial Dataset Pretrain Transformer Encoder for Relation Extraction | Jul 26, 2023 | Financial Relation Extraction, Language Modeling | Code Available | 0 |
| A Predictive Model of Digital Information Engagement: Forecasting User Engagement With English Words by Incorporating Cognitive Biases, Computational Linguistics and Natural Language Processing | Jul 26, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| A large language model-assisted education tool to provide feedback on open-ended responses | Jul 25, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| XDLM: Cross-lingual Diffusion Language Model for Machine Translation | Jul 25, 2023 | Image Generation, Language Modeling | Unverified | 0 |