| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| Empowering Large Language Model Agents through Action Learning | Feb 24, 2024 | Language Modeling | Code Available | 1 |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Feb 23, 2024 | Information Retrieval, Language Modeling | Code Available | 1 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language Modeling | Code Available | 5 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | Feb 23, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language Modeling | Code Available | 2 |
| Item-side Fairness of Large Language Model-based Recommendation System | Feb 23, 2024 | Fairness, Language Modeling | Code Available | 0 |
| PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Feb 23, 2024 | Language Modeling | Unverified | 0 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 |
| Repetition Improves Language Model Embeddings | Feb 23, 2024 | Language Modeling | Code Available | 5 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language Modeling | Code Available | 0 |
| ArabianGPT: Native Arabic GPT-based Large Language Model | Feb 23, 2024 | Language Modeling | Unverified | 0 |
| SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials | Feb 22, 2024 | Chart Question Answering, Language Modeling | Code Available | 1 |
| INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models | Feb 22, 2024 | Information Retrieval, Instruction Following | Code Available | 1 |
| Optimizing Language Models for Human Preferences is a Causal Inference Problem | Feb 22, 2024 | Causal Inference, Language Modeling | Unverified | 0 |
| Watermarking Makes Language Models Radioactive | Feb 22, 2024 | Language Modeling | Code Available | 1 |
| Learning to Reduce: Optimal Representations of Structured Data in Prompting Large Language Models | Feb 22, 2024 | Language Modeling | Unverified | 0 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI Agent, Audio Generation | Code Available | 1 |
| Subobject-level Image Tokenization | Feb 22, 2024 | Attribute, Language Modeling | Code Available | 2 |
| Dependency Annotation of Ottoman Turkish with Multilingual BERT | Feb 22, 2024 | Language Modeling | Unverified | 0 |
| Balanced Data Sampling for Language Model Training with Clustering | Feb 22, 2024 | Clustering, Language Modeling | Code Available | 1 |
| A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Feb 22, 2024 | Language Modeling | Unverified | 0 |
| Automating psychological hypothesis generation with AI: when large language models meet causal graph | Feb 22, 2024 | Articles, Knowledge Graphs | Unverified | 0 |
| PALO: A Polyglot Large Multimodal Model for 5B People | Feb 22, 2024 | Language Modeling | Code Available | 2 |