| MATHWELL: Generating Educational Math Word Problems Using Teacher Annotations | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Empowering Large Language Model Agents through Action Learning | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Feb 23, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | Feb 23, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Item-side Fairness of Large Language Model-based Recommendation System | Feb 23, 2024 | FairnessLanguage Modeling | CodeCode Available | 0 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| Repetition Improves Language Model Embeddings | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| ArabianGPT: Native Arabic GPT-based Large Language Model | Feb 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials | Feb 22, 2024 | Chart Question AnsweringLanguage Modeling | CodeCode Available | 1 |
| Watermarking Makes Language Models Radioactive | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI AgentAudio Generation | CodeCode Available | 1 |
| INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models | Feb 22, 2024 | Information RetrievalInstruction Following | CodeCode Available | 1 |
| Optimizing Language Models for Human Preferences is a Causal Inference Problem | Feb 22, 2024 | Causal InferenceLanguage Modeling | —Unverified | 0 |
| Learning to Reduce: Optimal Representations of Structured Data in Prompting Large Language Models | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PALO: A Polyglot Large Multimodal Model for 5B People | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Dependency Annotation of Ottoman Turkish with Multilingual BERT | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Balanced Data Sampling for Language Model Training with Clustering | Feb 22, 2024 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automating psychological hypothesis generation with AI: when large language models meet causal graph | Feb 22, 2024 | ArticlesKnowledge Graphs | —Unverified | 0 |
| Uncertainty-Aware Evaluation for Vision-Language Models | Feb 22, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| COMPASS: Computational Mapping of Patient-Therapist Alliance Strategies with Language Modeling | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Keywords to Structured Summaries: Streamlining Scholarly Information Access | Feb 22, 2024 | ArticlesInformation Retrieval | —Unverified | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial AttackContrastive Learning | —Unverified | 0 |
| Cleaner Pretraining Corpus Curation with Neural Web Scraping | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Subobject-level Image Tokenization | Feb 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| Q-Probe: A Lightweight Approach to Reward Maximization for Language Models | Feb 22, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding the Dataset Practitioners Behind Large Language Model Development | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Combining Language and Graph Models for Semi-structured Information Extraction on the Web | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance | Feb 21, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 0 |
| Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Breaking the Barrier: Utilizing Large Language Models for Industrial Recommendation Systems through an Inferential Knowledge Graph | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives | Feb 21, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Feb 21, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Knowledge Graph Enhanced Large Language Model Editing | Feb 21, 2024 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Towards Building Multilingual Language Model for Medicine | Feb 21, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 3 |
| GCOF: Self-iterative Text Generation for Copywriting Using Large Language Model | Feb 21, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Privacy-Preserving Instructions for Aligning Large Language Models | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning | Feb 21, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Kuaiji: the First Chinese Accounting Large Language Model | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Round Trip Translation Defence against Large Language Model Jailbreaking Attacks | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |