| Towards Optimal Learning of Language Models | Feb 27, 2024 | Data CompressionLanguage Modeling | —Unverified | 0 |
| Stable LM 2 1.6B Technical Report | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Topic-to-essay generation with knowledge-based content selection | Feb 26, 2024 | DecoderDiversity | —Unverified | 0 |
| Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Nemotron-4 15B Technical Report | Feb 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding | Feb 26, 2024 | DecoderInstruction Following | —Unverified | 0 |
| OncoGPT: A Medical Conversational Model Tailored with Oncology Domain Expertise on a Large Language Model Meta-AI (LLaMA) | Feb 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On Languaging a Simulation Engine | Feb 26, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Feb 26, 2024 | Causal Language ModelingGeneralized Referring Expression Segmentation | —Unverified | 0 |
| ESG Sentiment Analysis: comparing human and language model performance including GPT | Feb 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentationdocument understanding | —Unverified | 0 |
| A Comprehensive Evaluation of Quantization Strategies for Large Language Models | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | BenchmarkingChatbot | CodeCode Available | 0 |
| Building Flexible Machine Learning Models for Scientific Computing at Scale | Feb 25, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Feb 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bootstrapping Cognitive Agents with a Large Language Model | Feb 25, 2024 | General KnowledgeLanguage Modeling | —Unverified | 0 |
| PIDformer: Transformer Meets Control Theory | Feb 25, 2024 | Image SegmentationLanguage Modeling | —Unverified | 0 |
| Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space | Feb 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NeSy is alive and well: A LLM-driven symbolic approach for better code comment data generation and classification | Feb 25, 2024 | ClassificationData Augmentation | CodeCode Available | 0 |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ByteComposer: a Human-like Melody Composition Method based on Language Model Agent | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics | Feb 24, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology | Feb 24, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhanced User Interaction in Operating Systems through Machine Learning Language Models | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Cloud-Based Large Language Model Processing with Elasticsearch and Transformer Models | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ArabianGPT: Native Arabic GPT-based Large Language Model | Feb 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | Feb 23, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Item-side Fairness of Large Language Model-based Recommendation System | Feb 23, 2024 | FairnessLanguage Modeling | CodeCode Available | 0 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task | Feb 22, 2024 | Adversarial AttackContrastive Learning | —Unverified | 0 |
| Optimizing Language Models for Human Preferences is a Causal Inference Problem | Feb 22, 2024 | Causal InferenceLanguage Modeling | —Unverified | 0 |
| CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| COMPASS: Computational Mapping of Patient-Therapist Alliance Strategies with Language Modeling | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dependency Annotation of Ottoman Turkish with Multilingual BERT | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Reduce: Optimal Representations of Structured Data in Prompting Large Language Models | Feb 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automating psychological hypothesis generation with AI: when large language models meet causal graph | Feb 22, 2024 | ArticlesKnowledge Graphs | —Unverified | 0 |
| From Keywords to Structured Summaries: Streamlining Scholarly Information Access | Feb 22, 2024 | ArticlesInformation Retrieval | —Unverified | 0 |
| BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives | Feb 21, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Graph Enhanced Large Language Model Editing | Feb 21, 2024 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Combining Language and Graph Models for Semi-structured Information Extraction on the Web | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking the Barrier: Utilizing Large Language Models for Industrial Recommendation Systems through an Inferential Knowledge Graph | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance | Feb 21, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 0 |
| GCOF: Self-iterative Text Generation for Copywriting Using Large Language Model | Feb 21, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Kuaiji: the First Chinese Accounting Large Language Model | Feb 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |