| Towards Optimal Learning of Language Models | Feb 27, 2024 | Data Compression, Language Modeling | Unverified | 0 |
| Stable LM 2 1.6B Technical Report | Feb 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Topic-to-essay generation with knowledge-based content selection | Feb 26, 2024 | Decoder, Diversity | Unverified | 0 |
| Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup | Feb 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Nemotron-4 15B Technical Report | Feb 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding | Feb 26, 2024 | Decoder, Instruction Following | Unverified | 0 |
| OncoGPT: A Medical Conversational Model Tailored with Oncology Domain Expertise on a Large Language Model Meta-AI (LLaMA) | Feb 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On Languaging a Simulation Engine | Feb 26, 2024 | Diversity, Language Modeling | Unverified | 0 |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Feb 26, 2024 | Causal Language Modeling, Generalized Referring Expression Segmentation | Unverified | 0 |
| ESG Sentiment Analysis: comparing human and language model performance including GPT | Feb 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentation, Document Understanding | Unverified | 0 |
| A Comprehensive Evaluation of Quantization Strategies for Large Language Models | Feb 26, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | Benchmarking, Chatbot | Code Available | 0 |
| Building Flexible Machine Learning Models for Scientific Computing at Scale | Feb 25, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Feb 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Bootstrapping Cognitive Agents with a Large Language Model | Feb 25, 2024 | General Knowledge, Language Modeling | Unverified | 0 |
| PIDformer: Transformer Meets Control Theory | Feb 25, 2024 | Image Segmentation, Language Modeling | Unverified | 0 |
| Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space | Feb 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| NeSy is alive and well: A LLM-driven symbolic approach for better code comment data generation and classification | Feb 25, 2024 | Classification, Data Augmentation | Code Available | 0 |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Feb 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition | Feb 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ByteComposer: a Human-like Melody Composition Method based on Language Model Agent | Feb 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics | Feb 24, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 |
| Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology | Feb 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |