| Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information | Feb 28, 2024 | Language Modeling | Unverified | 0 |
| XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection | Feb 27, 2024 | Language Modeling | Code Available | 1 |
| A Language Model based Framework for New Concept Placement in Ontologies | Feb 27, 2024 | Contrastive Learning, Entity Linking | Code Available | 1 |
| Stable LM 2 1.6B Technical Report | Feb 27, 2024 | Language Modeling | Code Available | 0 |
| Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey | Feb 27, 2024 | Language Modeling | Code Available | 2 |
| Tower: An Open Multilingual Large Language Model for Translation-Related Tasks | Feb 27, 2024 | Language Modeling | Code Available | 4 |
| RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Feb 27, 2024 | Attribute, Language Modeling | Code Available | 2 |
| BASES: Large-scale Web Search User Simulation with Large Language Model based Agents | Feb 27, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Towards Optimal Learning of Language Models | Feb 27, 2024 | Data Compression, Language Modeling | Unverified | 0 |
| Retrieval is Accurate Generation | Feb 27, 2024 | Language Modeling | Code Available | 2 |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Feb 27, 2024 | Document Classification, Language Modeling | Code Available | 1 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k, Language Modeling | Code Available | 0 |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Feb 27, 2024 | 3D geometry, 3D Object Captioning | Code Available | 3 |
| A Neural Rewriting System to Solve Algorithmic Problems | Feb 27, 2024 | Language Modeling | Unverified | 0 |
| SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition | Feb 27, 2024 | Instruction Following, Language Modeling | Code Available | 3 |
| Large Language Model for Participatory Urban Planning | Feb 27, 2024 | Language Modeling | Unverified | 0 |
| OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web | Feb 27, 2024 | Language Modeling | Unverified | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentation, document understanding | Unverified | 0 |
| Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup | Feb 26, 2024 | Language Modeling | Code Available | 0 |
| Long-Context Language Modeling with Parallel Context Encoding | Feb 26, 2024 | In-Context Learning, Instruction Following | Code Available | 2 |
| ESG Sentiment Analysis: comparing human and language model performance including GPT | Feb 26, 2024 | Language Modeling | Unverified | 0 |
| OncoGPT: A Medical Conversational Model Tailored with Oncology Domain Expertise on a Large Language Model Meta-AI (LLaMA) | Feb 26, 2024 | Language Modeling | Unverified | 0 |
| Nemotron-4 15B Technical Report | Feb 26, 2024 | Language Modeling | Unverified | 0 |
| Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space | Feb 26, 2024 | Language Modeling | Code Available | 1 |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Feb 26, 2024 | Causal Language Modeling, Generalized Referring Expression Segmentation | Unverified | 0 |
| Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding | Feb 26, 2024 | Decoder, Instruction Following | Unverified | 0 |
| A Comprehensive Evaluation of Quantization Strategies for Large Language Models | Feb 26, 2024 | Language Modeling | Code Available | 0 |
| RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation | Feb 26, 2024 | Code Documentation Generation, Code Generation | Code Available | 4 |
| Topic-to-essay generation with knowledge-based content selection | Feb 26, 2024 | Decoder, Diversity | Unverified | 0 |
| On Languaging a Simulation Engine | Feb 26, 2024 | Diversity, Language Modeling | Unverified | 0 |
| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language Modeling | Code Available | 1 |
| Bootstrapping Cognitive Agents with a Large Language Model | Feb 25, 2024 | General Knowledge, Language Modeling | Unverified | 0 |
| Building Flexible Machine Learning Models for Scientific Computing at Scale | Feb 25, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space | Feb 25, 2024 | Language Modeling | Unverified | 0 |
| Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step | Feb 25, 2024 | Code Generation, HumanEval | Code Available | 4 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| PIDformer: Transformer Meets Control Theory | Feb 25, 2024 | Image Segmentation, Language Modeling | Unverified | 0 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph Learning, Language Modeling | Code Available | 2 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | Benchmarking, Chatbot | Code Available | 0 |
| NeSy is alive and well: A LLM-driven symbolic approach for better code comment data generation and classification | Feb 25, 2024 | Classification, Data Augmentation | Code Available | 0 |
| Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Feb 25, 2024 | Language Modeling | Unverified | 0 |
| Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression | Feb 25, 2024 | Decoder, Language Modeling | Code Available | 1 |
| Enhancing Cloud-Based Large Language Model Processing with Elasticsearch and Transformer Models | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| Enhanced User Interaction in Operating Systems through Machine Learning Language Models | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| ByteComposer: a Human-like Melody Composition Method based on Language Model Agent | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| MATHWELL: Generating Educational Math Word Problems Using Teacher Annotations | Feb 24, 2024 | Language Modeling | Code Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition | Feb 24, 2024 | Language Modeling | Unverified | 0 |
| Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology | Feb 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |