| Randomized Geometric Algebra Methods for Convex Neural Networks | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Edit Distance Robust Watermarks via Indexing Pseudorandom Codes | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis | Jun 4, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Block Transformer: Global-to-Local Language Modeling for Fast Inference | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Diver: Large Language Model Decoding with Span-Level Mutual Information Verification | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models | Jun 4, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Conditional Language Learning with Context | Jun 4, 2024 | Causal Language ModelingLanguage Modeling | CodeCode Available | 0 |
| Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models | Jun 4, 2024 | Document DatingLanguage Modeling | —Unverified | 0 |
| Radar Spectra-Language Model for Automotive Scene Parsing | Jun 4, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Scalable MatMul-free Language Modeling | Jun 4, 2024 | GPULanguage Modeling | CodeCode Available | 7 |
| Meta-Designing Quantum Experiments with Language Models | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DrEureka: Language Model Guided Sim-To-Real Transfer | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Independence-promoting Loss for Music Generation with Language Models | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why Would You Suggest That? Human Trust in Language Model Responses | Jun 4, 2024 | Decision MakingHeadline Generation | —Unverified | 0 |
| Large Language Model-Enabled Multi-Agent Manufacturing Systems | Jun 4, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| MaskSR: Masked Language Model for Full-band Speech Restoration | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 |
| HoneyGPT: Breaking the Trilemma in Terminal Honeypots with Large Language Model | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability | Jun 4, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| LongSSM: On the Length Extension of State-space Models in Language Modelling | Jun 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zyda: A 1.3T Dataset for Open Language Modeling | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model | Jun 3, 2024 | geo-localizationLanguage Modeling | CodeCode Available | 2 |
| Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| VerilogReader: LLM-Aided Hardware Test Generation | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM | Jun 3, 2024 | DecoderGPU | CodeCode Available | 2 |
| Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach | Jun 3, 2024 | AI AgentDeep Reinforcement Learning | —Unverified | 0 |
| Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding | Jun 3, 2024 | counterfactualDistractor Generation | —Unverified | 0 |
| OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models | Jun 3, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Towards Harnessing Large Language Models for Comprehension of Conversational Grounding | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Jun 3, 2024 | Audio GenerationIn-Context Learning | CodeCode Available | 2 |
| Scalable Ensembling For Mitigating Reward Overoptimisation | Jun 3, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models | Jun 3, 2024 | Data AugmentationDomain Generalization | —Unverified | 0 |
| LLM and GNN are Complementary: Distilling LLM for Multimodal Graph Learning | Jun 3, 2024 | Graph LearningLanguage Modeling | —Unverified | 0 |
| Understanding Token Probability Encoding in Output Embeddings | Jun 3, 2024 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |
| VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model | Jun 3, 2024 | Image OutpaintingLanguage Modeling | CodeCode Available | 1 |
| Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | Jun 3, 2024 | Federated Learningimage-classification | —Unverified | 0 |
| L-MAGIC: Language Model Assisted Generation of Images with Coherence | Jun 3, 2024 | Depth EstimationLanguage Modeling | CodeCode Available | 0 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost | Jun 3, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| HBTP: Heuristic Behavior Tree Planning with Large Language Model Reasoning | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MultiMax: Sparse and Multi-Modal Attention Learning | Jun 3, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Towards a copilot in BIM authoring tool using a large language model-based agent for intelligent human-machine interaction | Jun 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Harnessing Business and Media Insights with Large Language Models | Jun 2, 2024 | Data VisualizationLanguage Modeling | —Unverified | 0 |
| Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions | Jun 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual PretrainingInformation Retrieval | —Unverified | 0 |
| Aligning Language Models with Demonstrated Feedback | Jun 2, 2024 | ArticlesAvg | CodeCode Available | 2 |
| Large Language Model Confidence Estimation via Black-Box Access | Jun 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |