| Language Alignment via Nash-learning and Adaptive feedback | Jun 22, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models | Jun 22, 2024 | Diversity, Language Modeling | Code Available | 0 |
| Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification | Jun 22, 2024 | Classification, image-classification | Unverified | 0 |
| CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Jun 22, 2024 | Benchmarking, Decision Making | Unverified | 0 |
| TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers | Jun 22, 2024 | Decoder, Language Modeling | Code Available | 1 |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Jun 21, 2024 | Information Retrieval, Language Modeling | Code Available | 2 |
| Inferring Pluggable Types with Machine Learning | Jun 21, 2024 | 16k, Language Modeling | Unverified | 0 |
| TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Jun 21, 2024 | Attribute, Language Modeling | Code Available | 1 |
| GiusBERTo: A Legal Language Model for Personal Data De-identification in Italian Court of Auditors Decisions | Jun 21, 2024 | De-identification, Language Modeling | Unverified | 0 |
| Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network | Jun 21, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation | Jun 21, 2024 | Domain Adaptation, Language Modeling | Unverified | 0 |
| MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression | Jun 21, 2024 | GPU, Language Modeling | Code Available | 2 |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jun 21, 2024 | Federated Learning, Language Modeling | Code Available | 1 |
| InternLM-Law: An Open Source Chinese Legal Large Language Model | Jun 21, 2024 | Diversity, Language Modeling | Code Available | 1 |
| A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation | Jun 21, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems | Jun 21, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 |
| Unsupervised Morphological Tree Tokenizer | Jun 21, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM Pipelines | Jun 20, 2024 | Benchmarking, Decision Making | Code Available | 0 |
| LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning | Jun 20, 2024 | Autonomous Navigation, Heuristic Search | Code Available | 2 |
| Advantage Alignment Algorithms | Jun 20, 2024 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Factual Dialogue Summarization via Learning from Large Language Models | Jun 20, 2024 | Contrastive Learning, Data Augmentation | Unverified | 0 |
| A Large Language Model Outperforms Other Computational Approaches to the High-Throughput Phenotyping of Physician Notes | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors | Jun 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation | Jun 20, 2024 | GSM8K, Language Model Evaluation | Code Available | 0 |
| Demystifying Language Model Forgetting with Low-rank Example Associations | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SPL: A Socratic Playground for Learning Powered by Large Language Model | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | Jun 20, 2024 | image-classification, Image Classification | Unverified | 0 |
| Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Jun 20, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking | Jun 20, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Ranking LLMs by compression | Jun 20, 2024 | coreference-resolution, Coreference Resolution | Unverified | 0 |
| Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model | Jun 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative Inference, Language Modeling | Code Available | 1 |
| Information Guided Regularization for Fine-tuning Language Models | Jun 20, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Measuring Sample Importance in Data Pruning for Language Models based on Information Entropy | Jun 20, 2024 | Data Compression, Informativeness | Unverified | 0 |
| Enhancing the LLM-Based Robot Manipulation Through Human-Robot Collaboration | Jun 20, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Transferable speech-to-text large language model alignment module | Jun 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Jun 19, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Jun 19, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It | Jun 19, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| LIT: Large Language Model Driven Intention Tracking for Proactive Human-Robot Collaboration -- A Robot Sous-Chef Application | Jun 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Block-level Text Spotting with LLMs | Jun 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| In-Context Former: Lightning-fast Compressing Context for Large Language Model | Jun 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models | Jun 19, 2024 | GPU, Language Modeling | Code Available | 3 |
| Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset | Jun 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |