| Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints | Jan 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions | Jan 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Into the crossfire: evaluating the use of a language model to crowdsource gun violence reports | Jan 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World | Jan 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | Jan 16, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment | Jan 15, 2024 | Integrated sensing and communicationLanguage Modeling | —Unverified | 0 |
| On the importance of Data Scale in Pretraining Arabic Language Models | Jan 15, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance | Jan 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Jan 15, 2024 | Binary ClassificationClassification | CodeCode Available | 0 |
| Your Instructions Are Not Always Helpful: Assessing the Efficacy of Instruction Fine-tuning for Software Vulnerability Detection | Jan 15, 2024 | Deep LearningFeature Engineering | —Unverified | 0 |
| Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization | Jan 15, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Activations and Gradients Compression for Model-Parallel Training | Jan 15, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering | Jan 14, 2024 | Audio GenerationLanguage Modeling | —Unverified | 0 |
| Distilling Event Sequence Knowledge From Large Language Models | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Small Language Model Can Self-correct | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Parameter-Efficient Detoxification with Contrastive Decoding | Jan 13, 2024 | AttributeGPU | —Unverified | 0 |
| Tracing the Genealogies of Ideas with Large Language Model Embeddings | Jan 13, 2024 | Abstract Meaning RepresentationLanguage Modeling | —Unverified | 0 |
| Evolving Code with A Large Language Model | Jan 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints | Jan 12, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| InRanker: Distilled Rankers for Zero-shot Information Retrieval | Jan 12, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| A systematic review of geospatial location embedding approaches in large language models: A path to spatial AI systems | Jan 12, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| PersianMind: A Cross-Lingual Persian-English Large Language Model | Jan 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese | Jan 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating Data Contamination for Pre-training Language Models | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distilling Vision-Language Models on Millions of Videos | Jan 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LEGOBench: Scientific Leaderboard Generation Benchmark | Jan 11, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Combating Adversarial Attacks with Multi-Agent Debate | Jan 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AugSumm: towards generalizable speech summarization using synthetic labels from large language model | Jan 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Less is More: A Closer Look at Semantic-based Few-Shot Learning | Jan 10, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Hierarchical Classification of Transversal Skills in Job Ads Based on Sentence Embeddings | Jan 10, 2024 | ClassificationLanguage Modeling | —Unverified | 0 |
| ChatGPT, Let us Chat Sign Language: Experiments, Architectural Elements, Challenges and Research Directions | Jan 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking | Jan 10, 2024 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding | Jan 10, 2024 | DecoderDiversity | CodeCode Available | 0 |
| How predictable is language model benchmark performance? | Jan 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring Prompt-Based Methods for Zero-Shot Hypernym Prediction with Large Language Models | Jan 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property Prediction | Jan 9, 2024 | Drug DiscoveryLanguage Modeling | CodeCode Available | 0 |
| The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance | Jan 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet | Jan 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sparse Meets Dense: A Hybrid Approach to Enhance Scientific Document Retrieval | Jan 8, 2024 | Deep LearningInformation Retrieval | —Unverified | 0 |
| IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification | Jan 8, 2024 | ClusteringLanguage Modeling | —Unverified | 0 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPULanguage Modeling | —Unverified | 0 |
| DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | Jan 8, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |