| PhotoBot: Reference-Guided Interactive Photography via Natural Language | Jan 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition | Jan 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals? | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Fast, Performant, Secure Distributed Training Framework For Large Language Model | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access | Jan 18, 2024 | Constituency ParsingLanguage Modeling | CodeCode Available | 0 |
| Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation | Jan 18, 2024 | Caption GenerationLanguage Modeling | —Unverified | 0 |
| Lateral Phishing With Large Language Models: A Large Organization Comparative Study | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gradable ChatGPT Translation Evaluation | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VMamba: Visual State Space Model | Jan 18, 2024 | Computational EfficiencyLanguage Modeling | CodeCode Available | 7 |
| Evolutionary Multi-Objective Optimization of Large Language Model Prompts for Balancing Sentiments | Jan 18, 2024 | Evolutionary AlgorithmsLanguage Modeling | —Unverified | 0 |
| Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code GenerationEvolutionary Algorithms | CodeCode Available | 2 |
| Self-Rewarding Language Models | Jan 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model | Jan 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Spatial-Temporal Large Language Model for Traffic Prediction | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study | Jan 17, 2024 | Action UnderstandingLanguage Modeling | —Unverified | 0 |
| ADCNet: a unified framework for predicting the activity of antibody-drug conjugates | Jan 17, 2024 | Activity PredictionLanguage Modeling | CodeCode Available | 1 |
| Asynchronous Local-SGD Training for Language Modeling | Jan 17, 2024 | Distributed OptimizationLanguage Modeling | CodeCode Available | 1 |
| Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints | Jan 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation | Jan 16, 2024 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 1 |
| Into the crossfire: evaluating the use of a language model to crowdsource gun violence reports | Jan 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World | Jan 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | Jan 16, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions | Jan 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance | Jan 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| On the importance of Data Scale in Pretraining Arabic Language Models | Jan 15, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Jan 15, 2024 | Binary ClassificationClassification | CodeCode Available | 0 |
| Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization | Jan 15, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment | Jan 15, 2024 | Integrated sensing and communicationLanguage Modeling | —Unverified | 0 |
| Activations and Gradients Compression for Model-Parallel Training | Jan 15, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Your Instructions Are Not Always Helpful: Assessing the Efficacy of Instruction Fine-tuning for Software Vulnerability Detection | Jan 15, 2024 | Deep LearningFeature Engineering | —Unverified | 0 |
| Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot | Jan 14, 2024 | ChatbotConversational Search | CodeCode Available | 1 |
| Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distilling Event Sequence Knowledge From Large Language Models | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Small Language Model Can Self-correct | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering | Jan 14, 2024 | Audio GenerationLanguage Modeling | —Unverified | 0 |
| Tracing the Genealogies of Ideas with Large Language Model Embeddings | Jan 13, 2024 | Abstract Meaning RepresentationLanguage Modeling | —Unverified | 0 |
| Parameter-Efficient Detoxification with Contrastive Decoding | Jan 13, 2024 | AttributeGPU | —Unverified | 0 |
| Graph Language Models | Jan 13, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 2 |
| Evolving Code with A Large Language Model | Jan 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A systematic review of geospatial location embedding approaches in large language models: A path to spatial AI systems | Jan 12, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| InRanker: Distilled Rankers for Zero-shot Information Retrieval | Jan 12, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese | Jan 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |