| On the Relationship between Truth and Political Bias in Language Models | Sep 9, 2024 | Language Modeling | Code Available | 0 |
| STLLM-DF: A Spatial-Temporal Large Language Model with Diffusion for Enhanced Multi-Mode Traffic System Forecasting | Sep 8, 2024 | Denoising, Language Modeling | Unverified | 0 |
| MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality | Sep 7, 2024 | Language Modeling | Unverified | 0 |
| POINTS: Improving Your Vision-language Model with Affordable Strategies | Sep 7, 2024 | Language Modeling | Unverified | 0 |
| VidLPRO: A Video-Language Pre-training Framework for Robotic and Laparoscopic Surgery | Sep 7, 2024 | Computational Efficiency, Contrastive Learning | Unverified | 0 |
| Achieving Peak Performance for Large Language Models: A Systematic Review | Sep 7, 2024 | Language Modeling | Unverified | 0 |
| Reward Guidance for Reinforcement Learning Tasks Based on Large Language Models: The LMGT Framework | Sep 7, 2024 | Language Modeling | Unverified | 0 |
| Retrieval Augmented Generation-Based Incident Resolution Recommendation System for IT Support | Sep 6, 2024 | Answer Generation, Language Modeling | Unverified | 0 |
| Sparse Rewards Can Self-Train Dialogue Agents | Sep 6, 2024 | Language Modeling | Code Available | 1 |
| How Does Code Pretraining Affect Language Model Task Performance? | Sep 6, 2024 | Language Modeling | Unverified | 0 |
| Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning | Sep 6, 2024 | Language Modeling | Unverified | 0 |
| Confidential Computing on NVIDIA Hopper GPUs: A Performance Benchmark Study | Sep 6, 2024 | CPU, GPU | Unverified | 0 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | Attribute, AutoML | Code Available | 1 |
| Multi-Programming Language Ensemble for Code Generation in Large Language Model | Sep 6, 2024 | Code Generation, HumanEval | Code Available | 0 |
| Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Sep 6, 2024 | Diversity, Language Modeling | Unverified | 0 |
| A Fused Large Language Model for Predicting Startup Success | Sep 5, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Sep 5, 2024 | Data Augmentation, Diversity | Code Available | 0 |
| LAST: Language Model Aware Speech Tokenization | Sep 5, 2024 | Language Modeling | Unverified | 0 |
| The AdEMAMix Optimizer: Better, Faster, Older | Sep 5, 2024 | Image Classification | Code Available | 2 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language Modeling, Language Modeling | Unverified | 0 |
| Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model | Sep 4, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Irrelevant Alternatives Bias Large Language Model Hiring Decisions | Sep 4, 2024 | Language Modeling | Unverified | 0 |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Sep 4, 2024 | GPU, Language Modeling | Unverified | 0 |
| MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Sep 4, 2024 | Language Modeling | Code Available | 5 |
| Pre-training data selection for biomedical domain adaptation using journal impact metrics | Sep 4, 2024 | Articles, Domain Adaptation | Unverified | 0 |
| Oddballness: universal anomaly detection with language models | Sep 4, 2024 | Anomaly Detection, Grammatical Error Detection | Unverified | 0 |
| Creating Domain-Specific Translation Memories for Machine Translation Fine-tuning: The TRENCARD Bilingual Cardiology Corpus | Sep 4, 2024 | Language Modeling | Unverified | 0 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language Modeling | Code Available | 1 |
| Large Language Model-Based Agents for Software Engineering: A Survey | Sep 4, 2024 | AI Agent, Language Modeling | Code Available | 4 |
| Historical German Text Normalization Using Type- and Token-Based Language Modeling | Sep 4, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection | Sep 4, 2024 | DeepFake Detection, Face Swapping | Unverified | 0 |
| A Medical Multimodal Large Language Model for Pediatric Pneumonia | Sep 4, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| Language Model Powered Digital Biology with BRAD | Sep 4, 2024 | Chatbot, Code Generation | Code Available | 2 |
| RouterRetriever: Routing over a Mixture of Expert Embedding Models | Sep 4, 2024 | Information Retrieval, Language Modeling | Code Available | 1 |
| Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Sep 4, 2024 | Decision Making, Few-Shot Learning | Unverified | 0 |
| Accelerating Large Language Model Training with Hybrid GPU-based Compression | Sep 4, 2024 | GPU, Language Modeling | Unverified | 0 |
| Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Sep 4, 2024 | Language Modeling | Unverified | 0 |
| VSLLaVA: a pipeline of large multimodal foundation model for industrial vibration signal analysis | Sep 3, 2024 | Fault Diagnosis, Language Modeling | Unverified | 0 |
| An Implementation of Werewolf Agent That does not Truly Trust LLMs | Sep 3, 2024 | Language Modeling | Unverified | 0 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language Modeling | Code Available | 1 |
| Foundations of Large Language Model Compression -- Part 1: Weight Quantization | Sep 3, 2024 | Language Modeling | Code Available | 0 |
| How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? | Sep 3, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models | Sep 3, 2024 | Image Restoration, Language Modeling | Code Available | 1 |
| Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers | Sep 3, 2024 | Language Modeling | Unverified | 0 |
| SmileyLlama: Modifying Large Language Models for Directed Chemical Space Exploration | Sep 3, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| OLMoE: Open Mixture-of-Experts Language Models | Sep 3, 2024 | Language Modeling | Code Available | 4 |
| LASP: Surveying the State-of-the-Art in Large Language Model-Assisted AI Planning | Sep 3, 2024 | Autonomous Vehicles, Language Modeling | Unverified | 0 |
| Agentic Society: Merging skeleton from real world and texture from Large Language Model | Sep 2, 2024 | Language Modeling | Code Available | 0 |
| EEG-Language Modeling for Pathology Detection | Sep 2, 2024 | Contrastive Learning, EEG | Unverified | 0 |
| The Compressor-Retriever Architecture for Language Model OS | Sep 2, 2024 | CPU, In-Context Learning | Code Available | 1 |