| Promises, Outlooks and Challenges of Diffusion Language Modeling | Jun 17, 2024 | ARCHellaSwag | —Unverified | 0 |
| Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels | Jun 17, 2024 | Dataset GenerationInformation Retrieval | —Unverified | 0 |
| CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics | Jun 16, 2024 | ClassificationInformativeness | CodeCode Available | 0 |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Jun 16, 2024 | knowledge editingLanguage Modeling | CodeCode Available | 0 |
| Avoiding Copyright Infringement via Large Language Model Unlearning | Jun 16, 2024 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens | Jun 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning | Jun 16, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Jun 16, 2024 | Continual LearningGSM8K | CodeCode Available | 0 |
| Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp | Jun 16, 2024 | Compiler OptimizationLanguage Modeling | CodeCode Available | 0 |
| VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Reactor Mk.1 performances: MMLU, HumanEval and BBH test results | Jun 15, 2024 | BenchmarkingHumanEval | —Unverified | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data | Jun 15, 2024 | Generative Adversarial NetworkLanguage Modeling | —Unverified | 0 |
| Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition | Jun 15, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Enhanced Clustering for News Event Detection | Jun 15, 2024 | ClusteringEvent Detection | —Unverified | 0 |
| CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Jun 15, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Probability--Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Jun 14, 2024 | Image Quality AssessmentLanguage Modeling | —Unverified | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPULanguage Modeling | —Unverified | 0 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| OpenECAD: An Efficient Visual Language Model for Editable 3D-CAD Design | Jun 14, 2024 | 3D Object ReconstructionLanguage Modeling | —Unverified | 0 |
| Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Jun 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |