| MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Discrete Diffusion Language Model for Long Text Summarization | Jun 25, 2024 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model | Jun 25, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LABOR-LLM: Language-Based Occupational Representations with Large Language Models | Jun 25, 2024 | In-Context LearningJob Prediction | —Unverified | 0 |
| Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance | Jun 25, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Multi-property Steering of Large Language Models with Dynamic Activation Composition | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation | Jun 25, 2024 | DiagnosticInstance Segmentation | —Unverified | 0 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment | Jun 25, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARCBenchmarking | CodeCode Available | 0 |
| CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Jun 25, 2024 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Jun 25, 2024 | Contrastive Learningfew-shot-htc | CodeCode Available | 1 |
| Enhancing Tool Retrieval with Iterative Feedback from Large Language Models | Jun 25, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Understanding Language Model Circuits through Knowledge Editing | Jun 25, 2024 | knowledge editingLanguage Modeling | —Unverified | 0 |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Jun 25, 2024 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| AG-LSEC: Audio Grounded Lexical Speaker Error Correction | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jun 25, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| From Distributional to Overton Pluralism: Investigating Large Language Model Alignment | Jun 25, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Modulating Language Model Experiences through Frictions | Jun 24, 2024 | FrictionInformation Retrieval | —Unverified | 0 |