| MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Discrete Diffusion Language Model for Long Text Summarization | Jun 25, 2024 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model | Jun 25, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LABOR-LLM: Language-Based Occupational Representations with Large Language Models | Jun 25, 2024 | In-Context LearningJob Prediction | —Unverified | 0 |
| Understanding Language Model Circuits through Knowledge Editing | Jun 25, 2024 | knowledge editingLanguage Modeling | —Unverified | 0 |
| Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Jun 25, 2024 | Contrastive Learningfew-shot-htc | CodeCode Available | 1 |
| Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment | Jun 25, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARCBenchmarking | CodeCode Available | 0 |
| From Distributional to Overton Pluralism: Investigating Large Language Model Alignment | Jun 25, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Multi-property Steering of Large Language Models with Dynamic Activation Composition | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance | Jun 25, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Jun 25, 2024 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation | Jun 25, 2024 | DiagnosticInstance Segmentation | —Unverified | 0 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Jun 25, 2024 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| Enhancing Tool Retrieval with Iterative Feedback from Large Language Models | Jun 25, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| AG-LSEC: Audio Grounded Lexical Speaker Error Correction | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jun 25, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Classification of Geological Borehole Descriptions Using a Domain Adapted Large Language Model | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Modulating Language Model Experiences through Frictions | Jun 24, 2024 | FrictionInformation Retrieval | —Unverified | 0 |
| tcrLM: a lightweight protein language model for predicting T cell receptor and epitope binding specificity | Jun 24, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| AnnotatedTables: A Large Tabular Dataset with Language Model Annotations | Jun 24, 2024 | AutoMLFew-Shot Learning | —Unverified | 0 |
| Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models? | Jun 24, 2024 | EthicsLanguage Modeling | —Unverified | 0 |
| ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance | Jun 24, 2024 | 4kDenoising | —Unverified | 0 |
| Is your benchmark truly adversarial? AdvScore: Evaluating Human-Grounded Adversarialness | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Inducing Group Fairness in Prompt-Based Language Model Decisions | Jun 24, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| GPT-4V Explorations: Mining Autonomous Driving | Jun 24, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Guardrails for avoiding harmful medical product recommendations and off-label promotion in generative AI models | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Jun 24, 2024 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Jun 24, 2024 | Chinese Spell CheckingLanguage Modeling | CodeCode Available | 1 |
| Large Vocabulary Size Improves Large Language Models | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UniCoder: Scaling Code Large Language Model via Universal Code | Jun 24, 2024 | Code GenerationCode Translation | —Unverified | 0 |
| RaTEScore: A Metric for Radiology Report Generation | Jun 24, 2024 | DiagnosticEntity Embeddings | CodeCode Available | 4 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Jun 24, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| Long Context Transfer from Language to Vision | Jun 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser | Jun 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding and Mitigating Tokenization Bias in Language Models | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods | Jun 23, 2024 | Inference AttackLanguage Modeling | —Unverified | 0 |
| First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning | Jun 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries? | Jun 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting | Jun 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language Alignment via Nash-learning and Adaptive feedback | Jun 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis | Jun 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |