| MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Protein Language Models | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scalable Influence and Fact Tracing for Large Language Model Pretraining | Oct 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Pinterest Search Relevance Using Large Language Models | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeLLiriuM: A large language model for delirium prediction in the ICU using structured EHR | Oct 22, 2024 | ICU AdmissionLanguage Modeling | —Unverified | 0 |
| Adsorb-Agent: Autonomous Identification of Stable Adsorption Configurations via Large Language Model Agent | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Continuous Speech Tokenizer in Text To Speech | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization | Oct 22, 2024 | De-identificationLanguage Modeling | —Unverified | 0 |
| PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Satori: Towards Proactive AR Assistant with Belief-Desire-Intention User Modeling | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DNAHLM -- DNA sequence and Human Language mixed large language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Frontiers in Intelligent Colonoscopy | Oct 22, 2024 | Image Captioning | CodeCode Available | 2 |
| Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models | Oct 22, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| SaVe-TAG: Semantic-aware Vicinal Risk Minimization for Long-Tailed Text-Attributed Graphs | Oct 22, 2024 | ClassificationData Augmentation | —Unverified | 0 |
| MiniPLM: Knowledge Distillation for Pre-Training Language Models | Oct 22, 2024 | DiversityKnowledge Distillation | CodeCode Available | 2 |
| Remote Timing Attacks on Efficient Language Model Inference | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-calibration for Language Model Quantization and Pruning | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chatting with Bots: AI, Speech Acts, and the Edge of Assertion | Oct 22, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| PLDR-LLM: Large Language Model from Power Law Decoder Representations | Oct 22, 2024 | DecoderGraph Attention | CodeCode Available | 0 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks | Oct 22, 2024 | Code GenerationCode Summarization | —Unverified | 0 |
| Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploring Forgetting in Large Language Model Pre-Training | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Subword Embedding from Bytes Gains Privacy without Sacrificing Accuracy and Complexity | Oct 21, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| No more hard prompts: SoftSRV prompting for synthetic data generation | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Oct 21, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| Building A Coding Assistant via the Retrieval-Augmented Language Model | Oct 21, 2024 | Code CompletionCode Generation | CodeCode Available | 1 |
| SeisLM: a Foundation Model for Seismic Waveforms | Oct 21, 2024 | Event DetectionLanguage Modeling | CodeCode Available | 1 |
| From Tokens to Materials: Leveraging Language Models for Scientific Discovery | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin Evaluation | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Tokenization as Finite-State Transduction | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contamination Report for Multilingual Benchmarks | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ComPO: Community Preferences for Language Model Personalization | Oct 21, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Language Models are Symbolic Learners in Arithmetic | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Body Language Models | Oct 21, 2024 | Gesture GenerationLanguage Modeling | —Unverified | 0 |
| Generalized Probabilistic Attention Mechanism in Transformers | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Residual vector quantization for KV cache compression in large language model | Oct 21, 2024 | Audio CompressionLanguage Modeling | CodeCode Available | 1 |
| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Neural Search Space in Gboard Decoder | Oct 21, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style | Oct 21, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classificationimage-classification | CodeCode Available | 7 |
| Improve Vision Language Model Chain-of-thought Reasoning | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt | Oct 21, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec | Oct 21, 2024 | DisentanglementLanguage Modeling | —Unverified | 0 |
| The effect of fine-tuning on language model toxicity | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Are Language Model Logits Calibrated? | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |