| PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation | Oct 28, 2024 | Activity PredictionContrastive Learning | —Unverified | 0 |
| Energy-Based Diffusion Language Models for Text Generation | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Oct 28, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | Oct 28, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Visualizing attention zones in machine reading comprehension models | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Guided Prediction Toward Quantum Materials Synthesis | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Graph-based Uncertainty Metrics for Long-form Language Model Outputs | Oct 28, 2024 | FormInformativeness | CodeCode Available | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| BongLLaMA: LLaMA for Bangla Language | Oct 28, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language Model | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Oct 28, 2024 | Benchmarkinggraph construction | —Unverified | 0 |
| Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| TrajAgent: An Agent Framework for Unified Trajectory Modelling | Oct 27, 2024 | Future predictionLanguage Modeling | CodeCode Available | 1 |
| Sequential Large Language Model-Based Hyper-parameter Optimization | Oct 27, 2024 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders | Oct 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Inevitable Trade-off between Watermark Strength and Speculative Sampling Efficiency for Language Models | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chemical Language Model Linker: blending text and molecules with modular adapters | Oct 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Centaur: a foundation model of human cognition | Oct 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERT | Oct 25, 2024 | Diagnosticimage-classification | CodeCode Available | 0 |
| IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Oct 25, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Computational Bottlenecks of Training Small-scale Large Language Models | Oct 25, 2024 | GPULanguage Modeling | —Unverified | 0 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Oct 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FedBaF: Federated Learning Aggregation Biased by a Foundation Model | Oct 24, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Oct 24, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| GCoder: Improving Large Language Model for Generalized Graph Problem Solving | Oct 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scaling up Masked Diffusion Models on Text | Oct 24, 2024 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zero-shot Object Navigation with Vision-Language Models Reasoning | Oct 24, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LOGO -- Long cOntext aliGnment via efficient preference Optimization | Oct 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Oct 24, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Structure Language Models for Protein Conformation Generation | Oct 24, 2024 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Taipan: Efficient and Expressive State Space Language Models with Selective Attention | Oct 24, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Provably Robust Watermarks for Open-Source Language Models | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generalizations across filler-gap dependencies in neural language models | Oct 23, 2024 | Language AcquisitionLanguage Modeling | CodeCode Available | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | Oct 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| LEGO: Language Model Building Blocks | Oct 23, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| MojoBench: Language Modeling and Benchmarks for Mojo | Oct 23, 2024 | Code GenerationHumanEval | —Unverified | 0 |
| Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Oct 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Oct 23, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Lightweight Neural App Control | Oct 23, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| LMLPA: Language Model Linguistic Personality Assessment | Oct 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |