| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Oct 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs | Oct 25, 2024 | BenchmarkingFairness | —Unverified | 0 |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Oct 24, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| GCoder: Improving Large Language Model for Generalized Graph Problem Solving | Oct 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| The Stepwise Deception: Simulating the Evolution from True News to Fake News with LLM Agents | Oct 24, 2024 | Large Language ModelMisinformation | —Unverified | 0 |
| Unbounded: A Generative Infinite Game of Character Life Simulation | Oct 24, 2024 | Instruction FollowingLanguage Modelling | —Unverified | 0 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | image-classificationImage Classification | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Provably Robust Watermarks for Open-Source Language Models | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Oct 24, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | Oct 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models | Oct 23, 2024 | Instruction FollowingLanguage Modelling | CodeCode Available | 2 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation | Oct 23, 2024 | Large Language ModelSpoken Dialogue Systems | —Unverified | 0 |
| Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Oct 22, 2024 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Scalable Influence and Fact Tracing for Large Language Model Pretraining | Oct 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploring Forgetting in Large Language Model Pre-Training | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment | Oct 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PLDR-LLM: Large Language Model from Power Law Decoder Representations | Oct 22, 2024 | DecoderGraph Attention | CodeCode Available | 0 |
| SaVe-TAG: Semantic-aware Vicinal Risk Minimization for Long-Tailed Text-Attributed Graphs | Oct 22, 2024 | ClassificationData Augmentation | —Unverified | 0 |
| An Eye for an AI: Evaluating GPT-4o's Visual Perception Skills and Geometric Reasoning Skills Using Computer Graphics Questions | Oct 22, 2024 | Large Language Model | —Unverified | 0 |
| DNAHLM -- DNA sequence and Human Language mixed large language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization | Oct 22, 2024 | De-identificationLanguage Modeling | —Unverified | 0 |