| FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 |
| Democratizing Reward Design for Personal and Representative Value-Alignment | Oct 29, 2024 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | Oct 29, 2024 | Decision MakingIntent Discovery | —Unverified | 0 |
| From melodic note sequences to pitches using word2vec | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | Oct 29, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| Discrete Modeling via Boundary Conditional Diffusion Processes | Oct 29, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Anticipating Future with Large Language Model for Simultaneous Machine Translation | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Are VLMs Really Blind | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CurateGPT: A flexible language-model assisted biocuration tool | Oct 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation | Oct 29, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Rethinking Code Refinement: Learning to Judge Code Efficiency | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Oct 29, 2024 | GPULanguage Modeling | —Unverified | 0 |
| MARCO: Multi-Agent Real-time Chat Orchestration | Oct 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real Data | Oct 29, 2024 | Data IntegrationImage-text Classification | CodeCode Available | 0 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PerSRV: Personalized Sticker Retrieval with Vision-Language Model | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | Oct 29, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Visualizing attention zones in machine reading comprehension models | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation | Oct 28, 2024 | Activity PredictionContrastive Learning | —Unverified | 0 |
| Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | Oct 28, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| BongLLaMA: LLaMA for Bangla Language | Oct 28, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Graph-based Uncertainty Metrics for Long-form Language Model Outputs | Oct 28, 2024 | FormInformativeness | CodeCode Available | 0 |
| Energy-Based Diffusion Language Models for Text Generation | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | Oct 28, 2024 | Benchmarkinggraph construction | —Unverified | 0 |
| Large Language Model-assisted Speech and Pointing Benefits Multiple 3D Object Selection in Virtual Reality | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Large Language Model-Guided Prediction Toward Quantum Materials Synthesis | Oct 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Inevitable Trade-off between Watermark Strength and Speculative Sampling Efficiency for Language Models | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sequential Large Language Model-Based Hyper-parameter Optimization | Oct 27, 2024 | Bayesian OptimizationBenchmarking | CodeCode Available | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Chemical Language Model Linker: blending text and molecules with modular adapters | Oct 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Computational Bottlenecks of Training Small-scale Large Language Models | Oct 25, 2024 | GPULanguage Modeling | —Unverified | 0 |
| A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERT | Oct 25, 2024 | Diagnosticimage-classification | CodeCode Available | 0 |
| IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Oct 25, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FedBaF: Federated Learning Aggregation Biased by a Foundation Model | Oct 24, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Oct 24, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Oct 24, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Provably Robust Watermarks for Open-Source Language Models | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |