ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training Oct 15, 2024 Language Modeling Language Modelling
— Unverified 0Retrieval Augmented Spelling Correction for E-Commerce Applications Oct 15, 2024 Language Modeling Language Modelling
— Unverified 0SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing Oct 15, 2024 Language Modelling Large Language Model
— Unverified 0Y-Mol: A Multiscale Biomedical Knowledge-Guided Large Language Model for Drug Development Oct 15, 2024 Drug Design Knowledge Graphs
— Unverified 0RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals Oct 15, 2024 Attribute Language Modeling
Code Code Available 0G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks Oct 15, 2024 HumanEval Language Modelling
— Unverified 0Process Reward Model with Q-Value Rankings Oct 15, 2024 Decision Making Language Modeling
Code Code Available 2Language Model Preference Evaluation with Multiple Weak Evaluators Oct 14, 2024 Denoising Language Modeling
Code Code Available 0Fine-tuning the ESM2 protein language model to understand the functional impact of missense variants Oct 14, 2024 Language Modeling Language Modelling
Code Code Available 0Skill Learning Using Process Mining for Large Language Model Plan Generation Oct 14, 2024 Decision Making Language Modeling
— Unverified 0Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing Oct 14, 2024 All Binary Classification
— Unverified 0Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? Oct 14, 2024 Language Modeling Language Modelling
— Unverified 0How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective Oct 14, 2024 Density Ratio Estimation GSM8K
Code Code Available 0PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries Oct 14, 2024 Language Modelling Large Language Model
— Unverified 0Large Language Model Evaluation via Matrix Nuclear-Norm Oct 14, 2024 Computational Efficiency Data Compression
Code Code Available 0Character-aware audio-visual subtitling in context Oct 14, 2024 Language Modelling Large Language Model
— Unverified 0Local and Global Decoding in Text Generation Oct 14, 2024 Language Modeling Language Modelling
Code Code Available 0Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies Oct 14, 2024 In-Context Learning Language Modeling
— Unverified 0A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets Oct 14, 2024 Feature Importance Language Identification
— Unverified 0KBLaM: Knowledge Base augmented Language Model Oct 14, 2024 8k GPU
Code Code Available 5Learning to Ground VLMs without Forgetting Oct 14, 2024 Decoder Language Modelling
— Unverified 0LOBG:Less Overfitting for Better Generalization in Vision-Language Model Oct 14, 2024 Language Modeling Language Modelling
— Unverified 0ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization Oct 14, 2024 Explanation Generation Image Forgery Detection
— Unverified 0Predicting from Strings: Language Model Embeddings for Bayesian Optimization Oct 14, 2024 Bayesian Optimization Experimental Design
Code Code Available 3A Simple Baseline for Predicting Events with Auto-Regressive Tabular Transformers Oct 14, 2024 Causal Language Modeling Language Modeling
Code Code Available 0LG-CAV: Train Any Concept Activation Vector with Language Guidance Oct 14, 2024 Language Modeling Language Modelling
Code Code Available 0Model-based Large Language Model Customization as Service Oct 14, 2024 Language Modeling Language Modelling
— Unverified 0Unified Representation of Genomic and Biomedical Concepts through Multi-Task, Multi-Source Contrastive Learning Oct 14, 2024 Contrastive Learning Data Integration
— Unverified 0Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization Oct 13, 2024 Language Modeling Language Modelling
— Unverified 0MisinfoEval: Generative AI in the Era of "Alternative Facts" Oct 13, 2024 Language Modelling Large Language Model
— Unverified 0Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles Oct 13, 2024 Autonomous Vehicles Code Generation
— Unverified 0MoIN: Mixture of Introvert Experts to Upcycle an LLM Oct 13, 2024 GPU Language Modeling
— Unverified 0Adaptive Reasoning and Acting in Medical Language Agents Oct 13, 2024 Decision Making Diagnostic
— Unverified 0HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Oct 13, 2024 Language Modeling Language Modelling
Code Code Available 1EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs Oct 13, 2024 Language Modeling Language Modelling
Code Code Available 1Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code Oct 13, 2024 Code Generation Hallucination
— Unverified 0EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation Oct 13, 2024 Contrastive Learning Language Modeling
— Unverified 0LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering Oct 13, 2024 Answer Generation Language Modeling
— Unverified 0Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation Oct 12, 2024 Code Generation Language Modeling
— Unverified 0COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement Oct 12, 2024 Code Generation Computational Efficiency
Code Code Available 0Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment Oct 12, 2024 Language Modelling Philosophy
Code Code Available 9Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement Oct 12, 2024 Language Modelling Large Language Model
— Unverified 0LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning Oct 12, 2024 Knowledge Graphs Language Modeling
Code Code Available 0Enterprise Benchmarks for Large Language Model Evaluation Oct 11, 2024 Benchmarking Language Model Evaluation
Code Code Available 0LLMD: A Large Language Model for Interpreting Longitudinal Medical Records Oct 11, 2024 Language Modeling Language Modelling
— Unverified 0nach0-pc: Multi-task Language Model with Molecular Point Cloud Encoder Oct 11, 2024 Drug Discovery Language Modeling
— Unverified 0ACER: Automatic Language Model Context Extension via Retrieval Oct 11, 2024 Language Modeling Language Modelling
— Unverified 0The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Oct 11, 2024 Language Modeling Language Modelling
— Unverified 0Can a large language model be a gaslighter? Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 0Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization Oct 11, 2024 GSM8K Language Modeling
Code Code Available 2