Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis Nov 4, 2024 Language Modeling Language Modelling
Code Code Available 0Training Compute-Optimal Protein Language Models Nov 4, 2024 Language Modeling Language Modelling
Code Code Available 1Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models Nov 4, 2024 Inductive Bias Language Modeling
Code Code Available 1RAGViz: Diagnose and Visualize Retrieval-Augmented Generation Nov 4, 2024 Answer Generation GPU
Code Code Available 2Context Parallelism for Scalable Million-Token Inference Nov 4, 2024 GPU Language Modeling
— Unverified 0High-performance automated abstract screening with large language model ensembles Nov 3, 2024 Binary Classification Language Modeling
— Unverified 0A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why? Nov 3, 2024 Code Generation Language Modeling
— Unverified 0GraphXForm: Graph transformer for computer-aided molecular design Nov 3, 2024 Drug Design Drug Discovery
Code Code Available 1Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers Nov 3, 2024 Ensemble Learning Feature Engineering
— Unverified 0Large Language Model Supply Chain: Open Problems From the Security Perspective Nov 3, 2024 Language Modeling Language Modelling
— Unverified 0Can Multimodal Large Language Model Think Analogically? Nov 2, 2024 Language Modeling Language Modelling
— Unverified 0Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks Nov 2, 2024 Language Modeling Language Modelling
— Unverified 0A Mechanistic Explanatory Strategy for XAI Nov 2, 2024 Decision Making Language Modeling
— Unverified 0Rule Based Rewards for Language Model Safety Nov 2, 2024 Language Modeling Language Modelling
Code Code Available 3PRIMO: Progressive Induction for Multi-hop Open Rule Generation Nov 2, 2024 Diversity Language Modeling
— Unverified 0Can Large Language Model Predict Employee Attrition? Nov 2, 2024 Language Modeling Language Modelling
— Unverified 0Privacy Leakage Overshadowed by Views of AI: A Study on Human Oversight of Privacy in Language Model Agent Nov 2, 2024 Language Modeling Language Modelling
— Unverified 0Interacting Large Language Model Agents. Interpretable Models and Social Learning Nov 2, 2024 Bayesian Inference Language Modeling
— Unverified 0Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations Nov 1, 2024 Informativeness Language Modeling
— Unverified 0Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO Nov 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Large Language Models for Code-Mixed Data Augmentation in Sentiment Analysis Nov 1, 2024 Data Augmentation Language Modeling
Code Code Available 0Improving Few-Shot Cross-Domain Named Entity Recognition by Instruction Tuning a Word-Embedding based Retrieval Augmented Large Language Model Nov 1, 2024 Benchmarking Cross-Domain Named Entity Recognition
— Unverified 0Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models Nov 1, 2024 Decision Making Informativeness
Code Code Available 1Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers Nov 1, 2024 Language Modeling Language Modelling
Code Code Available 0LLM-KT: A Versatile Framework for Knowledge Transfer from Large Language Models to Collaborative Filtering Nov 1, 2024 Collaborative Filtering Language Modeling
— Unverified 0Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback Nov 1, 2024 Language Modeling Language Modelling
— Unverified 0RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models Nov 1, 2024 Hallucination Language Modeling
— Unverified 0Unified Generative and Discriminative Training for Multi-modal Large Language Models Nov 1, 2024 Dynamic Time Warping Image-text Classification
— Unverified 0SPRING Lab IITM's submission to Low Resource Indic Language Translation Shared Task Nov 1, 2024 Language Modelling Translation
— Unverified 0ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents Nov 1, 2024 Decision Making Language Modeling
— Unverified 0Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement Nov 1, 2024 Language Modeling Language Modelling
Code Code Available 3Randomized Autoregressive Visual Generation Nov 1, 2024 Image Generation Language Modeling
Code Code Available 5DEREC-SIMPRO: unlock Language Model benefits to advance Synthesis in Data Clean Room Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0LLaMo: Large Language Model-based Molecular Graph Assistant Oct 31, 2024 Instruction Following IUPAC Name Prediction
Code Code Available 1MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking Oct 31, 2024 Data Augmentation Dialogue State Tracking
— Unverified 0Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning Oct 31, 2024 Dictionary Learning Language Modeling
— Unverified 0Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction Oct 31, 2024 Disaster Response Language Modeling
Code Code Available 1Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0EchoNarrator: Generating natural text explanations for ejection fraction predictions Oct 31, 2024 Language Modeling Language Modelling
Code Code Available 0Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0π_0: A Vision-Language-Action Flow Model for General Robot Control Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0Matchmaker: Self-Improving Large Language Model Programs for Schema Matching Oct 31, 2024 Data Integration Language Modeling
— Unverified 0Interpretable Language Modeling via Induction-head Ngram Models Oct 31, 2024 Causal Language Modeling Human fMRI response prediction
Code Code Available 1What is Wrong with Perplexity for Long-context Language Modeling? Oct 31, 2024 Document Summarization In-Context Learning
Code Code Available 2GPT or BERT: why not both? Oct 31, 2024 Causal Language Modeling Language Modeling
Code Code Available 2Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0Morphological Typology in BPE Subword Productivity and Language Modeling Oct 31, 2024 Language Modeling Language Modelling
— Unverified 0Weight decay induces low-rank attention layers Oct 31, 2024 L2 Regularization Language Modelling
— Unverified 0Towards Reliable Alignment: Uncertainty-aware RLHF Oct 31, 2024 Language Modelling Stochastic Optimization
— Unverified 0