A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning | Mar 2, 2024 | Language Modeling, Language Modelling | Unverified | 0
OpenGraph: Towards Open Graph Foundation Models | Mar 2, 2024 | Data Augmentation, Graph Learning | Code Available | 3
AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks | Mar 2, 2024 | Computer Security, Language Modeling | Unverified | 0
Towards Accurate Lip-to-Speech Synthesis in-the-Wild | Mar 2, 2024 | Language Modelling, Lip to Speech Synthesis | Unverified | 0
Chaining thoughts and LLMs to learn DNA structural biophysics | Mar 2, 2024 | Language Modeling, Language Modelling | Code Available | 0
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code | Mar 2, 2024 | Language Modeling, Language Modelling | Unverified | 0
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language Modeling, Language Modelling | Code Available | 3
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16k, CPU | Code Available | 1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language Modelling, Large Language Model | Code Available | 1
LAB: Large-Scale Alignment for ChatBots | Mar 2, 2024 | Instruction Following, Language Modeling | Code Available | 5
BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Mar 1, 2024 | Language Modeling, Language Modelling | Unverified | 0
AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs | Mar 1, 2024 | Fairness, Language Modeling | Unverified | 0
SoftTiger: A Clinical Foundation Model for Healthcare Workflows | Mar 1, 2024 | Language Modelling, Large Language Model | Code Available | 7
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model | Mar 1, 2024 | Fine-Grained Image Classification, image-classification | Code Available | 0
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training | Mar 1, 2024 | Language Modelling | Code Available | 0
Enhancing Jailbreak Attacks with Diversity Guidance | Mar 1, 2024 | Diversity, Language Modelling | Unverified | 0
An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce | Mar 1, 2024 | Language Modeling, Language Modelling | Code Available | 1
Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language Modeling, Language Modelling | Code Available | 1
Leveraging pre-trained language models for code generation | Feb 29, 2024 | Code Generation, Language Modelling | Code Available | 0
Resonance RoPE: Improving Context Length Generalization of Large Language Models | Feb 29, 2024 | Language Modeling, Language Modelling | Code Available | 1
FAC^2E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Feb 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction | Feb 29, 2024 | Attribute, Attribute Extraction | Unverified | 0
Large Language Models are Learnable Planners for Long-Term Recommendation | Feb 29, 2024 | Decision Making, Language Modelling | Code Available | 1
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks | Feb 29, 2024 | Language Modeling, Language Modelling | Code Available | 3
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning | Feb 29, 2024 | Continual Learning, Language Modelling | Code Available | 1
TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings | Feb 29, 2024 | Conditional Text Generation, Decoder | Code Available | 1
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model | Feb 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models | Feb 29, 2024 | Language Modelling | Unverified | 0
PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval | Feb 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPU, Language Modeling | Code Available | 5
The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? | Feb 29, 2024 | Code Generation, Language Modelling | Unverified | 0
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | Feb 29, 2024 | Language Modelling, Mamba | Code Available | 7
PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Feb 29, 2024 | Citation Prediction, Language Modeling | Unverified | 0
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Feb 29, 2024 | Language Modeling, Language Modelling | Code Available | 3
A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration | Feb 29, 2024 | Language Modeling, Language Modelling | Unverified | 0
VIXEN: Visual Text Comparison Network for Image Difference Captioning | Feb 29, 2024 | Language Modeling, Language Modelling | Code Available | 0
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Feb 29, 2024 | image-classification, Image Classification | Code Available | 1
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Feb 28, 2024 | In-Context Learning, Language Modeling | Code Available | 2
Merino: Entropy-driven Design for Generative Language Models on IoT Devices | Feb 28, 2024 | CPU, Language Modeling | Unverified | 0
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners | Feb 28, 2024 | Language Modeling, Language Modelling | Unverified | 0
Learning to Deliver: a Foundation Model for the Montreal Capacitated Vehicle Routing Problem | Feb 28, 2024 | Language Modelling, Large Language Model | Unverified | 0
Data Interpreter: An LLM Agent For Data Science | Feb 28, 2024 | Code Generation, Language Modelling | Unverified | 0
Multi-objective Differentiable Neural Architecture Search | Feb 28, 2024 | Decoder, Language Modelling | Code Available | 1
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Feb 28, 2024 | Image Generation, Language Modeling | Code Available | 1
Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language Modeling, Language Modelling | Code Available | 1
Trends, Applications, and Challenges in Human Attention Modelling | Feb 28, 2024 | Language Modelling | Code Available | 2
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Feb 28, 2024 | Computational Efficiency, image-classification | Unverified | 0
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language Modeling, Language Modelling | Code Available | 1
Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScore | Feb 28, 2024 | Diversity, Form | Code Available | 0