Mixture of Experts with Mixture of Precisions for Tuning Quality of Service Jul 19, 2024 CPU GPU
— Unverified 0Visual Text Generation in the Wild Jul 19, 2024 Language Modelling Large Language Model
— Unverified 0Longhorn: State Space Models are Amortized Online Learners Jul 19, 2024 Language Modeling Language Modelling
Code Code Available 2RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering Jul 19, 2024 Domain Generalization Form
Code Code Available 2Handling Numeric Expressions in Automatic Speech Recognition Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learning Visual Grounding from Generative Vision and Language Model Jul 18, 2024 Attribute Language Modeling
— Unverified 0FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking Jul 18, 2024 In-Context Learning Language Modeling
— Unverified 0Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle Jul 18, 2024 Benchmarking Language Modeling
— Unverified 0ViLLa: Video Reasoning Segmentation with Large Language Model Jul 18, 2024 Image Segmentation Language Modeling
Code Code Available 1FuLG: 150B Romanian Corpus for Language Model Pretraining Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing Jul 18, 2024 Instruction Following Language Modeling
Code Code Available 1Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0Combining Constraint Programming Reasoning with Large Language Model Predictions Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0Towards Zero-Shot Multimodal Machine Translation Jul 18, 2024 Language Modelling Machine Translation
Code Code Available 0TrialEnroll: Predicting Clinical Trial Enrollment Success with Deep & Cross Network and Large Language Models Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0AlcLaM: Arabic Dialectal Language Model Jul 18, 2024 Language Modeling Language Modelling
Code Code Available 0SegPoint: Segment Any Point Cloud via Large Language Model Jul 18, 2024 3D Semantic Segmentation Language Modeling
— Unverified 0Affordance Perception by a Knowledge-Guided Vision-Language Model with Efficient Error Correction Jul 18, 2024 Autonomous Navigation Language Modeling
— Unverified 0Research on Tibetan Tourism Viewpoints information generation system based on LLM Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data Jul 18, 2024 Language Modelling Large Language Model
— Unverified 0Do These LLM Benchmarks Agree? Fixing Benchmark Evaluation with BenchBench Jul 18, 2024 Language Modelling
Code Code Available 1Transformer-based Single-Cell Language Model: A Survey Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models Jul 18, 2024 Hallucination Language Modelling
— Unverified 0Retrieval-Enhanced Machine Learning: Synthesis and Opportunities Jul 17, 2024 Information Retrieval Language Modeling
— Unverified 0R+X: Retrieval and Execution from Everyday Human Videos Jul 17, 2024 Imitation Learning In-Context Learning
— Unverified 0Krutrim LLM: A Novel Tokenization Strategy for Multilingual Indic Languages with Petabyte-Scale Data Processing Jul 17, 2024 Articles Language Modeling
— Unverified 0Analyzing the Generalization and Reliability of Steering Vectors Jul 17, 2024 Language Modeling Language Modelling
Code Code Available 1SENTAUR: Security EnhaNced Trojan Assessment Using LLMs Against Undesirable Revisions Jul 17, 2024 Language Modelling Large Language Model
— Unverified 0VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions Jul 17, 2024 Autonomous Vehicles Language Modeling
— Unverified 0Conversational Query Reformulation with the Guidance of Retrieved Documents Jul 17, 2024 Conversational Question Answering Conversational Search
— Unverified 0LLM Inference Serving: Survey of Recent Advances and Opportunities Jul 17, 2024 Language Modeling Language Modelling
— Unverified 0Beyond Next Token Prediction: Patch-Level Training for Large Language Models Jul 17, 2024 Language Modeling Language Modelling
Code Code Available 2Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale Jul 17, 2024 GPU LAMBADA
Code Code Available 2F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions Jul 17, 2024 Human-Object Interaction Detection Language Modelling
— Unverified 0LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Jul 17, 2024 Benchmarking Language Modelling
— Unverified 0BadRobot: Jailbreaking Embodied LLMs in the Physical World Jul 16, 2024 Language Modeling Language Modelling
— Unverified 0SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning Jul 16, 2024 Instruction Following Language Modelling
Code Code Available 1Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain Jul 16, 2024 Decision Making Language Modeling
Code Code Available 0LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task Jul 16, 2024 image-classification Image Classification
Code Code Available 1A Language Modeling Approach to Diacritic-Free Hebrew TTS Jul 16, 2024 Language Modeling Language Modelling
— Unverified 0UrbanWorld: An Urban World Model for 3D City Generation Jul 16, 2024 Decision Making Language Modelling
Code Code Available 2A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora Jul 16, 2024 Language Modeling Language Modelling
— Unverified 0InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains Jul 16, 2024 Decision Making Language Modeling
Code Code Available 1Exploring Quantization for Efficient Pre-Training of Transformer Language Models Jul 16, 2024 Language Modeling Language Modelling
Code Code Available 1LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction Jul 16, 2024 Language Modeling Language Modelling
Code Code Available 2GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text Jul 16, 2024 Feature Engineering Language Modelling
— Unverified 0How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models Jul 16, 2024 Decision Making Language Modeling
Code Code Available 0XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Jul 16, 2024 Data Augmentation Explanation Generation
Code Code Available 0