When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? Nov 25, 2024 Knowledge Distillation Language Modeling
Code Code Available 0Towards Agentic Schema Refinement Nov 25, 2024 Language Modeling Language Modelling
— Unverified 0Tree Transformers are an Ineffective Model of Syntactic Constituency Nov 25, 2024 Language Modeling Language Modelling
— Unverified 0StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training Nov 25, 2024 document understanding Language Modeling
— Unverified 0SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text Nov 25, 2024 Language Modeling Language Modelling
— Unverified 0Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? Nov 24, 2024 Language Modeling Language Modelling
Code Code Available 0Generative Prompt Internalization Nov 24, 2024 Language Modeling Language Modelling
Code Code Available 0Can a Large Language Model Learn Matrix Functions In Context? Nov 24, 2024 In-Context Learning Language Modeling
Code Code Available 0Ensuring Fair LLM Serving Amid Diverse Applications Nov 24, 2024 Fairness Language Modeling
— Unverified 0AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset Nov 23, 2024 Language Modeling Language Modelling
— Unverified 0From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars Nov 23, 2024 Descriptive In-Context Learning
Code Code Available 0Enabling Efficient Serverless Inference Serving for LLM (Large Language Model) in the Cloud Nov 23, 2024 GPU Language Modeling
— Unverified 0Automatic High-quality Verilog Assertion Generation through Subtask-Focused Fine-Tuned LLMs and Iterative Prompting Nov 23, 2024 Language Modeling Language Modelling
— Unverified 0MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model Nov 23, 2024 Language Modeling Language Modelling
Code Code Available 0Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment Nov 23, 2024 Language Modeling Language Modelling
Code Code Available 0The BS-meter: A ChatGPT-Trained Instrument to Detect Sloppy Language-Games Nov 22, 2024 Language Modeling Language Modelling
— Unverified 0Astro-HEP-BERT: A bidirectional language model for studying the meanings of concepts in astrophysics and high energy physics Nov 22, 2024 Articles Language Modeling
— Unverified 0Effective SAM Combination for Open-Vocabulary Semantic Segmentation Nov 22, 2024 Decoder Language Modeling
— Unverified 0ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation Nov 22, 2024 Causal Language Modeling Language Modeling
— Unverified 0A Framework for Evaluating LLMs Under Task Indeterminacy Nov 21, 2024 Language Modeling Language Modelling
— Unverified 0DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization Nov 21, 2024 Language Modeling Language Modelling
Code Code Available 0Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge Nov 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Memory Backdoor Attacks on Neural Networks Nov 21, 2024 Backdoor Attack Federated Learning
— Unverified 0Schemato -- An LLM for Netlist-to-Schematic Conversion Nov 21, 2024 Language Modeling Language Modelling
— Unverified 0PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation Nov 21, 2024 Language Modeling Language Modelling
Code Code Available 0Patience Is The Key to Large Language Model Reasoning Nov 20, 2024 GSM8K Language Modeling
— Unverified 0S^2ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning Nov 20, 2024 Language Modeling Language Modelling
— Unverified 0LightLLM: A Versatile Large Language Model for Predictive Light Sensing Nov 20, 2024 Language Modeling Language Modelling
— Unverified 0Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Nov 20, 2024 Language Modeling Language Modelling
Code Code Available 0MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification Nov 20, 2024 Decoder Language Modeling
— Unverified 0Multimodal large language model for wheat breeding: a new exploration of smart breeding Nov 20, 2024 Language Modeling Language Modelling
— Unverified 0Advancing Complex Medical Communication in Arabic with Sporo AraSum: Surpassing Existing Large Language Models Nov 20, 2024 Decision Making Language Modeling
— Unverified 0Beyond Visual Understanding: Introducing PARROT-360V for Vision Language Model Benchmarking Nov 20, 2024 Benchmarking Language Modeling
— Unverified 0Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders Nov 20, 2024 compressed sensing Language Modeling
— Unverified 0Existential Conversations with Large Language Models: Content, Community, and Culture Nov 20, 2024 Language Modeling Language Modelling
— Unverified 0Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding Nov 20, 2024 Code Generation Data Integration
— Unverified 0Watermark under Fire: A Robustness Evaluation of LLM Watermarking Nov 20, 2024 Language Modeling Language Modelling
Code Code Available 0RadPhi-3: Small Language Models for Radiology Nov 19, 2024 4k Language Modeling
— Unverified 0Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model Nov 19, 2024 Language Modeling Language Modelling
— Unverified 0StreetviewLLM: Extracting Geographic Information Using a Chain-of-Thought Multimodal Large Language Model Nov 19, 2024 Decision Making Language Modeling
— Unverified 0Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat Nov 19, 2024 Language Modeling Language Modelling
Code Code Available 0Strengthening Fake News Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques. Defying BERT? Nov 19, 2024 Fake News Detection Language Modeling
— Unverified 0Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction Nov 19, 2024 Language Modeling Language Modelling
Code Code Available 0HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation Nov 19, 2024 Language Modeling Language Modelling
— Unverified 0CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model Nov 19, 2024 Information Retrieval Language Modeling
— Unverified 0A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems Nov 19, 2024 Language Modeling Language Modelling
— Unverified 0Generative Timelines for Instructed Visual Assembly Nov 19, 2024 Language Modelling
— Unverified 0CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs Nov 19, 2024 Hallucination Language Modeling
— Unverified 0PSA-VLM: Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment Nov 18, 2024 Language Modeling Language Modelling
— Unverified 0Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality Nov 18, 2024 Entity Linking Knowledge Graph Embeddings
— Unverified 0