Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms | Mar 26, 2024 | Language Modelling | Code Available | 2
ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition | Mar 26, 2024 | Language Modelling, named-entity-recognition | Code Available | 0
MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation | Mar 26, 2024 | Cross-Lingual Transfer, Language Modelling | Code Available | 2
Graph Language Model (GLM): A new graph-based approach to detect social instabilities | Mar 26, 2024 | Language Modeling, Language Modelling | Unverified | 0
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Mar 26, 2024 | GPU, GSM8K | Code Available | 9
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 2
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair | Mar 25, 2024 | Language Modelling, Large Language Model | Code Available | 2
Cross-lingual Contextualized Phrase Retrieval | Mar 25, 2024 | Contrastive Learning, Language Modelling | Code Available | 0
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 1
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild | Mar 25, 2024 | Decoder, Language Modeling | Code Available | 9
Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model | Mar 25, 2024 | Language Modeling, Language Modelling | Unverified | 0
Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows | Mar 25, 2024 | Language Modeling, Language Modelling | Unverified | 0
The Role of n-gram Smoothing in the Age of Neural Networks | Mar 25, 2024 | Language Modeling, Language Modelling | Unverified | 0
A Hybrid Approach To Aspect Based Sentiment Analysis Using Transfer Learning | Mar 25, 2024 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Unverified | 0
New Intent Discovery with Attracting and Dispersing Prototype | Mar 25, 2024 | Intent Discovery, Language Modeling | Unverified | 0
Can tweets predict article retractions? A comparison between human and LLM labelling | Mar 25, 2024 | Articles, Language Modeling | Unverified | 0
DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive Learning, Image-text Retrieval | Code Available | 2
Understanding Long Videos with Multimodal Language Models | Mar 25, 2024 | Action Recognition, Fine-grained Action Recognition | Code Available | 2
SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 0
Generation of Asset Administration Shell with Large Language Model Agents: Toward Semantic Interoperability in Digital Twins in the Context of Industry 4.0 | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 1
AIOS: LLM Agent Operating System | Mar 25, 2024 | AI Agent, Language Modelling | Code Available | 0
Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models | Mar 25, 2024 | Language Modelling, Large Language Model | Unverified | 0
Leveraging Large Language Model to Generate a Novel Metaheuristic Algorithm with CRISPE Framework | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 0
RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict | Mar 25, 2024 | 16k, Claim Verification | Code Available | 0
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions | Mar 25, 2024 | Language Modeling, Language Modelling | Code Available | 1
Learning To Guide Human Decision Makers With Vision-Language Models | Mar 25, 2024 | Decision Making, Language Modelling | Unverified | 0
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation | Mar 25, 2024 | Language Modeling, Language Modelling | Unverified | 0
LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-based Language Models | Mar 25, 2024 | Adversarial Attack, Language Modeling | Code Available | 0
Enhanced Facet Generation with LLM Editing | Mar 25, 2024 | Information Retrieval, Language Modelling | Unverified | 0
Dia-LLaMA: Towards Large Language Model-driven CT Report Generation | Mar 25, 2024 | Diagnostic, Language Modeling | Unverified | 0
Toward Open-Set Human Object Interaction Detection | Mar 24, 2024 | Contrastive Learning, Human-Object Interaction Detection | Code Available | 0
A Survey on Self-Supervised Graph Foundation Models: Knowledge-Based Perspective | Mar 24, 2024 | Language Modelling, Large Language Model | Code Available | 1
Monotonic Paraphrasing Improves Generalization of Language Model Prompting | Mar 24, 2024 | Language Modeling, Language Modelling | Code Available | 0
CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering | Mar 24, 2024 | Language Modeling, Language Modelling | Unverified | 0
Heterogeneous Federated Learning with Splited Language Model | Mar 24, 2024 | Federated Learning, Language Modeling | Unverified | 0
Qibo: A Large Language Model for Traditional Chinese Medicine | Mar 24, 2024 | Language Modeling, Language Modelling | Unverified | 0
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | Mar 23, 2024 | GPU, Language Modeling | Unverified | 0
Leveraging Large Language Models for Preliminary Security Risk Analysis: A Mission-Critical Case Study | Mar 23, 2024 | Language Modelling, Large Language Model | Unverified | 0
Leveraging Zero-Shot Prompting for Efficient Language Model Distillation | Mar 23, 2024 | Language Modeling, Language Modelling | Unverified | 0
LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification | Mar 23, 2024 | Language Modeling, Language Modelling | Code Available | 0
Protecting Copyrighted Material with Unique Identifiers in Large Language Model Training | Mar 23, 2024 | Language Modeling, Language Modelling | Code Available | 0
AI for Biomedicine in the Era of Large Language Models | Mar 23, 2024 | Language Modeling, Language Modelling | Unverified | 0
Towards a RAG-based Summarization Agent for the Electron-Ion Collider | Mar 23, 2024 | AI Agent, Language Modelling | Code Available | 0
SceneX: Procedural Controllable Large-scale Scene Generation | Mar 23, 2024 | Diversity, Language Modelling | Unverified | 0
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning | Mar 23, 2024 | Language Modeling, Language Modelling | Unverified | 0
Centered Masking for Language-Image Pre-Training | Mar 23, 2024 | Language Modeling, Language Modelling | Code Available | 0
SOEN-101: Code Generation by Emulating Software Process Models Using Large Language Model Agents | Mar 23, 2024 | Code Generation, HumanEval | Unverified | 0
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery | Mar 22, 2024 | Language Modeling, Language Modelling | Unverified | 0
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition | Mar 22, 2024 | Language Modelling, Large Language Model | Unverified | 0
Unifying Large Language Model and Deep Reinforcement Learning for Human-in-Loop Interactive Socially-aware Navigation | Mar 22, 2024 | Benchmarking, Deep Reinforcement Learning | Unverified | 0