Language Models as Causal Effect Generators Nov 12, 2024 Causal Inference counterfactual
LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models Nov 11, 2024 Knowledge Distillation Language Modeling
ITER: Iterative Transformer-based Entity Recognition and Relation Extraction Nov 11, 2024 GPU Language Modeling
Aioli: A Unified Optimization Framework for Language Model Data Mixing Nov 8, 2024 Language Modeling Language Modelling
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering Nov 7, 2024 AutoML Hyperparameter Optimization
DELIFT: Data Efficient Language model Instruction Fine Tuning Nov 7, 2024 Language Modeling Language Modelling
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset Nov 5, 2024 Benchmarking Language Modeling
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network Nov 4, 2024 Chunking Language Modelling
Training Compute-Optimal Protein Language Models Nov 4, 2024 Language Modeling Language Modelling
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language Models Nov 4, 2024 Inductive Bias Language Modeling
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge Nov 4, 2024 Diagnostic Language Modeling
GraphXForm: Graph transformer for computer-aided molecular design Nov 3, 2024 Drug Design Drug Discovery
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models Nov 1, 2024 Decision Making Informativeness
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction Oct 31, 2024 Disaster Response Language Modeling
LLaMo: Large Language Model-based Molecular Graph Assistant Oct 31, 2024 Instruction Following IUPAC Name Prediction
Interpretable Language Modeling via Induction-head Ngram Models Oct 31, 2024 Causal Language Modeling Human fMRI response prediction
Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning Oct 30, 2024 In-Context Learning Language Modeling
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback Oct 30, 2024 Decision Making Language Modeling
f-PO: Generalizing Preference Optimization with f-divergence Minimization Oct 29, 2024 Language Modeling Language Modelling
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types Oct 29, 2024 Language Modeling Language Modelling
Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers Oct 29, 2024 Drug Design Language Modeling
LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment Oct 28, 2024 Benchmarking Language Modeling
TrajAgent: An Agent Framework for Unified Trajectory Modelling Oct 27, 2024 Future prediction Language Modeling
Peptide-GPT: Generative Design of Peptides using Generative Pre-trained Transformers and Bio-informatic Supervision Oct 25, 2024 Language Modelling Protein Design
LOGO -- Long cOntext aliGnment via efficient preference Optimization Oct 24, 2024 GPU Language Modeling
GCoder: Improving Large Language Model for Generalized Graph Problem Solving Oct 24, 2024 Language Modeling Language Modelling
Cross-model Control: Improving Multiple Large Language Models in One-time Training Oct 23, 2024 Instruction Following Language Modeling
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration Oct 23, 2024 Language Modeling Language Modelling
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes Oct 22, 2024 GSM8K Language Modeling
Scalable Influence and Fact Tracing for Large Language Model Pretraining Oct 22, 2024 Attribute Language Modeling
Non-myopic Generation of Language Models for Reasoning and Planning Oct 22, 2024 Computational Efficiency Language Modelling
Automated Spinal MRI Labelling from Reports Using a Large Language Model Oct 22, 2024 Language Modeling Language Modelling
Building A Coding Assistant via the Retrieval-Augmented Language Model Oct 21, 2024 Code Completion Code Generation
Residual vector quantization for KV cache compression in large language model Oct 21, 2024 Audio Compression Language Modeling
A Realistic Threat Model for Large Language Model Jailbreaks Oct 21, 2024 Language Modeling Language Modelling
SeisLM: a Foundation Model for Seismic Waveforms Oct 21, 2024 Event Detection Language Modeling
M-RewardBench: Evaluating Reward Models in Multilingual Settings Oct 20, 2024 Language Modeling Language Modelling
Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning Oct 18, 2024 Hallucination Knowledge Base Question Answering
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts Oct 18, 2024 Language Modeling Language Modelling
Starbucks: Improved Training for 2D Matryoshka Embeddings Oct 17, 2024 Language Modelling text similarity
MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation Oct 17, 2024 Decision Making Language Modeling
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems Oct 17, 2024 Answer Generation Language Modeling
FIRE: Fact-checking with Iterative Retrieval and Verification Oct 17, 2024 Claim Verification Fact Checking
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Oct 16, 2024 Language Modeling Language Modelling
CREAM: Consistency Regularized Self-Rewarding Language Models Oct 16, 2024 Language Modeling Language Modelling
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims Oct 16, 2024 Fact Checking Language Modeling
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models Oct 15, 2024 Language Modeling Language Modelling
FVEval: Understanding Language Model Capabilities in Formal Verification of Digital Hardware Oct 15, 2024 Code Generation Language Modeling
TopoLM: brain-like spatio-functional organization in a topographic language model Oct 15, 2024 Language Modeling Language Modelling
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses Oct 15, 2024 Hallucination Language Modeling