Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses Oct 15, 2024 Hallucination Language Modeling
Code Code Available 1EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs Oct 13, 2024 Language Modeling Language Modelling
Code Code Available 1HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Oct 13, 2024 Language Modeling Language Modelling
Code Code Available 1PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning Oct 11, 2024 Data Poisoning Language Modeling
Code Code Available 1Parameter-Efficient Fine-Tuning of State Space Models Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 1PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents Oct 11, 2024 Code Generation Language Modeling
Code Code Available 1Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Oct 11, 2024 Handwritten Text Recognition HTR
Code Code Available 1Zeroth-Order Fine-Tuning of LLMs in Random Subspaces Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 1Retraining-Free Merging of Sparse MoE via Hierarchical Clustering Oct 11, 2024 Clustering Language Modeling
Code Code Available 1Do Unlearning Methods Remove Information from Language Model Weights? Oct 11, 2024 Language Modeling Language Modelling
Code Code Available 1Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning Oct 10, 2024 Language Modelling Large Language Model
Code Code Available 1AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning Oct 10, 2024 Language Modeling Language Modelling
Code Code Available 1Bilinear MLPs enable weight-based mechanistic interpretability Oct 10, 2024 image-classification Image Classification
Code Code Available 1Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining Oct 10, 2024 Language Modeling Language Modelling
Code Code Available 1OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting Oct 10, 2024 Entity Linking Few-Shot Learning
Code Code Available 1AuditWen:An Open-Source Large Language Model for Audit Oct 9, 2024 Answer Generation Language Modeling
Code Code Available 1Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning Oct 9, 2024 Language Modeling Language Modelling
Code Code Available 1Vector-ICL: In-context Learning with Continuous Vector Representations Oct 8, 2024 Classification Graph Classification
Code Code Available 1Training-free Diffusion Model Alignment with Sampling Demons Oct 8, 2024 Denoising Image Generation
Code Code Available 1Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia Oct 7, 2024 Domain Generalization Language Modeling
Code Code Available 1ImProver: Agent-Based Automated Proof Optimization Oct 7, 2024 Language Modelling Large Language Model
Code Code Available 1Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective Oct 6, 2024 CPU GPU
Code Code Available 1LongGenBench: Long-context Generation Benchmark Oct 5, 2024 Language Modelling Retrieval
Code Code Available 1Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval Oct 4, 2024 Descriptive Language Modeling
Code Code Available 1You Know What I'm Saying: Jailbreak Attack via Implicit Reference Oct 4, 2024 Language Modeling Language Modelling
Code Code Available 1FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model Oct 3, 2024 Emotion Recognition Language Modeling
Code Code Available 1General Preference Modeling with Preference Representations for Aligning Language Models Oct 3, 2024 Language Modelling Representation Learning
Code Code Available 1DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects Oct 3, 2024 Benchmarking Imitation Learning
Code Code Available 1ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration Oct 3, 2024 Decision Making Language Modeling
Code Code Available 1Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1EMMA: Efficient Visual Alignment in Multi-Modal LLMs Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices Oct 2, 2024 GPU Language Modeling
Code Code Available 1Exploring Empty Spaces: Human-in-the-Loop Data Augmentation Oct 1, 2024 Data Augmentation Diversity
Code Code Available 1RisingBALLER: A player is a token, a match is a sentence, A path towards a foundational model for football players data analytics Oct 1, 2024 Language Modeling Language Modelling
Code Code Available 1Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting Oct 1, 2024 Continual Learning Language Modeling
Code Code Available 1VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs Sep 30, 2024 EgoSchema Language Modelling
Code Code Available 1LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction Sep 27, 2024 Classification Feature Engineering
Code Code Available 1DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving Sep 26, 2024 Autonomous Driving Language Modeling
Code Code Available 1DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling Sep 25, 2024 Data Augmentation Diversity
Code Code Available 1Counterfactual Token Generation in Large Language Models Sep 25, 2024 Bias Detection counterfactual
Code Code Available 1Training Language Models to Win Debates with Self-Play Improves Judge Accuracy Sep 25, 2024 Language Modeling Language Modelling
Code Code Available 1Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification Sep 25, 2024 Language Modeling Language Modelling
Code Code Available 1FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression Sep 25, 2024 Language Modeling Language Modelling
Code Code Available 1LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation Sep 23, 2024 Language Modeling Language Modelling
Code Code Available 1Instruction Following without Instruction Tuning Sep 21, 2024 Instruction Following Language Modeling
Code Code Available 1One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sep 20, 2024 All Dependency Parsing
Code Code Available 1ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources Sep 20, 2024 Language Modeling Language Modelling
Code Code Available 1DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency Sep 19, 2024 Language Modeling Language Modelling
Code Code Available 1