| Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Jun 17, 2024 | 16kLanguage Modeling | CodeCode Available | 9 |
| UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding | Jun 17, 2024 | Contrastive LearningGraph Embedding | CodeCode Available | 1 |
| What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mitigating Large Language Model Hallucination with Faithful Finetuning | Jun 17, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 14 |
| SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Jun 17, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 0 |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Jun 17, 2024 | BenchmarkingGeneral Knowledge | CodeCode Available | 0 |
| SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | FairnessLanguage Modeling | CodeCode Available | 1 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question AnsweringInstruction Following | CodeCode Available | 2 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code GenerationCode Search | CodeCode Available | 0 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Language Modeling with Editable External Knowledge | Jun 17, 2024 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HARE: HumAn pRiors, a key to small language model Efficiency | Jun 17, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Generative Visual Instruction Tuning | Jun 17, 2024 | Image GenerationImage-text matching | CodeCode Available | 0 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Jun 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 |
| STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | Jun 17, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Promises, Outlooks and Challenges of Diffusion Language Modeling | Jun 17, 2024 | ARCHellaSwag | —Unverified | 0 |
| Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels | Jun 17, 2024 | Dataset GenerationInformation Retrieval | —Unverified | 0 |
| CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics | Jun 16, 2024 | ClassificationInformativeness | CodeCode Available | 0 |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Jun 16, 2024 | knowledge editingLanguage Modeling | CodeCode Available | 0 |
| Avoiding Copyright Infringement via Large Language Model Unlearning | Jun 16, 2024 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens | Jun 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning | Jun 16, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Jun 16, 2024 | Continual LearningGSM8K | CodeCode Available | 0 |
| Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp | Jun 16, 2024 | Compiler OptimizationLanguage Modeling | CodeCode Available | 0 |
| VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | Jun 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Reactor Mk.1 performances: MMLU, HumanEval and BBH test results | Jun 15, 2024 | BenchmarkingHumanEval | —Unverified | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data | Jun 15, 2024 | Generative Adversarial NetworkLanguage Modeling | —Unverified | 0 |
| Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition | Jun 15, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Enhanced Clustering for News Event Detection | Jun 15, 2024 | ClusteringEvent Detection | —Unverified | 0 |
| CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Jun 15, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Probability--Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Jun 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Jun 14, 2024 | Image Quality AssessmentLanguage Modeling | —Unverified | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPULanguage Modeling | —Unverified | 0 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| OpenECAD: An Efficient Visual Language Model for Editable 3D-CAD Design | Jun 14, 2024 | 3D Object ReconstructionLanguage Modeling | —Unverified | 0 |
| Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Jun 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |