| DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Jun 17, 2024 | 16kLanguage Modeling | CodeCode Available | 9 |
| What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-training Large Language Models through Knowledge Detection | Jun 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding | Jun 17, 2024 | Contrastive LearningGraph Embedding | CodeCode Available | 1 |
| Mitigating Large Language Model Hallucination with Faithful Finetuning | Jun 17, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Retrieval-Augmented Feature Generation for Domain-Specific Classification | Jun 17, 2024 | Classificationdomain classification | —Unverified | 0 |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 14 |
| Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Jun 17, 2024 | BenchmarkingGeneral Knowledge | CodeCode Available | 0 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | Jun 17, 2024 | GPULanguage Modeling | —Unverified | 0 |
| SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Visual Instruction Tuning | Jun 17, 2024 | Image GenerationImage-text matching | CodeCode Available | 0 |
| Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Jun 17, 2024 | Fake News DetectionLanguage Modeling | CodeCode Available | 0 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | FairnessLanguage Modeling | CodeCode Available | 1 |
| STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | Jun 17, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code GenerationCode Search | CodeCode Available | 0 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Jun 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 |
| HARE: HumAn pRiors, a key to small language model Efficiency | Jun 17, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels | Jun 17, 2024 | Dataset GenerationInformation Retrieval | —Unverified | 0 |
| Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question AnsweringInstruction Following | CodeCode Available | 2 |