| Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Jul 24, 2024 | ARC, Inductive Bias | Code Available | 1 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback | Jul 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data Augmentation, Language Modeling | Code Available | 1 |
| dMel: Speech Tokenization made Simple | Jul 22, 2024 | Decoder, Language Modeling | Code Available | 1 |
| Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning | Jul 20, 2024 | Contrastive Learning, Diagnostic | Code Available | 1 |
| ViLLa: Video Reasoning Segmentation with Large Language Model | Jul 18, 2024 | Image Segmentation, Language Modeling | Code Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction Following, Language Modeling | Code Available | 1 |
| Analyzing the Generalization and Reliability of Steering Vectors | Jul 17, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task | Jul 16, 2024 | image-classification, Image Classification | Code Available | 1 |
| Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Jul 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| On Large Language Model Continual Unlearning | Jul 14, 2024 | Disentanglement, Language Modeling | Code Available | 1 |
| ChatLogic: Integrating Logic Programming with Large Language Models for Multi-Step Reasoning | Jul 14, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| IoT-LM: Large Multisensory Language Models for the Internet of Things | Jul 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Benchmarking Language Model Creativity: A Case Study on Code Generation | Jul 12, 2024 | Benchmarking, Code Generation | Code Available | 1 |
| Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Models | Jul 12, 2024 | All, Drug Discovery | Code Available | 1 |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Jul 12, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysis, document understanding | Code Available | 1 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEG, Language Modeling | Code Available | 1 |
| Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility | Jul 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Jul 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model | Jul 10, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Jul 9, 2024 | CoLA, Language Modeling | Code Available | 1 |
| DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Large language models are good medical coders, if provided with tools | Jul 6, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System | Jul 4, 2024 | Event Detection, Language Modeling | Code Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| GPTCast: a weather language model for precipitation nowcasting | Jul 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Model Alignment in Multilingual Trolley Problems | Jul 2, 2024 | Decision Making, Ethics | Code Available | 1 |
| SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model | Jul 1, 2024 | Knowledge Tracing, Language Modeling | Code Available | 1 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking | Code Available | 1 |
| Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model | Jun 28, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Decoding-Time Language Model Alignment with Multiple Objectives | Jun 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis | Jun 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Jun 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Jun 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARC, Language Modeling | Code Available | 1 |
| The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Jun 25, 2024 | Knowledge Graph Completion, Knowledge Graphs | Code Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPU, Language Modeling | Code Available | 1 |
| Multi-property Steering of Large Language Models with Dynamic Activation Composition | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification | Jun 25, 2024 | Contrastive Learning, few-shot-htc | Code Available | 1 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Jun 24, 2024 | Code Generation, HumanEval | Code Available | 1 |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Jun 24, 2024 | Chinese Spell Checking, Language Modeling | Code Available | 1 |
| TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers | Jun 22, 2024 | Decoder, Language Modeling | Code Available | 1 |
| InternLM-Law: An Open Source Chinese Legal Large Language Model | Jun 21, 2024 | Diversity, Language Modeling | Code Available | 1 |