| TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Jun 21, 2024 | Attribute, Language Modeling | Code Available | 1 |
| InternLM-Law: An Open Source Chinese Legal Large Language Model | Jun 21, 2024 | Diversity, Language Modeling | Code Available | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative Inference, Language Modeling | Code Available | 1 |
| VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model | Jun 20, 2024 | Language Modeling | Code Available | 1 |
| SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors | Jun 20, 2024 | Language Modeling | Code Available | 1 |
| Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis | Jun 19, 2024 | Image Segmentation, Language Modeling | Code Available | 1 |
| Improving Visual Commonsense in Language Models via Multiple Image Generation | Jun 19, 2024 | Common Sense Reasoning, Image Generation | Code Available | 1 |
| On AI-Inspired UI-Design | Jun 19, 2024 | Language Modeling | Code Available | 1 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Jun 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language Modeling | Code Available | 1 |
| MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction | Jun 18, 2024 | Drug Discovery, Graph Neural Network | Code Available | 1 |
| UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding | Jun 17, 2024 | Contrastive Learning, Graph Embedding | Code Available | 1 |
| SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model | Jun 17, 2024 | Language Modeling | Code Available | 1 |
| Language Modeling with Editable External Knowledge | Jun 17, 2024 | Articles, Language Modeling | Code Available | 1 |
| Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner | Jun 17, 2024 | Language Modeling | Code Available | 1 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | Fairness, Language Modeling | Code Available | 1 |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jun 17, 2024 | Language Modeling | Code Available | 1 |
| Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition | Jun 15, 2024 | Contrastive Learning, Language Modeling | Code Available | 1 |
| CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Jun 15, 2024 | Domain Adaptation, Language Modeling | Code Available | 1 |
| The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models | Jun 14, 2024 | Fairness, Language Modeling | Code Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 |
| Enhancing Domain Adaptation through Prompt Gradient Alignment | Jun 13, 2024 | Domain Adaptation, Language Modeling | Code Available | 1 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | Articles, Entity Disambiguation | Code Available | 1 |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Jun 12, 2024 | Language Modeling | Code Available | 1 |
| Advancing High Resolution Vision-Language Models in Biomedicine | Jun 12, 2024 | Language Modeling | Code Available | 1 |
| Collective Constitutional AI: Aligning a Language Model with Public Input | Jun 12, 2024 | Language Modeling | Code Available | 1 |
| MambaLRP: Explaining Selective State Space Sequence Models | Jun 11, 2024 | Language Modeling | Code Available | 1 |
| Scaling Large Language Model-based Multi-Agent Collaboration | Jun 11, 2024 | Language Modeling | Code Available | 1 |
| SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature | Jun 10, 2024 | Claim Verification, Instruction Following | Code Available | 1 |
| VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text | Jun 10, 2024 | Language Modeling | Code Available | 1 |
| Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization | Jun 10, 2024 | Language Modeling | Code Available | 1 |
| Soundscape Captioning using Sound Affective Quality Network and Large Language Model | Jun 9, 2024 | Language Modeling | Code Available | 1 |
| MedualTime: A Dual-Adapter Language Model for Medical Time Series-Text Multimodal Learning | Jun 7, 2024 | Contrastive Learning, Language Modeling | Code Available | 1 |
| Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions | Jun 7, 2024 | Language Modeling | Code Available | 1 |
| LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering | Jun 7, 2024 | Graph Question Answering, Language Modeling | Code Available | 1 |
| Revisiting Catastrophic Forgetting in Large Language Model Tuning | Jun 7, 2024 | Language Modeling | Code Available | 1 |
| SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms | Jun 5, 2024 | Language Modeling | Code Available | 1 |
| Xmodel-LM Technical Report | Jun 5, 2024 | Language Modeling | Code Available | 1 |
| Queue management for slo-oriented large language model serving | Jun 5, 2024 | Blocking, GPU | Code Available | 1 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | Classification, GPU | Code Available | 1 |
| Zyda: A 1.3T Dataset for Open Language Modeling | Jun 4, 2024 | Language Modeling | Code Available | 1 |
| VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model | Jun 3, 2024 | Image Outpainting, Language Modeling | Code Available | 1 |
| VerilogReader: LLM-Aided Hardware Test Generation | Jun 3, 2024 | Language Modeling | Code Available | 1 |
| MultiMax: Sparse and Multi-Modal Attention Learning | Jun 3, 2024 | Image Classification | Code Available | 1 |
| HonestLLM: Toward an Honest and Helpful Large Language Model | Jun 1, 2024 | Language Modeling | Code Available | 1 |
| InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation | Jun 1, 2024 | Feature Selection, Language Modeling | Code Available | 1 |
| Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model | May 30, 2024 | Diversity, Language Modeling | Code Available | 1 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling | Code Available | 1 |
| CycleFormer: TSP Solver Based on Language Modeling | May 30, 2024 | Decoder, Language Modeling | Code Available | 1 |
| Language Generation with Strictly Proper Scoring Rules | May 29, 2024 | Language Modeling | Code Available | 1 |