| SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models | Mar 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SaulLM-7B: A pioneering Large Language Model for Law | Mar 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Assessing the Aesthetic Evaluation Capabilities of GPT-4 with Vision: Insights from Group and Individual Assessments | Mar 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery | Mar 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling | Mar 5, 2024 | AllLanguage Modeling | CodeCode Available | 2 |
| Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Breeze-7B Technical Report | Mar 5, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Learning to Maximize Mutual Information for Chain-of-Thought Distillation | Mar 5, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4 | Mar 5, 2024 | FairnessLanguage Modeling | CodeCode Available | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Socratic Reasoning Improves Positive Text Rewriting | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Training A Chinese Large Language Model for Anesthesiology | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents | Mar 5, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment | Mar 5, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| DPPA: Pruning Method for Large Language Model to Model Merging | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MeanCache: User-Centric Semantic Caching for LLM Web Services | Mar 5, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Mar 5, 2024 | Concept AlignmentExplanation Generation | —Unverified | 0 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Word Importance Explains How Prompts Affect Language Model Outputs | Mar 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law Dataset | Mar 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NoteLLM: A Retrievable Large Language Model for Note Recommendation | Mar 4, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Mar 4, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| RegionGPT: Towards Region Understanding Vision Language Model | Mar 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |