| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0)Fact Checking | CodeCode Available | 1 |
| CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities | Jul 1, 2024 | 3D visual groundingLanguage Modeling | —Unverified | 0 |
| RegMix: Data Mixture as Regression for Language Model Pre-training | Jul 1, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 2 |
| IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Scaling Technology Acceptance Analysis with Large Language Model (LLM) Annotation Systems | Jun 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Explaining Chest X-ray Pathology Models using Textual Concepts | Jun 30, 2024 | counterfactualLanguage Modeling | —Unverified | 0 |
| Characterizing Stereotypical Bias from Privacy-preserving Pre-Training | Jun 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning Formal Mathematics From Intrinsic Motivation | Jun 30, 2024 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 2 |
| Financial Knowledge Large Language Model | Jun 29, 2024 | Few-Shot LearningFinancial Analysis | —Unverified | 0 |
| Potential Renovation of Information Search Process with the Power of Large Language Model for Healthcare | Jun 29, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Answering real-world clinical questions using large language model based systems | Jun 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Open-Source Conversational AI with SpeechBrain 1.0 | Jun 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Jun 29, 2024 | DiversityImage Generation | CodeCode Available | 0 |
| A Study on Effect of Reference Knowledge Choice in Generating Technical Content Relevant to SAPPhIRE Model Using Large Language Model | Jun 29, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Teola: Towards End-to-End Optimization of LLM-based Applications | Jun 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The Qiyas Benchmark: Measuring ChatGPT Mathematical and Language Understanding in Arabic | Jun 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Jun 28, 2024 | Interactive SegmentationLanguage Modeling | CodeCode Available | 3 |
| Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification | Jun 28, 2024 | Fact CheckingFact Verification | CodeCode Available | 0 |
| BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5 | Jun 28, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Designing and Evaluating Multi-Chatbot Interface for Human-AI Communication: Preliminary Findings from a Persuasion Task | Jun 28, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Investigating the Timescales of Language Processing with EEG and Language Models | Jun 28, 2024 | EEGLanguage Modeling | —Unverified | 0 |
| YuLan: An Open-source Large Language Model | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Scaling Synthetic Data Creation with 1,000,000,000 Personas | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 11 |
| Simulating Financial Market via Large Language Model based Agents | Jun 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive Draft-Verification for Efficient Large Language Model Decoding | Jun 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Meta Large Language Model Compiler: Foundation Models of Compiler Optimization | Jun 27, 2024 | Compiler OptimizationGPU | —Unverified | 0 |
| LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models | Jun 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| xTower: A Multilingual LLM for Explaining and Correcting Translation Errors | Jun 27, 2024 | Error UnderstandingLanguage Modeling | —Unverified | 0 |
| PathAlign: A vision-language model for whole slide images in histopathology | Jun 27, 2024 | DiagnosticImage Retrieval | —Unverified | 0 |
| Efficacy of Language Model Self-Play in Non-Zero-Sum Games | Jun 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation | Jun 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Length Optimization in Conformal Prediction | Jun 27, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 0 |
| Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs | Jun 27, 2024 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Decoding-Time Language Model Alignment with Multiple Objectives | Jun 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LICO: Large Language Models for In-Context Molecular Optimization | Jun 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data | Jun 26, 2024 | DecoderGPU | —Unverified | 0 |
| Octo-planner: On-device Language Model for Planner-Action Agents | Jun 26, 2024 | Computational EfficiencyIn-Context Learning | —Unverified | 0 |
| Llamipa: An Incremental Discourse Parser | Jun 26, 2024 | Discourse ParsingLanguage Modeling | —Unverified | 0 |
| Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Large Language Model Aided Program Refinement | Jun 26, 2024 | HumanEvalLanguage Modeling | —Unverified | 0 |
| A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry | Jun 26, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| MammothModa: Multi-Modal Large Language Model | Jun 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| S3: A Simple Strong Sample-effective Multimodal Dialog System | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Cascading Large Language Models for Salient Event Graph Generation | Jun 26, 2024 | Graph GenerationLanguage Modeling | CodeCode Available | 0 |
| Explicit Diversity Conditions for Effective Question Answer Generation with Large Language Models | Jun 26, 2024 | Answer GenerationData Augmentation | —Unverified | 0 |
| The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |