| Shai: A large language model for asset management | Dec 21, 2023 | Asset ManagementLanguage Modeling | —Unverified | 0 |
| AsyncMLD: Asynchronous Multi-LLM Framework for Dialogue Recommendation System | Dec 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Dec 21, 2023 | Image RetrievalImage-to-Text Retrieval | CodeCode Available | 1 |
| Speech Translation with Large Language Models: An Industrial Practice | Dec 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Sentence Grounding for Long-term Instructional Video | Dec 21, 2023 | DenoisingDescriptive | —Unverified | 0 |
| Developing Interactive Tourism Planning: A Dialogue Robot System Powered by a Large Language Model | Dec 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Context-aware Decoding Reduces Hallucination in Query-focused Summarization | Dec 21, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 |
| Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Dec 21, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| VideoPoet: A Large Language Model for Zero-Shot Video Generation | Dec 21, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation | Dec 20, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Machine Mindset: An MBTI Exploration of Large Language Models | Dec 20, 2023 | Large Language ModelPersonality Alignment | CodeCode Available | 2 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces | Dec 20, 2023 | DecoderDefinition Modelling | —Unverified | 0 |
| Fine-tuning Large Language Models for Adaptive Machine Translation | Dec 20, 2023 | In-Context LearningLanguage Modelling | CodeCode Available | 1 |
| dIR -- Discrete Information Retrieval: Conversational Search over Unstructured (and Structured) Data with Large Language Models | Dec 20, 2023 | Conversational SearchInformation Retrieval | —Unverified | 0 |
| In Generative AI we Trust: Can Chatbots Effectively Verify Political Information? | Dec 20, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion | Dec 20, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| A Performance Evaluation of a Quantized Large Language Model on Various Smartphones | Dec 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | Dec 19, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction | Dec 19, 2023 | Contrastive LearningKnowledge Tracing | —Unverified | 0 |
| Can ChatGPT be Your Personal Medical Assistant? | Dec 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives | Dec 19, 2023 | Action GenerationLanguage Modeling | —Unverified | 0 |
| Sparse is Enough in Fine-tuning Pre-trained Large Language Models | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Founder-GPT: Self-play to evaluate the Founder-Idea fit | Dec 19, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Text-Conditioned Resampler For Long Form Video Understanding | Dec 19, 2023 | EgoSchemaForm | —Unverified | 0 |
| "Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Dec 18, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 |
| G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies | Dec 18, 2023 | 3d scene graph generationgraph construction | —Unverified | 0 |
| VinaLLaMA: LLaMA-based Vietnamese Foundation Model | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| StarVector: Generating Scalable Vector Graphics Code from Images and Text | Dec 17, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 5 |
| Demystifying Instruction Mixing for Fine-tuning Large Language Models | Dec 17, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| A Unified Framework for Multi-Domain CTR Prediction via Large Language Models | Dec 17, 2023 | Click-Through Rate PredictionLanguage Modelling | CodeCode Available | 1 |
| LLM-Twin: Mini-Giant Model-driven Beyond 5G Digital Twin Networking Framework with Semantic Secure Communication and Computation | Dec 17, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Decoding Concerns: Multi-label Classification of Vaccine Sentiments in Social Media | Dec 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Knowledge Trees: Gradient Boosting Decision Trees on Knowledge Neurons as Probing Classifier | Dec 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Dec 16, 2023 | CPUGPU | CodeCode Available | 5 |
| Learning Interpretable Queries for Explainable Image Classification with Information Pursuit | Dec 16, 2023 | Dictionary Learningimage-classification | —Unverified | 0 |
| M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base | Dec 16, 2023 | cross-modal alignmentKnowledge Graphs | CodeCode Available | 0 |
| DeepArt: A Benchmark to Advance Fidelity Research in AI-Generated Content | Dec 16, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Resolving Crash Bugs via Large Language Models: An Empirical Study | Dec 16, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Context-Driven Interactive Query Simulations Based on Generative Large Language Models | Dec 15, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language | Dec 15, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| ProCoT: Stimulating Critical Thinking and Writing of Students through Engagement with Large Language Models (LLMs) | Dec 15, 2023 | Active LearningLanguage Modelling | —Unverified | 0 |
| ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent | Dec 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Student as an Inherent Denoiser of Noisy Teacher | Dec 15, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs | Dec 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |