| dIR -- Discrete Information Retrieval: Conversational Search over Unstructured (and Structured) Data with Large Language Models | Dec 20, 2023 | Conversational SearchInformation Retrieval | —Unverified | 0 |
| LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces | Dec 20, 2023 | DecoderDefinition Modelling | —Unverified | 0 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion | Dec 20, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments | Dec 20, 2023 | Hate Speech DetectionLanguage Modeling | —Unverified | 0 |
| ALMANACS: A Simulatability Benchmark for Language Model Explainability | Dec 20, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 |
| Dynamic Topic Language Model on Heterogeneous Children's Mental Health Clinical Notes | Dec 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Performance Evaluation of a Quantized Large Language Model on Various Smartphones | Dec 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction | Dec 19, 2023 | Contrastive LearningKnowledge Tracing | —Unverified | 0 |
| External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | Dec 19, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Can ChatGPT be Your Personal Medical Assistant? | Dec 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition | Dec 19, 2023 | Conditional Text GenerationDecoder | CodeCode Available | 0 |
| Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives | Dec 19, 2023 | Action GenerationLanguage Modeling | —Unverified | 0 |
| Text-Conditioned Resampler For Long Form Video Understanding | Dec 19, 2023 | EgoSchemaForm | —Unverified | 0 |
| LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Dec 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Founder-GPT: Self-play to evaluate the Founder-Idea fit | Dec 19, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model | Dec 19, 2023 | AllAttribute | —Unverified | 0 |
| UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Linear Attention via Orthogonal Memory | Dec 18, 2023 | Causal Language ModelingComputational Efficiency | —Unverified | 0 |
| Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VinaLLaMA: LLaMA-based Vietnamese Foundation Model | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evaluating Language-Model Agents on Realistic Autonomous Tasks | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction | Dec 18, 2023 | Entity EmbeddingsLanguage Modeling | CodeCode Available | 0 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems | Dec 18, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models | Dec 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| StarVector: Generating Scalable Vector Graphics Code from Images and Text | Dec 17, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 5 |
| Decoding Concerns: Multi-label Classification of Vaccine Sentiments in Social Media | Dec 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Knowledge Trees: Gradient Boosting Decision Trees on Knowledge Neurons as Probing Classifier | Dec 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FedMKGC: Privacy-Preserving Federated Multilingual Knowledge Graph Completion | Dec 17, 2023 | Entity AlignmentFederated Learning | —Unverified | 0 |
| Mixed Distillation Helps Smaller Language Model Better Reasoning | Dec 17, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Dec 16, 2023 | CPUGPU | CodeCode Available | 5 |
| Learning Interpretable Queries for Explainable Image Classification with Information Pursuit | Dec 16, 2023 | Dictionary Learningimage-classification | —Unverified | 0 |
| DeepArt: A Benchmark to Advance Fidelity Research in AI-Generated Content | Dec 16, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Paloma: A Benchmark for Evaluating Language Model Fit | Dec 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation | Dec 15, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Student as an Inherent Denoiser of Noisy Teacher | Dec 15, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition | Dec 15, 2023 | Automatic Speech RecognitionLanguage Identification | —Unverified | 0 |
| Context-Driven Interactive Query Simulations Based on Generative Large Language Models | Dec 15, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent | Dec 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Catwalk: A Unified Language Model Evaluation Framework for Many Datasets | Dec 15, 2023 | In-Context LearningLanguage Model Evaluation | CodeCode Available | 1 |
| InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs | Dec 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Challenges with unsupervised LLM knowledge discovery | Dec 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference | Dec 14, 2023 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Pixel Aligned Language Models | Dec 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Modeling on a SpiNNaker 2 Neuromorphic Chip | Dec 14, 2023 | Gesture RecognitionLanguage Modeling | —Unverified | 0 |
| Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |