| ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning | Jan 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support | Jan 26, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Endowing Protein Language Models with Structural Knowledge | Jan 26, 2024 | Drug DesignLanguage Modeling | CodeCode Available | 1 |
| Parameter-Efficient Conversational Recommender System as a Language Processing Task | Jan 25, 2024 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 1 |
| Fluent dreaming for language models | Jan 24, 2024 | Adversarial AttackLanguage Modeling | CodeCode Available | 1 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Can Large Language Models Write Parallel Code? | Jan 23, 2024 | Code CompletionCode Generation | CodeCode Available | 1 |
| MolTailor: Tailoring Chemical Molecular Representation to Specific Tasks via Text Prompts | Jan 21, 2024 | Drug DiscoveryLanguage Modeling | CodeCode Available | 1 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Self-Rewarding Language Models | Jan 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| ADCNet: a unified framework for predicting the activity of antibody-drug conjugates | Jan 17, 2024 | Activity PredictionLanguage Modeling | CodeCode Available | 1 |
| Asynchronous Local-SGD Training for Language Modeling | Jan 17, 2024 | Distributed OptimizationLanguage Modeling | CodeCode Available | 1 |
| TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation | Jan 16, 2024 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 1 |
| Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot | Jan 14, 2024 | ChatbotConversational Search | CodeCode Available | 1 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ModaVerse: Efficiently Transforming Modalities with LLMs | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search | Jan 9, 2024 | Code GenerationCode Search | CodeCode Available | 1 |
| Language Models Encode the Value of Numbers Linearly | Jan 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| VLLaVO: Mitigating Visual Gap through LLMs | Jan 6, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| Multi-modal vision-language model for generalizable annotation-free pathology localization and clinical diagnosis | Jan 4, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| PLLaMa: An Open-source Large Language Model for Plant Science | Jan 3, 2024 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Quokka: An Open-source Large Language Model ChatBot for Material Science | Jan 2, 2024 | ArticlesChatbot | CodeCode Available | 1 |
| GeoGalactica: A Scientific Large Language Model in Geoscience | Dec 31, 2023 | Document ClassificationGeneral Knowledge | CodeCode Available | 1 |
| SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection | Dec 31, 2023 | Data AugmentationIntent Detection | CodeCode Available | 1 |
| Open-TI: Open Traffic Intelligence with Augmented Language Model | Dec 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation | Dec 28, 2023 | GSM8KLanguage Model Evaluation | CodeCode Available | 1 |
| DrugAssist: A Large Language Model for Molecule Optimization | Dec 28, 2023 | Drug DiscoveryLanguage Modeling | CodeCode Available | 1 |
| RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation | Dec 26, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study | Dec 23, 2023 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 |
| Exploiting Novel GPT-4 APIs | Dec 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Time is Encoded in the Weights of Finetuned Language Models | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Dec 20, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Dec 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems | Dec 18, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models | Dec 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Catwalk: A Unified Language Model Evaluation Framework for Many Datasets | Dec 15, 2023 | In-Context LearningLanguage Model Evaluation | CodeCode Available | 1 |
| Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation | Dec 15, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ViLA: Efficient Video-Language Alignment for Video Question Answering | Dec 13, 2023 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention | Dec 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On Diversified Preferences of Large Language Model Alignment | Dec 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling | Dec 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive LearningHallucination | CodeCode Available | 1 |
| Gated Linear Attention Transformers with Hardware-Efficient Training | Dec 11, 2023 | 2kLanguage Modeling | CodeCode Available | 1 |