SOTAVerified

Large Language Model

Papers

Showing 60516097 of 6097 papers

TitleStatusHype
Protoformer: Embedding Prototypes for TransformersCode1
Using cognitive psychology to understand GPT-30
Know your audience: specializing grounded language models with listener subtraction0
Putting GPT-3's Creativity to the (Alternative Uses) TestCode0
Automatic Generation of Programming Exercises and Code Explanations using Large Language Models0
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning0
Happenstance: Utilizing Semantic Search to Track Russian State Media Narratives about the Russo-Ukrainian War On Reddit0
Differentially Private Decoding in Large Language Models0
Housekeep: Tidying Virtual Households using Commonsense ReasoningCode1
RankGen: Improving Text Generation with Large Ranking ModelsCode1
The Unreliability of Explanations in Few-shot Prompting for Textual ReasoningCode1
Combining Extraction and Generation for Constructing Belief-Consequence Causal Links0
CodeGen: An Open Large Language Model for Code with Multi-Turn Program SynthesisCode6
Extraction of Sleep Information from Clinical Notes of Patients with Alzheimer's Disease Using Natural Language Processing0
Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text RepresentationCode1
Pop Quiz! Can a Large Language Model Help With Reverse Engineering?0
Hardness Masking via Auto-Regressive Language Model0
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code0
Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow0
MacBERTh: Development and Evaluation of a Historically Pre-trained Language Model for English (1450-1950)0
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticCode1
Adaptive Testing and Debugging of NLP Models0
SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets0
The Klarna Product Page Dataset: Web Element Nomination with Graph Neural Networks and Large Language ModelsCode1
bert2BERT: Towards Reusable Pretrained Language Models0
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts0
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation0
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuningCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic lossCode0
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesisCode0
Transfer training from smaller language model0
Arabic Compact Language Modelling for Resource Limited Devices0
Globalizing BERT-based Transformer Architectures for Long Document Summarization0
Story Centaur: Large Language Model Few Shot Learning as a Creative Writing Tool0
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense GenerationCode1
Graphmax for Text Generation0
A review of on-device fully neural end-to-end automatic speech recognition algorithms0
Supervised Contrastive Learning for Pre-trained Language Model Fine-tuningCode1
Plug-and-Play Conversational ModelsCode1
Plug-and-Play Conversational Models0
Challenge Closed-book Science Exam: A Meta-learning Based Question Answering System0
Explaining Relationships Between Scientific DocumentsCode1
Compressing Language Models using Doped Kronecker Products0
Paraphrasing with Large Language Models0
Fast Transformer Decoding: One Write-Head is All You NeedCode4
Enhancing Clinical Concept Extraction with Contextual Embeddings0
Show:102550
← PrevPage 122 of 122Next →

No leaderboard results yet.