SOTAVerified

Language Modelling

A language model is a probabilistic model of natural language. Language models are useful for a variety of tasks, including speech recognition, machine translation, natural language generation (producing human-like text), optical character recognition, route optimization, handwriting recognition, grammar induction, and information retrieval.

Large language models (LLMs), currently their most advanced form, are predominantly based on transformers trained on large datasets, frequently text scraped from the public internet. They have superseded recurrent neural network-based models, which had in turn superseded purely statistical models such as the word n-gram language model.

Source: Wikipedia
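
The n-gram family mentioned above is worth making concrete, since it is the baseline the neural models are measured against: it assigns probabilities to word sequences purely by counting. Below is a minimal sketch of a word-bigram model with add-one smoothing; the toy corpus and function names are illustrative choices, not from any particular library, and real systems use far larger corpora and stronger smoothing (e.g. Kneser-Ney).

```python
import math
from collections import Counter

# Toy corpus; a real n-gram model would be estimated from millions of words.
corpus = "the cat sat on the mat . the dog sat on the rug .".split()

unigram_counts = Counter(corpus)
bigram_counts = Counter(zip(corpus, corpus[1:]))
vocab_size = len(unigram_counts)

def bigram_prob(prev: str, word: str) -> float:
    """P(word | prev) with add-one (Laplace) smoothing, so unseen
    bigrams get a small nonzero probability instead of zero."""
    return (bigram_counts[(prev, word)] + 1) / (unigram_counts[prev] + vocab_size)

def sequence_logprob(words: list[str]) -> float:
    """Log-probability of a word sequence under the bigram model."""
    return sum(math.log(bigram_prob(p, w)) for p, w in zip(words, words[1:]))

print(bigram_prob("the", "cat"))   # seen bigram: relatively high probability
print(bigram_prob("the", "sat"))   # unseen bigram: small but nonzero
print(sequence_logprob("the cat sat on the rug .".split()))
```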

Papers

Showing 8351–8400 of 17610 papers

Title | Status | Hype
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions | | 0
Effects of sub-word segmentation on performance of transformer language models | | 0
Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model | | 0
Investigating the Impact of Text Summarization on Topic Modeling | | 0
Investigating the Impact of Word Informativeness on Speech Emotion Recognition | | 0
Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | | 0
Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training | | 0
Investigating the Timescales of Language Processing with EEG and Language Models | | 0
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition | | 0
Investigating Vision-Language Model for Point Cloud-based Vehicle Classification | | 0
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language | | 0
Investigation of Large-Margin Softmax in Neural Language Modeling | | 0
Investigation on N-gram Approximated RNNLMs for Recognition of Morphologically Rich Speech | | 0
Investigations in Exact Inference for Hierarchical Translation | | 0
Investigations on Phrase-based Decoding with Recurrent Neural Network Language and Translation Models | | 0
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent | | 0
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking | | 0
IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | | 0
iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop | | 0
IQLS: Framework for leveraging Metadata to enable Large Language Model based queries to complex, versatile Data | | 0
IRCologne at GermEval 2021: Toxicity Classification | | 0
iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization | | 0
IRISA participation to BioNLP-ST13: lazy-learning and information retrieval for information extraction tasks | | 0
IRIT at TRAC 2020 | | 0
Irreducible Curriculum for Language Model Pretraining | | 0
Irrelevant Alternatives Bias Large Language Model Hiring Decisions | | 0
Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | | 0
Is Bad Structure Better Than No Structure?: Unsupervised Parsing for Realisation Ranking | | 0
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing | | 0
Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation | | 0
Is ChatGPT Equipped with Emotional Dialogue Capabilities? | | 0
Is Context Helpful for Chat Translation Evaluation? | | 0
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | | 0
I See Dead People: Gray-Box Adversarial Attack on Image-To-Text Models | | 0
Is Einstein more agreeable and less neurotic than Hitler? A computational exploration of the emotional and personality profiles of historical persons | | 0
Is Encoder-Decoder Redundant for Neural Machine Translation? | | 0
Is English the New Programming Language? How About Pseudo-code Engineering? | | 0
Is GPT-4 a reliable rater? Evaluating Consistency in GPT-4 Text Ratings | | 0
Is it an i or an l: Test-time Adaptation of Text Line Recognition Models | | 0
Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models | | 0
Is Language Modeling Enough? Evaluating Effective Embedding Combinations | | 0
Is Large Language Model Good at Triple Set Prediction? An Empirical Study | | 0
Is My Model Using The Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning | | 0
Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | | 0
ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | | 0
Toward Trustworthy Neural Program Synthesis | | 0
Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities | | 0
Is Supervised Syntactic Parsing Beneficial for Language Understanding Tasks? An Empirical Investigation | | 0
Is Surprisal in Issue Trackers Actionable? | | 0
Is There Any Social Principle for LLM-Based Agents? | | 0
Page 168 of 353

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Decay RNN | Validation perplexity | 76.67 | | Unverified
2 | GRU | Validation perplexity | 53.78 | | Unverified
3 | LSTM | Validation perplexity | 52.73 | | Unverified
4 | LSTM | Test perplexity | 48.7 | | Unverified
5 | Temporal CNN | Test perplexity | 45.2 | | Unverified
6 | TCN | Test perplexity | 45.19 | | Unverified
7 | GCNN-8 | Test perplexity | 44.9 | | Unverified
8 | Neural cache model (size = 100) | Test perplexity | 44.8 | | Unverified
9 | Neural cache model (size = 2,000) | Test perplexity | 40.8 | | Unverified
10 | GPT-2 Small | Test perplexity | 37.5 | | Unverified
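
For readers comparing rows: the perplexity numbers in these tables are the exponential of the model's average per-token negative log-likelihood on the held-out split, so lower is better, and a perplexity of k means the model is on average as uncertain as a uniform choice among k tokens. A minimal, model-agnostic sketch of the computation (the per-token log-probabilities are assumed to come from whatever model is being scored):

```python
import math

def perplexity(token_logprobs: list[float]) -> float:
    """Perplexity from natural-log token probabilities:
    ppl = exp(-(1/N) * sum_i log p(token_i | context))."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

# A model that assigns each of 4 tokens probability 0.1 has perplexity 10:
print(perplexity([math.log(0.1)] * 4))  # -> 10.0 (up to float rounding)
```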

# | Model | Metric | Claimed | Verified | Status
1 | TCN | Test perplexity | 108.47 | | Unverified
2 | Seq-U-Net | Test perplexity | 107.95 | | Unverified
3 | GRU (Bai et al., 2018) | Test perplexity | 92.48 | | Unverified
4 | R-Transformer | Test perplexity | 84.38 | | Unverified
5 | Zaremba et al. (2014) - LSTM (medium) | Test perplexity | 82.7 | | Unverified
6 | Gal & Ghahramani (2016) - Variational LSTM (medium) | Test perplexity | 79.7 | | Unverified
7 | LSTM (Bai et al., 2018) | Test perplexity | 78.93 | | Unverified
8 | Zaremba et al. (2014) - LSTM (large) | Test perplexity | 78.4 | | Unverified
9 | Gal & Ghahramani (2016) - Variational LSTM (large) | Test perplexity | 75.2 | | Unverified
10 | Inan et al. (2016) - Variational RHN | Test perplexity | 66 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSTM (7 layers) | Bits per Character (BPC) | 1.67 | | Unverified
2 | Hypernetworks | Bits per Character (BPC) | 1.34 | | Unverified
3 | SHA-LSTM (4 layers, h=1024, no attention head) | Bits per Character (BPC) | 1.33 | | Unverified
4 | LN HM-LSTM | Bits per Character (BPC) | 1.32 | | Unverified
5 | ByteNet | Bits per Character (BPC) | 1.31 | | Unverified
6 | Recurrent Highway Networks | Bits per Character (BPC) | 1.27 | | Unverified
7 | Large FS-LSTM-4 | Bits per Character (BPC) | 1.25 | | Unverified
8 | Large mLSTM | Bits per Character (BPC) | 1.24 | | Unverified
9 | AWD-LSTM (3 layers) | Bits per Character (BPC) | 1.23 | | Unverified
10 | Cluster-Former (#C=512) | Bits per Character (BPC) | 1.22 | | Unverified
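
Bits per character is the same cross-entropy idea expressed per character in base 2: BPC equals the average negative log-probability per character in nats divided by ln 2, so a BPC of 1.22 corresponds to a per-character perplexity of 2^1.22, roughly 2.33. A minimal sketch under the same assumptions as the perplexity snippet above:

```python
import math

def bits_per_character(char_logprobs: list[float]) -> float:
    """Average negative log2-probability per character, i.e. the
    per-character cross-entropy in nats divided by ln 2."""
    return -sum(char_logprobs) / (len(char_logprobs) * math.log(2))

# Probability 0.5 per character is exactly 1 bit per character:
print(bits_per_character([math.log(0.5)] * 8))  # -> 1.0
```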

# | Model | Metric | Claimed | Verified | Status
1 | Smaller Transformer 126M (pre-trained) | Test perplexity | 33 | | Unverified
2 | OPT 125M | Test perplexity | 32.26 | | Unverified
3 | Larger Transformer 771M (pre-trained) | Test perplexity | 28.1 | | Unverified
4 | OPT 1.3B | Test perplexity | 19.55 | | Unverified
5 | GPT-Neo 125M | Test perplexity | 17.83 | | Unverified
6 | OPT 2.7B | Test perplexity | 17.81 | | Unverified
7 | Smaller Transformer 126M (fine-tuned) | Test perplexity | 12 | | Unverified
8 | GPT-Neo 1.3B | Test perplexity | 11.46 | | Unverified
9 | Transformer 125M | Test perplexity | 10.7 | | Unverified
10 | GPT-Neo 2.7B | Test perplexity | 10.44 | | Unverified