SOTAVerified

Language Modeling

Papers

Showing 45014550 of 14182 papers

TitleStatusHype
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical SemanticsCode0
GestureGPT: Toward Zero-Shot Free-Form Hand Gesture Understanding with Large Language Model AgentsCode0
Language Models Can Learn Exceptions to Syntactic RulesCode0
Language Models can Self-Improve at State-Value Estimation for Better SearchCode0
Lightweight Cross-Lingual Sentence Representation LearningCode0
A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy ExpansionCode0
Language Model Sentence Completion with a Parser-Driven Rhetorical Control MethodCode0
Localized Symbolic Knowledge Distillation for Visual Commonsense ModelsCode0
Detecting AI-Generated Texts in Cross-DomainsCode0
ERNIE-Doc: A Retrospective Long-Document Modeling TransformerCode0
Detect Camouflaged Spam Content via StoneSkipping: Graph and Text Joint Embedding for Chinese Character Variation RepresentationCode0
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language ModelCode0
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?Code0
Brain-Like Language Processing via a Shallow Untrained Multihead Attention NetworkCode0
Error Analysis of using BART for Multi-Document Summarization: A Study for English and German LanguageCode0
Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language ModelingCode0
Language Models Still Struggle to Zero-shot Reason about Time SeriesCode0
When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced LanguagesCode0
Lightweight Relevance Grader in RAGCode0
Error Detection for Text-to-SQL Semantic ParsingCode0
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text GenerationCode0
A Simple Way to Initialize Recurrent Networks of Rectified Linear UnitsCode0
Language Models with Pre-Trained (GloVe) Word EmbeddingsCode0
Language Model Tokenizers Introduce Unfairness Between LanguagesCode0
Language Model Training Paradigms for Clinical Feature EmbeddingsCode0
Language Model Transformers as Evaluators for Open-domain DialoguesCode0
FlauBERT : des mod\`eles de langue contextualis\'es pr\'e-entra\^ \'es pour le fran (FlauBERT : Unsupervised Language Model Pre-training for French)Code0
Likelihood as a Performance Gauge for Retrieval-Augmented GenerationCode0
Cynical Selection of Language Model Training DataCode0
AttViz: Online exploration of self-attention for transparent neural language modelingCode0
A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language ModelCode0
Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike WaysCode0
CroissantLLM: A Truly Bilingual French-English Language ModelCode0
FASPell: A Fast, Adaptable, Simple, Powerful Chinese Spell Checker Based On DAE-Decoder ParadigmCode0
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment ClassificationCode0
ESM-NBR: fast and accurate nucleic acid-binding residue prediction via protein language model feature representation and multi-task learningCode0
Casting the Same Sentiment Classification ProblemCode0
LIMIT-BERT : Linguistics Informed Multi-Task BERTCode0
Fine-grained Contrastive Learning for Relation ExtractionCode0
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural NetworkCode0
LIMP: Large Language Model Enhanced Intent-aware Mobility PredictionCode0
Fine-Grained Emotion Prediction by Modeling Emotion DefinitionsCode0
Geographic Adaptation of Pretrained Language ModelsCode0
Locally Differentially Private Document Generation Using Zero Shot PromptingCode0
An Empirical Revisiting of Linguistic Knowledge Fusion in Language Understanding TasksCode0
Context-aware Captions from Context-agnostic SupervisionCode0
Linearized Relative Positional EncodingCode0
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language ModelsCode0
Estimating Large Language Model Capabilities without Labeled Test DataCode0
CASTILLO: Characterizing Response Length Distributions of Large Language ModelsCode0
Show:102550
← PrevPage 91 of 284Next →

No leaderboard results yet.