SOTAVerified

Language Modeling

Papers

Showing 9451-9500 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Introspective Tips: Large Language Model for In-Context Decision Making | | 0 |
| Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis | Code | 0 |
| Extending Memory for Language Modelling | | 0 |
| Eye-SpatialNet: Spatial Information Extraction from Ophthalmology Notes | | 0 |
| From Alignment to Entailment: A Unified Textual Entailment Framework for Entity Alignment | Code | 0 |
| Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling | Code | 0 |
| Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast | | 0 |
| Generalized Multiple Intent Conditioned Slot Filling | | 0 |
| Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings | | 0 |
| How does the task complexity of masked pretraining objectives affect downstream performance? | Code | 0 |
| Vaxformer: Antigenicity-controlled Transformer for Vaccine Design Against SARS-CoV-2 | Code | 0 |
| MolXPT: Wrapping Molecules with Text for Generative Pre-training | Code | 0 |
| Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues | Code | 0 |
| Controllable Speaking Styles Using a Large Language Model | | 0 |
| Token-wise Decomposition of Autoregressive Language Model Hidden States for Analyzing Model Predictions | Code | 0 |
| Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability | | 0 |
| SLiC-HF: Sequence Likelihood Calibration with Human Feedback | | 0 |
| Generation of 3D Molecules in Pockets via Language Model | | 0 |
| CageViT: Convolutional Activation Guided Efficient Vision Transformer | | 0 |
| Language Model Tokenizers Introduce Unfairness Between Languages | Code | 0 |
| Application-Agnostic Language Modeling for On-Device ASR | | 0 |
| Towards Unifying Multi-Lingual and Cross-Lingual Summarization | | 0 |
| Natural Language Decomposition and Interpretation of Complex Utterances | | 0 |
| NeuSTIP: A Novel Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs | | 0 |
| DarkBERT: A Language Model for the Dark Side of the Internet | | 0 |
| A Language Model of Java Methods with Train/Test Deduplication | Code | 0 |
| Scalable Educational Question Generation with Pre-trained Language Models | Code | 0 |
| Two-in-One: A Model Hijacking Attack Against Text Generation Models | | 0 |
| MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers | | 0 |
| Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation | Code | 0 |
| Using Language Models to Detect Alarming Student Responses | | 0 |
| Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning | | 0 |
| Detecting Idiomatic Multiword Expressions in Clinical Terminology using Definition-Based Representation Learning | | 0 |
| How Good are Commercial Large Language Models on African Languages? | | 0 |
| Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts | Code | 0 |
| Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach | | 0 |
| Masked Audio Text Encoders are Effective Multi-Modal Rescorers | | 0 |
| Privacy-Preserving Prompt Tuning for Large Language Model Services | | 0 |
| Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge | Code | 0 |
| Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer | | 0 |
| Enriching language models with graph-based context information to better understand textual data | Code | 0 |
| LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM | | 0 |
| A Taxonomy of Foundation Model based Systems through the Lens of Software Architecture | | 0 |
| Effects of sub-word segmentation on performance of transformer language models | | 0 |
| Detection of depression on social networks using transformers and ensembles | Code | 0 |
| Large Language Model Programs | | 0 |
| DeepTextMark: A Deep Learning-Driven Text Watermarking Approach for Identifying Large Language Model Generated Text | Code | 0 |
| Estimating related words computationally using language model from the Mahabharata - an Indian epic | | 0 |
| PLM-GNN: A Webpage Classification Method based on Joint Pre-trained Language Model and Graph Neural Network | | 0 |
| Towards an Automatic Optimisation Model Generator Assisted with Generative Pre-trained Transformer | | 0 |
Page 190 of 284

No leaderboard results yet.