SOTAVerified

Word Embeddings

Word embedding is the collective name for a set of language modeling and feature learning techniques in natural language processing (NLP) where words or phrases from the vocabulary are mapped to vectors of real numbers.

Techniques for learning word embeddings include Word2Vec, GloVe, and other neural network-based approaches that train on an NLP task such as language modeling or document classification.
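The defining property of such embeddings is that geometric closeness in the vector space tracks semantic relatedness, which is what makes analogies like king − man + woman ≈ queen expressible as vector arithmetic. The sketch below illustrates the idea with hand-picked toy vectors (the vocabulary and values are invented for illustration; real Word2Vec or GloVe vectors are learned from large corpora and typically have 50–300 dimensions):

```python
import numpy as np

# Toy 4-dimensional "embeddings", hand-picked for illustration only.
# Real embeddings are trained, not written by hand.
embeddings = {
    "king":  np.array([0.9, 0.8, 0.1, 0.2]),
    "queen": np.array([0.9, 0.1, 0.8, 0.2]),
    "man":   np.array([0.5, 0.9, 0.0, 0.1]),
    "woman": np.array([0.5, 0.0, 0.9, 0.1]),
}

def cosine(u, v):
    """Cosine similarity: the standard closeness measure for embeddings."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def nearest(vec, vocab, exclude=()):
    """Vocabulary word whose vector has the highest cosine similarity to vec."""
    return max((w for w in vocab if w not in exclude),
               key=lambda w: cosine(vec, vocab[w]))

# The classic analogy test: king - man + woman should land near queen.
target = embeddings["king"] - embeddings["man"] + embeddings["woman"]
print(nearest(target, embeddings, exclude={"king"}))  # prints: queen
```

With trained embeddings the same arithmetic is usually done over the whole vocabulary at once (a single matrix-vector product) rather than in a Python loop.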

(Image credit: Dynamic Word Embedding for Evolving Semantic Discovery)

Papers

Showing 151–200 of 4002 papers

Title | Status | Hype
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling | Code | 1
Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases | Code | 1
“Did you really mean what you said?”: Sarcasm Detection in Hindi-English Code-Mixed Data using Bilingual Word Embeddings | Code | 1
DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency | Code | 1
Discovering and Categorising Language Biases in Reddit | Code | 1
AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages | Code | 1
Disentangling Visual Embeddings for Attributes and Objects | Code | 1
Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation | Code | 1
Embarrassingly Simple Unsupervised Aspect Extraction | Code | 1
Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media | Code | 1
Emotion-Aware Transformer Encoder for Empathetic Dialogue Generation | Code | 1
Emotion Understanding in Videos Through Body, Context, and Visual-Semantic Embedding Loss | Code | 1
AnomalyLLM: Few-shot Anomaly Edge Detection for Dynamic Graphs using Large Language Models | Code | 1
Fair Embedding Engine: A Library for Analyzing and Mitigating Gender Bias in Word Embeddings | Code | 1
FastText.zip: Compressing text classification models | Code | 1
A Source-Criticism Debiasing Method for GloVe Embeddings | Code | 1
Brain2Word: Decoding Brain Activity for Language Generation | Code | 1
Gender Bias in Contextualized Word Embeddings | Code | 1
Going Beyond T-SNE: Exposing whatlies in Text Embeddings | Code | 1
Applying Occam's Razor to Transformer-Based Dependency Parsing: What Works, What Doesn't, and What is Really Necessary | Code | 1
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training | Code | 1
GREEK-BERT: The Greeks visiting Sesame Street | Code | 1
HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings | Code | 1
Homophone Reveals the Truth: A Reality Check for Speech2Vec | Code | 1
ALL-IN-1: Short Text Classification with One Model for All Languages | Code | 1
Zero-Shot Semantic Segmentation | Code | 1
All Word Embeddings from One Embedding | Code | 1
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Code | 1
Improving word mover's distance by leveraging self-attention matrix | Code | 1
Improving Word Translation via Two-Stage Contrastive Learning | Code | 1
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost | Code | 1
iNLTK: Natural Language Toolkit for Indic Languages | Code | 1
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data | Code | 1
IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations | Code | 1
Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics | Code | 1
Keyword-Guided Neural Conversational Model | Code | 1
Corrected CBOW Performs as well as Skip-gram | Code | 1
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation | Code | 1
Language Models Implement Simple Word2Vec-style Vector Arithmetic | Code | 1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Code | 1
LingJing at SemEval-2022 Task 1: Multi-task Self-supervised Pre-training for Multilingual Reverse Dictionary | Code | 1
GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Code | 1
Machine learning as a model for cultural learning: Teaching an algorithm what it means to be fat | Code | 1
MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation | Code | 1
MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models | Code | 1
Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition | Code | 1
MorphTE: Injecting Morphology in Tensorized Embeddings | Code | 1
Multilingual Jointly Trained Acoustic and Written Word Embeddings | Code | 1
Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task | Code | 1
Page 4 of 81

No leaderboard results yet.