
Word Embeddings

Word embedding is the collective name for a set of language modeling and feature learning techniques in natural language processing (NLP) where words or phrases from the vocabulary are mapped to vectors of real numbers.

Techniques for learning word embeddings include Word2Vec, GloVe, and neural network-based approaches that train on an NLP task such as language modeling or document classification.
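As a rough illustration of what mapping words to vectors of real numbers looks like in practice, the sketch below stores a tiny embedding table and compares words by cosine similarity. The three-dimensional vectors, the `EMBEDDINGS` table, and the `most_similar` helper are all hypothetical, hand-picked for illustration; they were not produced by Word2Vec, GloVe, or any trained model.

```python
import math

# Toy, hand-picked 3-dimensional vectors (hypothetical values, not learned
# by Word2Vec, GloVe, or any real model). Real embeddings typically have
# tens to hundreds of dimensions.
EMBEDDINGS = {
    "king": [0.8, 0.6, 0.1],
    "queen": [0.7, 0.7, 0.1],
    "apple": [0.1, 0.2, 0.9],
}


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(word, embeddings=EMBEDDINGS):
    """Return the other vocabulary word whose vector is closest by cosine."""
    query = embeddings[word]
    candidates = (w for w in embeddings if w != word)
    return max(candidates, key=lambda w: cosine_similarity(query, embeddings[w]))


if __name__ == "__main__":
    print(most_similar("king"))
```

With these assumed values, "king" and "queen" point in nearly the same direction while "apple" does not, which is the kind of geometric relationship a learned embedding is meant to capture.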

(Image credit: Dynamic Word Embedding for Evolving Semantic Discovery)

Papers

Showing 1-50 of 4002 papers

| Title | Status | Hype |
| --- | --- | --- |
| Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models | Code | 8 |
| CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models | Code | 3 |
| Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling | Code | 2 |
| ConceptNet at SemEval-2017 Task 2: Extending Word Embeddings with Multilingual Relational Knowledge | Code | 2 |
| ConceptNet 5.5: An Open Multilingual Graph of General Knowledge | Code | 2 |
| FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | Code | 2 |
| WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering | Code | 2 |
| RETVec: Resilient and Efficient Text Vectorizer | Code | 2 |
| A Pilot Study for Chinese SQL Semantic Parsing | Code | 2 |
| Contextual Semantic Embeddings for Ontology Subsumption Prediction | Code | 2 |
| VNLP: Turkish NLP Package | Code | 2 |
| An Ensemble Method to Produce High-Quality Word Embeddings (2016) | Code | 2 |
| Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Code | 2 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Code | 1 |
| Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation | Code | 1 |
| Cycle Text-To-Image GAN with BERT | Code | 1 |
| Cooperative Self-training of Machine Reading Comprehension | Code | 1 |
| Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task | Code | 1 |
| Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph | Code | 1 |
| Data Mining in Clinical Trial Text: Transformers for Classification and Question Answering Tasks | Code | 1 |
| Compositional Demographic Word Embeddings | Code | 1 |
| Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction | Code | 1 |
| AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages | Code | 1 |
| Context-aware Feature Generation for Zero-shot Semantic Segmentation | Code | 1 |
| comp-syn: Perceptually Grounded Word Embeddings with Color | Code | 1 |
| Contextual Word Representations: A Contextual Introduction | Code | 1 |
| ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training | Code | 1 |
| Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation | Code | 1 |
| All Word Embeddings from One Embedding | Code | 1 |
| ALL-IN-1: Short Text Classification with One Model for All Languages | Code | 1 |
| CODER: Knowledge infused cross-lingual medical term embedding for term normalization | Code | 1 |
| Can a Fruit Fly Learn Word Embeddings? | Code | 1 |
| FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations | Code | 1 |
| Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection | Code | 1 |
| BERT for Monolingual and Cross-Lingual Reverse Dictionary | Code | 1 |
| Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models | Code | 1 |
| BERT Goes Shopping: Comparing Distributional Models for Product Representations | Code | 1 |
| A Source-Criticism Debiasing Method for GloVe Embeddings | Code | 1 |
| A Comprehensive Analysis of Static Word Embeddings for Turkish | Code | 1 |
| Backpack Language Models | Code | 1 |
| Circumventing Concept Erasure Methods For Text-to-Image Generative Models | Code | 1 |
| Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network | Code | 1 |
| Zero-Shot Semantic Segmentation | Code | 1 |
| ADEPT: A DEbiasing PrompT Framework | Code | 1 |
| Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading | Code | 1 |
| Compass-aligned Distributional Embeddings for Studying Semantic Differences across Corpora | Code | 1 |
| Affective and Contextual Embedding for Sarcasm Detection | Code | 1 |
| Adversarial Training Methods for Semi-Supervised Text Classification | Code | 1 |
| Conditional probing: measuring usable information beyond a baseline | Code | 1 |
| GLOW: Global Weighted Self-Attention Network for Web Search | Code | 1 |

No leaderboard results yet.