SOTAVerified

Language Modeling

Papers

Showing 9511000 of 14182 papers

TitleStatusHype
Towards Universal Fake Image Detectors that Generalize Across Generative ModelsCode2
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and BenchmarkCode2
Simple Hardware-Efficient Long Convolutions for Sequence ModelingCode2
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQLCode2
Accelerating Large Language Model Decoding with Speculative SamplingCode2
In-Context Retrieval-Augmented Language ModelsCode2
Grounding Language Models to Images for Multimodal Inputs and OutputsCode2
Editing Language Model-based Knowledge Graph EmbeddingsCode2
Adapting a Language Model While Preserving its General KnowledgeCode2
Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsCode2
SODA: Million-scale Dialogue Distillation with Social Commonsense ContextualizationCode2
A Length-Extrapolatable TransformerCode2
Precise Zero-Shot Dense Retrieval without Relevance LabelsCode2
DiffusionBERT: Improving Generative Masked Language Models with Diffusion ModelsCode2
CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text LabelsCode2
Ignore Previous Prompt: Attack Techniques For Language ModelsCode2
LERT: A Linguistically-motivated Pre-trained Language ModelCode2
When Language Model Meets Private LibraryCode2
Contrastive Decoding: Open-ended Text Generation as OptimizationCode2
Retrieval Oriented Masking Pre-training Language Model for Dense Passage RetrievalCode2
Contrastive Search Is What You Need For Neural Text GenerationCode2
TabLLM: Few-shot Classification of Tabular Data with Large Language ModelsCode2
Deep Bidirectional Language-Knowledge Graph PretrainingCode2
Re3: Generating Longer Stories With Recursive Reprompting and RevisionCode2
Mass-Editing Memory in a TransformerCode2
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal ShiftsCode2
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language UnderstandingCode2
LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph EmbeddingsCode2
Mega: Moving Average Equipped Gated AttentionCode2
Generate rather than Retrieve: Large Language Models are Strong Context GeneratorsCode2
T-NER: An All-Round Python Library for Transformer-based Named Entity RecognitionCode2
Atlas: Few-shot Learning with Retrieval Augmented Language ModelsCode2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelCode2
Language Model CascadesCode2
Recurrent Memory TransformerCode2
Scene Text Recognition with Permuted Autoregressive Sequence ModelsCode2
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and ActionCode2
Accurate RNA 3D structure prediction using a language model-based deep learning approachCode2
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022Code2
Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022Code2
BigBIO: A Framework for Data-Centric Biomedical Natural Language ProcessingCode2
Solving Quantitative Reasoning Problems with Language ModelsCode2
TEVR: Improving Speech Recognition by Token Entropy Variance ReductionCode2
Mining Error Templates for Grammatical Error CorrectionCode2
GODEL: Large-Scale Pre-Training for Goal-Directed DialogCode2
Revealing Single Frame Bias for Video-and-Language LearningCode2
GIT: A Generative Image-to-text Transformer for Vision and LanguageCode2
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-EncoderCode2
A Generalist AgentCode2
Symphony Generation with Permutation Invariant Language ModelCode2
Show:102550
← PrevPage 20 of 284Next →

No leaderboard results yet.