SOTAVerified

Language Modeling

Papers

Showing 27512775 of 14182 papers

TitleStatusHype
Long-Short Transformer: Efficient Transformers for Language and VisionCode1
Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text KernelCode1
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language ModelingCode1
XLM-E: Cross-lingual Language Model Pre-training via ELECTRACode1
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin InformationCode1
R-Drop: Regularized Dropout for Neural NetworksCode1
Stabilizing Equilibrium Models by Jacobian RegularizationCode1
SymbolicGPT: A Generative Transformer Model for Symbolic RegressionCode1
CLIP2Video: Mastering Video-Text Retrieval via Image CLIPCode1
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-modelsCode1
Golos: Russian Dataset for Speech ResearchCode1
SPBERT: An Efficient Pre-training BERT on SPARQL Queries for Question Answering over Knowledge GraphsCode1
Distributed Deep Learning in Open CollaborationsCode1
Scene Transformer: A unified architecture for predicting multiple agent trajectoriesCode1
Direction is what you need: Improving Word Embedding Compression in Large Language ModelsCode1
Incorporating External POS Tagger for Punctuation RestorationCode1
BioELECTRA:Pretrained Biomedical text Encoder using DiscriminatorsCode1
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word AlignmentCode1
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language ModelsCode1
Ultra-Fine Entity Typing with Weak Supervision from a Masked Language ModelCode1
Staircase Attention for Recurrent Processing of SequencesCode1
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared HypernetworksCode1
Top-KAST: Top-K Always Sparse TrainingCode1
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product OperatorsCode1
Luna: Linear Unified Nested AttentionCode1
Show:102550
← PrevPage 111 of 568Next →

No leaderboard results yet.