SOTAVerified

Language Modeling

Papers

Showing 33013350 of 14182 papers

TitleStatusHype
Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification TasksCode0
Making the Most of Text Semantics to Improve Biomedical Vision--Language ProcessingCode0
An LSTM Adaptation Study of (Un)grammaticalityCode0
Making Language Model a Hierarchical Classifier and GeneratorCode0
Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language ModelsCode0
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy TrainingCode0
Blockwise Self-Attention for Long Document UnderstandingCode0
Block-wise Dynamic SparsenessCode0
MADLAD-400: A Multilingual And Document-Level Large Audited DatasetCode0
Machine-generated text detection prevents language model collapseCode0
Machine-in-the-Loop Rewriting for Creative Image CaptioningCode0
Ankh: Optimized Protein Language Model Unlocks General-Purpose ModellingCode0
Macsen: A Voice Assistant for Speakers of a Lesser Resourced LanguageCode0
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of TweetsCode0
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model EditingCode0
A Framework for Adapting Human-Robot Interaction to Diverse User GroupsCode0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model CompressionCode0
LVLM-Interpret: An Interpretability Tool for Large Vision-Language ModelsCode0
BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition ModelingCode0
Blank Collapse: Compressing CTC emission for the faster decodingCode0
BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large VocabulariesCode0
An Investigation of Noise in Morphological InflectionCode0
LSTM based Conversation ModelsCode0
Black-box language model explanation by context length probingCode0
Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized RepresentationsCode0
Multi-task Pre-training Language Model for Semantic Network CompletionCode0
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoringCode0
An Invariant Learning Characterization of Controlled Text GenerationCode0
Low-rank passthrough neural networksCode0
Low-Rank Constraints for Fast Inference in Structured ModelsCode0
Low Rank Factorizations are Indirect Encodings for Deep NeuroevolutionCode0
Low-Rank RNN Adaptation for Context-Aware Language ModelingCode0
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response TheoryCode0
A Common Pitfall of Margin-based Language Model Alignment: Gradient EntanglementCode0
BIRCO: A Benchmark of Information Retrieval Tasks with Complex ObjectivesCode0
Lower Perplexity is Not Always Human-LikeCode0
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model AlignmentCode0
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity RecognitionCode0
Looking for a Handsome Carpenter! Debiasing GPT-3 Job AdvertisementsCode0
Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrievalCode0
Long Range Language Modeling via Gated State SpacesCode0
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech RecognitionCode0
Biomedical Language Models are Robust to Sub-optimal TokenizationCode0
Long Short-Term Memory-Networks for Machine ReadingCode0
Biomedical Event Extraction as Multi-turn Question AnsweringCode0
An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP)Code0
A Few-shot Approach to Resume Information Extraction via PromptsCode0
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequencesCode0
Logical Implications for Visual Question Answering ConsistencyCode0
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
Show:102550
← PrevPage 67 of 284Next →

No leaderboard results yet.