SOTAVerified

Language Modeling

Papers

Showing 80018050 of 14182 papers

TitleStatusHype
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?0
S^3: Increasing GPU Utilization during Generative Inference for Higher Throughput0
FinGPT: Open-Source Financial Large Language ModelsCode6
Simple and Controllable Music GenerationCode6
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding0
Hexatagging: Projective Dependency Parsing as TaggingCode1
K2: A Foundation Language Model for Geoscience Knowledge Understanding and UtilizationCode2
Improving Language Model Integration for Neural Machine Translation0
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding0
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for FinanceCode2
Mapping Brains with Language Models: A Survey0
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-ExpertsCode0
Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph StructuresCode0
RETA-LLM: A Retrieval-Augmented Large Language Model ToolkitCode2
Soft-prompt Tuning for Large Language Models to Evaluate Bias0
Privately generating tabular data using language modelsCode1
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization0
Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairsCode0
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions0
Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue SystemsCode0
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers0
Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering0
Benchmarking Foundation Models with Language-Model-as-an-Examiner0
Long-form analogies generated by chatGPT lack human-like psycholinguistic properties0
Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning0
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer0
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization0
Inference-Time Intervention: Eliciting Truthful Answers from a Language ModelCode2
LLMZip: Lossless Text Compression using Large Language ModelsCode1
On the Difference of BERT-style and CLIP-style Text EncodersCode1
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!Code1
TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity RecognitionCode0
A generative framework for conversational laughter: Its 'language model' and laughter sound synthesis0
Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics0
Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction0
Iterative Translation Refinement with Large Language Models0
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias0
Semantically-Prompted Language Models Improve Visual Descriptions0
AutoScrum: Automating Project Planning Using Large Language ModelsCode1
A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models0
Information Flow Control in Machine Learning through Modular Model Architecture0
CoSiNES: Contrastive Siamese Network for Entity Standardization0
Benchmarking Middle-Trained Language Models for Neural Search0
CTRL: Connect Collaborative and Language Model for CTR Prediction0
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model0
COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local SearchCode1
Improving Conversational Recommendation Systems via Counterfactual Data SimulationCode1
CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels0
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research0
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video UnderstandingCode4
Show:102550
← PrevPage 161 of 284Next →

No leaderboard results yet.