Language Modeling

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7951–8000 of 14182 papers

Title	Date	Tasks	Status	Hype
Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness	Jun 16, 2023	Distributed OptimizationLanguage Modeling	CodeCode Available	1
AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology	Jun 16, 2023	Language ModelingLanguage Modelling	—Unverified	0
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation	Jun 16, 2023	DiagnosticLanguage Modeling	—Unverified	0
FALL-E: A Foley Sound Synthesis Model and Strategies	Jun 16, 2023	DiversityLanguage Modeling	CodeCode Available	1
Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions	Jun 16, 2023	Language ModelingLanguage Modelling	—Unverified	0
Process Knowledge-infused Learning for Clinician-friendly Explanations	Jun 16, 2023	DiagnosticExplainable Artificial Intelligence (XAI)	—Unverified	0
Inspire creativity with ORIBA: Transform Artists' Original Characters into Chatbots through Large Language Model	Jun 16, 2023	ChatbotLanguage Modeling	—Unverified	0
CMLM-CSE: Based on Conditional MLM Contrastive Learning for Sentence Embeddings	Jun 16, 2023	Contrastive LearningLanguage Modeling	—Unverified	0
ChessGPT: Bridging Policy Learning and Language Modeling	Jun 15, 2023	Decision MakingLanguage Modeling	CodeCode Available	1
Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models	Jun 15, 2023	Electrical EngineeringFew-Shot Learning	—Unverified	0
Block-State Transformers	Jun 15, 2023	Language ModelingLanguage Modelling	—Unverified	0
Distillation Strategies for Discriminative Speech Recognition Rescoring	Jun 15, 2023	Language ModelingLanguage Modelling	—Unverified	0
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation	Jun 15, 2023	Automatic Speech RecognitionClustering	CodeCode Available	1
Personalized Image Enhancement Featuring Masked Style Modeling	Jun 15, 2023	Image EnhancementLanguage Modeling	CodeCode Available	0
Mapping Researcher Activity based on Publication Data by means of Transformers	Jun 15, 2023	Language ModelingLanguage Modelling	—Unverified	0
Can ChatGPT pass the Vietnamese National High School Graduation Examination?	Jun 15, 2023	Language ModelingLanguage Modelling	—Unverified	0
Neural models for Factual Inconsistency Classification with Explanations	Jun 15, 2023	8kClassification	CodeCode Available	0
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration	Jun 15, 2023	Language ModelingLanguage Modelling	CodeCode Available	3
Generate to Understand for Representation	Jun 14, 2023	Contrastive LearningGPU	CodeCode Available	1
Revealing the structure of language model capabilities	Jun 14, 2023	Language ModelingLanguage Modelling	CodeCode Available	0
CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration	Jun 14, 2023	AttributeLanguage Modeling	—Unverified	0
Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models	Jun 14, 2023	DecoderLanguage Modeling	—Unverified	0
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models	Jun 14, 2023	Grounded Open Vocabulary AcquisitionLanguage Modeling	CodeCode Available	1
Radiology-GPT: A Large Language Model for Radiology	Jun 14, 2023	Language ModelingLanguage Modelling	—Unverified	0
Large-scale Language Model Rescoring on Long-form Data	Jun 13, 2023	FormLanguage Modeling	—Unverified	0
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent	Jun 13, 2023	Decision MakingLanguage Modeling	—Unverified	0
PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-based Prosody Modeling	Jun 13, 2023	Language ModelingLanguage Modelling	—Unverified	0
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation	Jun 13, 2023	Language ModelingLanguage Modelling	CodeCode Available	4
Tokenization with Factorized Subword Encoding	Jun 13, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences	Jun 13, 2023	Language ModelingLanguage Modelling	CodeCode Available	3
NoCoLA: The Norwegian Corpus of Linguistic Acceptability	Jun 13, 2023	Binary ClassificationDiagnostic	CodeCode Available	0
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models	Jun 13, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Augmenting Language Models with Long-Term Memory	Jun 12, 2023	FormIn-Context Learning	—Unverified	0
EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing	Jun 12, 2023	Language ModelingLanguage Modelling	—Unverified	0
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts	Jun 12, 2023	ClassificationLanguage Modeling	CodeCode Available	1
Large language models and (non-)linguistic recursion	Jun 12, 2023	Language ModelingLanguage Modelling	—Unverified	0
Weakly supervised information extraction from inscrutable handwritten document images	Jun 12, 2023	Language ModelingLanguage Modelling	—Unverified	0
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions	Jun 12, 2023	Language ModelingLanguage Modelling	—Unverified	0
Valley: Video Assistant with Large Language model Enhanced abilitY	Jun 12, 2023	Action RecognitionInstruction Following	CodeCode Available	2
Gradient Ascent Post-training Enhances Language Model Generalization	Jun 12, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search	Jun 11, 2023	Domain AdaptationLanguage Modeling	CodeCode Available	1
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model	Jun 11, 2023	General KnowledgeKnowledge Distillation	CodeCode Available	1
RoBERTweet: A BERT Language Model for Romanian Tweets	Jun 11, 2023	Language IdentificationLanguage Modeling	—Unverified	0
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method	Jun 11, 2023	Knowledge DistillationLanguage Modeling	CodeCode Available	1
Language-Guided Traffic Simulation via Scene-Level Diffusion	Jun 10, 2023	Language ModelingLanguage Modelling	—Unverified	0
Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC	Jun 10, 2023	Language ModelingLanguage Modelling	—Unverified	0
Large Language Models Are Semi-Parametric Reinforcement Learning Agents	Jun 9, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon	Jun 9, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Language Models Can Learn Exceptions to Syntactic Rules	Jun 9, 2023	Language ModelingLanguage Modelling	CodeCode Available	0
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?	Jun 9, 2023	Adversarial TextLanguage Modeling	—Unverified	0

Show:10 25 50

← PrevPage 160 of 284Next →

No leaderboard results yet.