Language Modeling

Papers

Showing 1601–1650 of 14182 papers

Title	Date	Tasks	Status	Hype	Score
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities	Aug 23, 2024	Language Modeling, Language Modelling	Code Available	1	5
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization	May 5, 2025	Diversity, Language Modeling	Code Available	1	5
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling	Nov 23, 2021	Image Captioning, Image Description	Code Available	1	5
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation	Jan 6, 2025	Language Model Evaluation, Language Modeling	Code Available	1	5
CPT: Efficient Deep Neural Network Training via Cyclic Precision	Jan 25, 2021	Language Modeling, Language Modelling	Code Available	1	5
Crafting Large Language Models for Enhanced Interpretability	Jul 5, 2024	Language Modeling, Language Modelling	Code Available	1	5
HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs	Feb 11, 2024	Inductive Bias, Language Modeling	Code Available	1	5
iBOT: Image BERT Pre-Training with Online Tokenizer	Nov 15, 2021	image-classification, Image Classification	Code Available	1	5
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining	May 20, 2023	Extractive Summarization, Knowledge Distillation	Code Available	1	5
Balanced Data Sampling for Language Model Training with Clustering	Feb 22, 2024	Clustering, Language Modeling	Code Available	1	5
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation	Oct 21, 2020	Language Modeling, Language Modelling	Code Available	1	5
Discovering Autoregressive Orderings with Variational Inference	Jan 1, 2021	Code Generation, Image Captioning	Code Available	1	5
Addressing Some Limitations of Transformers with Feedback Memory	Feb 21, 2020	Language Modeling, Language Modelling	Code Available	1	5
Improved training of end-to-end attention models for speech recognition	May 8, 2018	Language Modeling, Language Modelling	Code Available	1	5
Counterfactual Token Generation in Large Language Models	Sep 25, 2024	Bias Detection, counterfactual	Code Available	1	5
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text	Jul 15, 2023	Language Modeling, Language Modelling	Code Available	1	5
Hybrid Ranking Network for Text-to-SQL	Aug 11, 2020	Language Modeling, Language Modelling	Code Available	1	5
Discrete Flows: Invertible Generative Models of Discrete Data	May 24, 2019	Language Modeling, Language Modelling	Code Available	1	5
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models	Sep 23, 2023	Code Completion, Hallucination	Code Available	1	5
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla	Jan 1, 2021	Document Classification, Language Modeling	Code Available	1	5
Counterfactual Data Augmentation for Neural Machine Translation	Jun 1, 2021	counterfactual, Data Augmentation	Code Available	1	5
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech	Jul 19, 2021	Language Modeling, Language Modelling	Code Available	1	5
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models	Oct 15, 2024	Language Modeling, Language Modelling	Code Available	1	5
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla	May 23, 2022	Conditional Text Generation, Dialogue Generation	Code Available	1	5
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution	Jun 3, 2021	Abstractive Text Summarization, Decoder	Code Available	1	5
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training	Sep 10, 2021	Language Modeling, Language Modelling	Code Available	1	5
CPLLM: Clinical Prediction with Large Language Models	Sep 20, 2023	Disease Prediction, Language Modeling	Code Available	1	5
Human Language Modeling	May 10, 2022	Age Estimation, Language Modeling	Code Available	1	5
Hydra: A System for Large Multi-Model Deep Learning	Oct 16, 2021	Deep Learning, GPU	Code Available	1	5
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model	May 1, 2024	Knowledge Distillation, Language Modeling	Code Available	1	5
DocSCAN: Unsupervised Text Classification via Learning from Neighbors	May 9, 2021	Classification, Clustering	Code Available	1	5
Distilling Linguistic Context for Language Model Compression	Sep 17, 2021	Knowledge Distillation, Language Modeling	Code Available	1	5
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR	Aug 9, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application	May 7, 2024	Collaborative Filtering, Language Modeling	Code Available	1	5
HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification	Apr 28, 2022	Classification, Language Modeling	Code Available	1	5
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression	Oct 2, 2024	Language Modeling, Language Modelling	Code Available	1	5
Batch Prompting: Efficient Inference with Large Language Model APIs	Jan 19, 2023	Arithmetic Reasoning, In-Context Learning	Code Available	1	5
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference	Jun 25, 2024	Language Modeling, Language Modelling	Code Available	1	5
cosFormer: Rethinking Softmax in Attention	Feb 17, 2022	D4RL, Language Modeling	Code Available	1	5
Distributed Deep Learning in Open Collaborations	Jun 18, 2021	Deep Learning, Language Modeling	Code Available	1	5
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference	May 23, 2024	Language Modeling, Language Modelling	Code Available	1	5
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology	Apr 15, 2024	Cross-Modal Retrieval, Language Modeling	Code Available	1	5
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling	Oct 22, 2022	Abstractive Text Summarization, Language Modeling	Code Available	1	5
Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences	Dec 10, 2024	Bayesian Optimization, Language Modeling	Code Available	1	5
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints	May 29, 2024	Diversity, Language Modeling	Code Available	1	5
Knowledge Graph Generation From Text	Nov 18, 2022	Graph Generation, Joint Entity and Relation Extraction	Code Available	1	5
How well can a large language model explain business processes as perceived by users?	Jan 23, 2024	Hallucination, Language Modeling	Code Available	1	5
Copy Is All You Need	Jul 13, 2023	All, Domain Adaptation	Code Available	1	5
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents	Nov 6, 2023	Decision Making, Language Modeling	Code Available	1	5
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models	Feb 20, 2025	Blocking, Language Modeling	Code Available	1	5

