SOTAVerified

Language Modeling

Papers

Showing 13011350 of 14182 papers

TitleStatusHype
GraPPa: Grammar-Augmented Pre-Training for Table Semantic ParsingCode1
Advancing Beyond Identification: Multi-bit Watermark for Large Language ModelsCode1
An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model TokenizersCode1
CPLLM: Clinical Prediction with Large Language ModelsCode1
GraphXForm: Graph transformer for computer-aided molecular designCode1
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse GradientsCode1
Counterfactual Token Generation in Large Language ModelsCode1
GraphLLM: Boosting Graph Reasoning Ability of Large Language ModelCode1
An Efficient Self-Supervised Cross-View Training For Sentence EmbeddingCode1
An Efficient Multilingual Language Model Compression through Vocabulary TrimmingCode1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from TextCode1
Graph Neural Prompting with Large Language ModelsCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
GradInit: Learning to Initialize Neural Networks for Stable and Efficient TrainingCode1
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language ModelCode1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent CollaborationCode1
Great Memory, Shallow Reasoning: Limits of kNN-LMsCode1
Guiding Attention for Self-Supervised Learning with TransformersCode1
HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials ScienceCode1
GPT-NeoX-20B: An Open-Source Autoregressive Language ModelCode1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language ModelsCode1
Copy Is All You NeedCode1
Accurate Prediction of Antibody Function and Structure Using Bio-Inspired Antibody Language ModelCode1
GPT-too: A language-model-first approach for AMR-to-text generationCode1
Advanced Language Model-based Translator for English-Vietnamese TranslationCode1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model InfillingCode1
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language ModelsCode1
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based PolishingCode1
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model GenerationCode1
GPTCast: a weather language model for precipitation nowcastingCode1
Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales DialogueCode1
Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning DynamicsCode1
GPTailor: Large Language Model Pruning Through Layer Cutting and StitchingCode1
Control Prefixes for Parameter-Efficient Text GenerationCode1
Golos: Russian Dataset for Speech ResearchCode1
GPU-based Private Information Retrieval for On-Device Machine Learning InferenceCode1
Gloss Attention for Gloss-free Sign Language TranslationCode1
GlotScript: A Resource and Tool for Low Resource Writing System IdentificationCode1
Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree SearchCode1
Controlled Text Generation for Large Language Model with Dynamic Attribute GraphsCode1
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic FactorsCode1
Controllable Text Generation with Neurally-Decomposed OracleCode1
Controlled Text Generation as Continuous Optimization with Multiple ConstraintsCode1
Controllable Sentence Simplification with a Unified Text-to-Text Transfer TransformerCode1
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue GenerationCode1
Addressing Some Limitations of Transformers with Feedback MemoryCode1
Analyzing the Source and Target Contributions to Predictions in Neural Machine TranslationCode1
Controllable Dialogue Simulation with In-Context LearningCode1
Contrastive Vision-Language Alignment Makes Efficient Instruction LearnerCode1
AD-KD: Attribution-Driven Knowledge Distillation for Language Model CompressionCode1
Show:102550
← PrevPage 27 of 284Next →

No leaderboard results yet.