SOTAVerified

Language Modeling

Papers

Showing 13951-14000 of 14182 papers

Title | Status | Hype
Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model | Code | 0
PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games | Code | 0
Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions | Code | 0
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Code | 0
Patterns versus Characters in Subword-aware Neural Language Modeling | Code | 0
Recurrent Additive Networks | Code | 0
Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Code | 0
Variational Autoencoders for Collaborative Filtering | Code | 0
Recoding latent sentence representations -- Dynamic gradient-based activation modification in RNNs | Code | 0
Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model | Code | 0
Multi-Grained Patch Training for Efficient LLM-based Recommendation | Code | 0
Partially Shuffling the Training Data to Improve Language Models | Code | 0
TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property Prediction | Code | 0
YellowFin and the Art of Momentum Tuning | Code | 0
Neural Text Generation from Structured Data with Application to the Biography Domain | Code | 0
MolXPT: Wrapping Molecules with Text for Generative Pre-training | Code | 0
ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction | Code | 0
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | Code | 0
Reasoning-Grounded Natural Language Explanations for Language Models | Code | 0
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Code | 0
Multimodal data matters: language model pre-training over structured and unstructured electronic health records | Code | 0
Neural spell-checker: Beyond words with synthetic data generation | Code | 0
Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Pretrained Language Model | Code | 0
The Hidden Space of Transformer Language Adapters | Code | 0
Women Are Beautiful, Men Are Leaders: Gender Stereotypes in Machine Translation and Language Modeling | Code | 0
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Code | 0
word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method | Code | 0
Neural Sign Language Translation | Code | 0
Vaxformer: Antigenicity-controlled Transformer for Vaccine Design Against SARS-CoV-2 | Code | 0
Parsing as Language Modeling | Code | 0
The Impact of Element Ordering on LM Agent Performance | Code | 0
TypedThinker: Typed Thinking Improves Large Language Model Reasoning | Code | 0
Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models | Code | 0
Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings | Code | 0
The impact of responding to patient messages with large language model assistance | Code | 0
The implementation of a Deep Recurrent Neural Network Language Model on a Xilinx FPGA | Code | 0
The Importance of Being Recurrent for Modeling Hierarchical Structure | Code | 0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Code | 0
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Code | 0
Neural Shuffle-Exchange Networks - Sequence Processing in O(n log n) Time | Code | 0
UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation | Code | 0
Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation | Code | 0
We're Calling an Intervention: Exploring Fundamental Hurdles in Adapting Language Models to Nonstandard Text | Code | 0
The Influence of Context on Sentence Acceptability Judgements | Code | 0
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination | Code | 0
RealHarm: A Collection of Real-World Language Model Application Failures | Code | 0
UBERT: A Novel Language Model for Synonymy Prediction at Scale in the UMLS Metathesaurus | Code | 0
Neural Shuffle-Exchange Networks -- Sequence Processing in O(n log n) Time | Code | 0
Neural Scaling Laws Rooted in the Data Distribution | Code | 0
MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training | Code | 0
Page 280 of 284

No leaderboard results yet.