SOTAVerified

Language Modeling

Papers

Showing 1205112100 of 14182 papers

TitleStatusHype
Improve Language Model and Brain Alignment via Associative MemoryCode0
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM CompressionCode0
ETHAN at SemEval-2020 Task 5: Modelling Causal Reasoning inLanguage using neuro-symbolic cloud computingCode0
Estimating Large Language Model Capabilities without Labeled Test DataCode0
Beyond Ontology in Dialogue State Tracking for Goal-Oriented ChatbotCode0
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural NetworkCode0
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical SemanticsCode0
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LMCode0
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language ModelCode0
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position SelectorCode0
Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD VariantsCode0
ArNLI: Arabic Natural Language Inference for Entailment and Contradiction DetectionCode0
Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language ModelsCode0
Are VLMs Really BlindCode0
FASPell: A Fast, Adaptable, Simple, Powerful Chinese Spell Checker Based On DAE-Decoder ParadigmCode0
Are Some Words Worth More than Others?Code0
ESM-NBR: fast and accurate nucleic acid-binding residue prediction via protein language model feature representation and multi-task learningCode0
Error Detection for Text-to-SQL Semantic ParsingCode0
Beyond Language: Learning Commonsense from Images for ReasoningCode0
Confidential Prompting: Protecting User Prompts from Cloud LLM ProvidersCode0
Unipa-GPT: Large Language Models for university-oriented QA in ItalianCode0
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model GeneralizationCode0
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language ModelCode0
Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model DecodingCode0
Error Analysis of using BART for Multi-Document Summarization: A Study for English and German LanguageCode0
Leveraging Domain Knowledge for Inclusive and Bias-aware Humanitarian Response Entry ClassificationCode0
A Recurrent BERT-based Model for Question GenerationCode0
A Semi-Supervised Approach for Low-Resourced Text GenerationCode0
ERNIE-Doc: A Retrospective Long-Document Modeling TransformerCode0
A Dutch Financial Large Language ModelCode0
Conditionally Learn to Pay Attention for Sequential Visual TaskCode0
ERASMO: Leveraging Large Language Models for Enhanced Clustering SegmentationCode0
Environmental large language model Evaluation (ELLE) dataset: A Benchmark for Evaluating Generative AI applications in Eco-environment DomainCode0
Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long SequencesCode0
Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User ControlCode0
Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text CorrespondenceCode0
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade DirectoriesCode0
Fast-Slow Recurrent Neural NetworksCode0
Large Memory Layers with Product KeysCode0
Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix TreesCode0
FASTSUBS: An Efficient and Exact Procedure for Finding the Most Likely Lexical Substitutes Based on an N-gram Language ModelCode0
Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation ExtractionCode0
A Content-Based Novelty Measure for Scholarly Publications: A Proof of ConceptCode0
A Reality Check on Context Utilisation for Retrieval-Augmented GenerationCode0
Entity at SemEval-2021 Task 5: Weakly Supervised Token Labelling for Toxic Spans DetectionCode0
Fast Training of Recurrent Neural Networks with Stationary State FeedbacksCode0
Fast transcription of speech in low-resource languagesCode0
FastTrees: Parallel Latent Tree-Induction for Faster Sequence EncodingCode0
Entities as Experts: Sparse Memory Access with Entity SupervisionCode0
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-TuningCode0
Show:102550
← PrevPage 242 of 284Next →

No leaderboard results yet.