SOTAVerified

Language Modeling

Papers

Showing 42514300 of 14182 papers

TitleStatusHype
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector AttentionCode0
FiSSA at SemEval-2020 Task 9: Fine-tuned For FeelingsCode0
ArthModel: Enhance Arithmetic Skills to Large Language ModelCode0
Data Noising as Smoothing in Neural Network Language ModelsCode0
Letter-Based Speech Recognition with Gated ConvNetsCode0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law DatasetCode0
A Common Pitfall of Margin-based Language Model Alignment: Gradient EntanglementCode0
Detection of depression on social networks using transformers and ensemblesCode0
Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry GenerationCode0
Boosting Disfluency Detection with Large Language Model as Disfluency GeneratorCode0
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and GenerationCode0
ACL Ready: RAG Based Assistant for the ACL ChecklistCode0
Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language ModelsCode0
An Empirical Evaluation of Word Embedding Models for Subjectivity Analysis TasksCode0
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal TokensCode0
SUPP.AI: Finding Evidence for Supplement-Drug InteractionsCode0
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even PerformanceCode0
Looking for a Handsome Carpenter! Debiasing GPT-3 Job AdvertisementsCode0
Boosting Large Language Models with Mask Fine-TuningCode0
AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction GameCode0
Gated Word-Character Recurrent Language ModelCode0
Go Forth and Prosper: Language Modeling with Ancient Textual HistoryCode0
Goal-Oriented Script ConstructionCode0
Enhancing Content-based Recommendation via Large Language ModelCode0
Leveraging Domain Knowledge for Inclusive and Bias-aware Humanitarian Response Entry ClassificationCode0
Enhancing Crisis-Related Tweet Classification with Entity-Masked Language Modeling and Multi-Task LearningCode0
An Empirical Study of Language CNN for Image CaptioningCode0
AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain RecommendationsCode0
Detection of circular permutations by Protein Language ModelsCode0
Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class ClassificationCode0
Anchor Points: Benchmarking Models with Much Fewer ExamplesCode0
Fraternal DropoutCode0
L-MAGIC: Language Model Assisted Generation of Images with CoherenceCode0
Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual TemplatesCode0
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent SystemsCode0
Composer Style Classification of Piano Sheet Music Images Using Language Model PretrainingCode0
Boosting Zero-Shot Human-Object Interaction Detection with Vision-Language TransferCode0
CovidLLM: A Robust Large Language Model with Missing Value Adaptation and Multi-Objective Learning Strategy for Predicting Disease Severity and Clinical Outcomes in COVID-19 PatientsCode0
Composing Byte-Pair Encodings for Morphological Sequence ClassificationCode0
Deep-FSMN for Large Vocabulary Continuous Speech RecognitionCode0
CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial DesignCode0
Building Language Models for Text with Named EntitiesCode0
Enhancing Domain Word Embedding via Latent Semantic ImputationCode0
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-TuningCode0
Fine-tuning the ESM2 protein language model to understand the functional impact of missense variantsCode0
Enhancing elusive clues in knowledge learning by contrasting attention of language modelsCode0
Composing Structure-Aware Batches for Pairwise Sentence ClassificationCode0
CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin EvaluationCode0
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System ResponsesCode0
Show:102550
← PrevPage 86 of 284Next →

No leaderboard results yet.