SOTAVerified

Language Modeling

Papers

Showing 1501-1550 of 14182 papers

Title | Status | Hype
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification | Code | 1
CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Code | 1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Code | 1
Counterfactual Token Generation in Large Language Models | Code | 1
CPM: A Large-scale Generative Chinese Pre-trained Language Model | Code | 1
Housekeep: Tidying Virtual Households using Commonsense Reasoning | Code | 1
AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling | Code | 1
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model | Code | 1
CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Code | 1
CycleFormer : TSP Solver Based on Language Modeling | Code | 1
Counterfactual Data Augmentation for Neural Machine Translation | Code | 1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Code | 1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Code | 1
DALE: Generative Data Augmentation for Low-Resource Legal NLP | Code | 1
hmBERT: Historical Multilingual Language Models for Named Entity Recognition | Code | 1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Code | 1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Code | 1
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Code | 1
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues | Code | 1
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Code | 1
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Code | 1
Data Augmentation using Pre-trained Transformer Models | Code | 1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Code | 1
Data Efficient Masked Language Modeling for Vision and Language | Code | 1
Hierarchical Transformers Are More Efficient Language Models | Code | 1
High-Dimension Human Value Representation in Large Language Models | Code | 1
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Code | 1
Debiasing Methods in Natural Language Understanding Make Bias More Accessible | Code | 1
cosFormer: Rethinking Softmax in Attention | Code | 1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Code | 1
CPT: Efficient Deep Neural Network Training via Cyclic Precision | Code | 1
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Code | 1
Autonomous Microscopy Experiments through Large Language Model Agents | Code | 1
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Code | 1
A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER | Code | 1
Enhancing Monocular 3D Scene Completion with Diffusion Model | Code | 1
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Code | 1
DistilProtBert: A distilled protein language model used to distinguish between real proteins and their randomly shuffled counterparts | Code | 1
Decoding Speculative Decoding | Code | 1
Copy Is All You Need | Code | 1
History Matters: Temporal Knowledge Editing in Large Language Model | Code | 1
Decoding-Time Language Model Alignment with Multiple Objectives | Code | 1
How does the pre-training objective affect what large language models learn about linguistic properties? | Code | 1
Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Code | 1
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Code | 1
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure | Code | 1
Deep Equilibrium Models | Code | 1
Improving NER's Performance with Massive financial corpus | Code | 1
Automated Spinal MRI Labelling from Reports Using a Large Language Model | Code | 1
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Code | 1

No leaderboard results yet.