SOTAVerified

Language Modeling

Papers

Showing 71017150 of 14182 papers

TitleStatusHype
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small ModelsCode1
GenDistiller: Distilling Pre-trained Language Models based on Generative Models0
FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular DataCode1
Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models0
Ask Language Model to Clean Your Noisy Translation Data0
Enhancing Zero-Shot Crypto Sentiment with Fine-tuned Language Model and Prompt Engineering0
MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language ModelCode1
LASER: Linear Compression in Wireless Distributed Optimization0
Reliable Academic Conference Question Answering: A Study Based on Large Language ModelCode0
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer0
Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks0
Label-Aware Automatic Verbalizer for Few-Shot Text Classification0
Knowledge-Augmented Language Model VerificationCode1
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal AdapterCode1
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing0
Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond0
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language ModelCode0
Named Entity Recognition for Monitoring Plant Health Threats in Tweets: a ChouBERT Approach0
ICU: Conquering Language Barriers in Vision-and-Language Modeling by Dividing the Tasks into Image Captioning and Language UnderstandingCode0
GestureGPT: Toward Zero-Shot Free-Form Hand Gesture Understanding with Large Language Model AgentsCode0
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems0
Large Language Model for Multi-objective Evolutionary OptimizationCode1
Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative EditingCode1
CLAIR: Evaluating Image Captions with Large Language Models0
Character-level Chinese Backpack Language ModelsCode1
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization ProblemsCode0
TabuLa: Harnessing Language Models for Tabular Data SynthesisCode1
Data Augmentations for Improved (Large) Language Model Generalization0
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based ArchitectureCode2
Solving the multiplication problem of a large language model system using a graph-based method0
Preference Optimization for Molecular Language ModelsCode0
Document-Level Language Models for Machine Translation0
Pseudointelligence: A Unifying Framework for Language Model Evaluation0
Harnessing Dataset Cartography for Improved Compositional Generalization in TransformersCode0
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language ModelCode1
Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long SequencesCode0
Solving Hard Analogy Questions with Relation Embedding ChainsCode0
Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament0
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition0
Multi-stage Large Language Model Correction for Speech Recognition0
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter MergingCode1
Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges0
BitNet: Scaling 1-bit Transformers for Large Language ModelsCode2
Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle0
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text ProcessingCode0
Correction Focused Language Model Training for Speech Recognition0
Learn Your Tokens: Word-Pooled Tokenization for Language ModelingCode0
Watermarking LLMs with Weight QuantizationCode1
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation0
Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue0
Show:102550
← PrevPage 143 of 284Next →

No leaderboard results yet.