SOTAVerified

Language Modeling

Papers

Showing 84018450 of 14182 papers

TitleStatusHype
ChatGPT in the Classroom: An Analysis of Its Strengths and Weaknesses for Solving Undergraduate Computer Science Questions0
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl DataCode1
Framing the News:From Human Perception to Large Language Model Inferences0
A Modular Approach for Multilingual Timex Detection and Normalization using Deep Learning and Grammar-based methodsCode0
Large Language Models are Strong Zero-Shot Retriever0
UIO at SemEval-2023 Task 12: Multilingual fine-tuning for sentiment classification in low-resource languagesCode0
PMC-LLaMA: Towards Building Open-source Language Models for MedicineCode2
SweCTRL-Mini: a data-transparent Transformer-based large language model for controllable text generation in SwedishCode0
ZeroShotDataAug: Generating and Augmenting Training Data with ChatGPT0
Vision Conformer: Incorporating Convolutions into Vision Transformer LayersCode0
Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement0
Learning Human-Human Interactions in Images from Weak Textual Supervision0
Enhancing Large Language Model with Self-Controlled Memory FrameworkCode1
MasonNLP+ at SemEval-2023 Task 8: Extracting Medical Questions, Experiences and Claims from Social Media using Knowledge-Augmented Pre-trained Language Models0
Generative Relevance Feedback with Large Language Models0
KINLP at SemEval-2023 Task 12: Kinyarwanda Tweet Sentiment Analysis0
Compressing Sentence Representation with maximum Coding Rate Reduction0
Blockchain Large Language Models0
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters0
Nondeterministic Stacks in Neural Networks0
State Spaces Aren't Enough: Machine Translation Needs Attention0
Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model--A Preliminary Release0
A Lightweight Constrained Generation Alternative for Query-focused SummarizationCode0
LaMP: When Large Language Models Meet PersonalizationCode1
Recurrent Neural Networks and Long Short-Term Memory Networks: Tutorial and Survey0
Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens0
SAILER: Structure-aware Pre-trained Language Model for Legal Case RetrievalCode1
Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs0
KitchenScale: Learning to predict ingredient quantities from recipe contextsCode0
Evaluating Transformer Language Models on Arithmetic Operations Using Number DecompositionCode0
Robot-Enabled Construction Assembly with Automated Sequence Planning based on ChatGPT: RoboGPT0
SkinGPT-4: An Interactive Dermatology Diagnostic System with Visual Large Language Model0
Word Sense Induction with Knowledge Distillation from BERT0
CEIL: A General Classification-Enhanced Iterative Learning Framework for Text Clustering0
Analyzing FOMC Minutes: Accuracy and Constraints of Language Models0
Phoenix: Democratizing ChatGPT across LanguagesCode4
Scaling Transformer to 1M tokens and beyond with RMTCode2
BRENT: Bidirectional Retrieval Enhanced Norwegian TransformerCode0
A Theory on Adam Instability in Large-Scale Machine Learning0
LLM as A Robotic Brain: Unifying Egocentric Memory and Control0
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Is ChatGPT Equipped with Emotional Dialogue Capabilities?0
Creating Large Language Model Resistant Exams: Guidelines and Strategies0
HeRo: RoBERTa and Longformer Hebrew Language Models0
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions0
A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification0
Large Language Models Based Automatic Synthesis of Software Specifications0
CodeKGC: Code Language Model for Generative Knowledge Graph ConstructionCode0
Masked Language Model Based Textual Adversarial Example DetectionCode0
SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language ModelCode1
Show:102550
← PrevPage 169 of 284Next →

No leaderboard results yet.