SOTAVerified

Language Modeling

Papers

Showing 1551-1600 of 14182 papers

Title | Status | Hype
Filtering Noisy Parallel Corpus using Transformers with Proxy Task Learning | Code | 1
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Code | 1
FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model | Code | 1
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Code | 1
Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation | Code | 1
Facilitating large language model Russian adaptation with Learned Embedding Propagation | Code | 1
Factorized Neural Transducer for Efficient Language Model Adaptation | Code | 1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Code | 1
fairseq: A Fast, Extensible Toolkit for Sequence Modeling | Code | 1
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing | Code | 1
Analysing The Impact of Sequence Composition on Language Model Pre-Training | Code | 1
AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain | Code | 1
GREEK-BERT: The Greeks visiting Sesame Street | Code | 1
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Code | 1
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Code | 1
GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs | Code | 1
Extracting Latent Steering Vectors from Pretrained Language Models | Code | 1
CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference | Code | 1
Extracting Cultural Commonsense Knowledge at Scale | Code | 1
Extracting Definienda in Mathematical Scholarly Articles with Transformers | Code | 1
Extracting Training Data from Large Language Models | Code | 1
Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Code | 1
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Code | 1
gzip Predicts Data-dependent Scaling Laws | Code | 1
Unifying Segment Anything in Microscopy with Multimodal Large Language Model | Code | 1
Exploring the Limits of Language Modeling | Code | 1
Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels | Code | 1
Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | Code | 1
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Code | 1
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Code | 1
Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems | Code | 1
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Code | 1
Accelerating Vision-Language Pretraining with Free Language Modeling | Code | 1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Code | 1
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Code | 1
Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Code | 1
Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations | Code | 1
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Code | 1
Exploring Quantization for Efficient Pre-Training of Transformer Language Models | Code | 1
Hierarchical Transformers Are More Efficient Language Models | Code | 1
FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction | Code | 1
Backpack Language Models | Code | 1
Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Code | 1
Exploring Stochastic Autoregressive Image Modeling for Visual Representation | Code | 1
Extracting and Inferring Personal Attributes from Dialogue | Code | 1
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Code | 1
Housekeep: Tidying Virtual Households using Commonsense Reasoning | Code | 1
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Code | 1
FALL-E: A Foley Sound Synthesis Model and Strategies | Code | 1
Exploiting Novel GPT-4 APIs | Code | 1
