SOTAVerified

Language Modeling

Papers

Showing 24512500 of 14182 papers

TitleStatusHype
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language ModelsCode1
Handwritten Mathematical Expression Recognition with Bidirectionally Trained TransformerCode1
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model MergingCode1
CLIP2Video: Mastering Video-Text Retrieval via Image CLIPCode1
Reinforcement Learning Friendly Vision-Language Model for MinecraftCode1
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language ModelCode1
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation ExtractionCode1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than GeneratorsCode1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded HallucinationsCode1
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry WritingCode1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-trainingCode1
Efficient Pre-training of Masked Language Model via Concept-based Curriculum MaskingCode1
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?Code1
EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence InformationCode1
ELI5: Long Form Question AnsweringCode1
Empower Entity Set Expansion via Language Model ProbingCode1
Efficient Nearest Neighbor Language ModelsCode1
CogBench: a large language model walks into a psychology labCode1
hmBERT: Historical Multilingual Language Models for Named Entity RecognitionCode1
Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence ModelingCode1
HOP: History-and-Order Aware Pre-training for Vision-and-Language NavigationCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language modelCode1
Efficiently Modeling Long Sequences with Structured State SpacesCode1
Clover: Towards A Unified Video-Language Alignment and Fusion ModelCode1
How Much Knowledge Can You Pack Into the Parameters of a Language Model?Code1
How multilingual is Multilingual BERT?Code1
Efficient OCR for Building a Diverse Digital HistoryCode1
Efficient Hierarchical Domain Adaptation for Pretrained Language ModelsCode1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-GenerationCode1
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
Acoustic Prompt Tuning: Empowering Large Language Models with Audition CapabilitiesCode1
Efficient Content-Based Sparse Attention with Routing TransformersCode1
Efficient Long Sequence Modeling via State Space Augmented TransformerCode1
Efficient Online Data Mixing For Language Model Pre-TrainingCode1
Hydra: A System for Large Multi-Model Deep LearningCode1
Effective Sequence-to-Sequence Dialogue State TrackingCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video UnderstandingCode1
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code GenerationCode1
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal CapabilitiesCode1
Effective Use of Graph Convolution Network and Contextual Sub-Tree forCommodity News Event ExtractionCode1
IDAS: Intent Discovery with Abstractive SummarizationCode1
Salmon: A Suite for Acoustic Language Model EvaluationCode1
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal ModelsCode1
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language ModelsCode1
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text SpottingCode1
CFBenchmark: Chinese Financial Assistant Benchmark for Large Language ModelCode1
Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event ExtractionCode1
Show:102550
← PrevPage 50 of 284Next →

No leaderboard results yet.