SOTAVerified

Language Modeling

Papers

Showing 17011750 of 14182 papers

TitleStatusHype
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent CollaborationCode1
Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGsCode1
CREAM: Consistency Regularized Self-Rewarding Language ModelsCode1
BERT got a Date: Introducing Transformers to Temporal TaggingCode1
DziriBERT: a Pre-trained Language Model for the Algerian DialectCode1
Large Language Models as Corporate LobbyistsCode1
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote SensingCode1
Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical StudyCode1
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense RetrievalCode1
Image Super-Resolution with Text Prompt DiffusionCode1
CriticEval: Evaluating Large Language Model as CriticCode1
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-trainingCode1
Image-Text Co-Decomposition for Text-Supervised Semantic SegmentationCode1
ImagineBench: Evaluating Reinforcement Learning with Large Language Model RolloutsCode1
Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement LearningCode1
BERTje: A Dutch BERT ModelCode1
Data-to-Text Generation with Iterative Text EditingCode1
BERT Loses Patience: Fast and Robust Inference with Early ExitCode1
AutoDIR: Automatic All-in-One Image Restoration with Latent DiffusionCode1
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine TranslationCode1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resourcesCode1
AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizingCode1
Crafting Large Language Models for Enhanced InterpretabilityCode1
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language ModelCode1
CPT: Efficient Deep Neural Network Training via Cyclic PrecisionCode1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and GenerationCode1
Image Hijacks: Adversarial Images can Control Generative Models at RuntimeCode1
Effective Sequence-to-Sequence Dialogue State TrackingCode1
Efficient Long Sequence Modeling via State Space Augmented TransformerCode1
Imposing Relation Structure in Language-Model Embeddings Using Contrastive LearningCode1
Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest ImagesCode1
Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event ExtractionCode1
Improving Indonesian Text Classification Using Multilingual Language ModelCode1
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language ModelsCode1
TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human AnnotationCode1
ITER: Iterative Transformer-based Entity Recognition and Relation ExtractionCode1
iBOT: Image BERT Pre-Training with Online TokenizerCode1
IDAS: Intent Discovery with Abstractive SummarizationCode1
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language ModelsCode1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-GenerationCode1
HYTREL: Hypergraph-enhanced Tabular Data Representation LearningCode1
Counterfactual Token Generation in Large Language ModelsCode1
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal CapabilitiesCode1
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language ModelCode1
HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed HypergraphsCode1
Efficiently Modeling Long Sequences with Structured State SpacesCode1
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video UnderstandingCode1
Efficient Online Data Mixing For Language Model Pre-TrainingCode1
Efficient OCR for Building a Diverse Digital HistoryCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
Show:102550
← PrevPage 35 of 284Next →

No leaderboard results yet.