SOTAVerified

Language Modeling

Papers

Showing 1601-1650 of 14182 papers

Title | Status | Hype
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Code | 1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | Code | 1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Code | 1
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Code | 1
CPT: Efficient Deep Neural Network Training via Cyclic Precision | Code | 1
Crafting Large Language Models for Enhanced Interpretability | Code | 1
HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs | Code | 1
iBOT: Image BERT Pre-Training with Online Tokenizer | Code | 1
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining | Code | 1
Balanced Data Sampling for Language Model Training with Clustering | Code | 1
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation | Code | 1
Discovering Autoregressive Orderings with Variational Inference | Code | 1
Addressing Some Limitations of Transformers with Feedback Memory | Code | 1
Improved training of end-to-end attention models for speech recognition | Code | 1
Counterfactual Token Generation in Large Language Models | Code | 1
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Code | 1
Hybrid Ranking Network for Text-to-SQL | Code | 1
Discrete Flows: Invertible Generative Models of Discrete Data | Code | 1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Code | 1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Code | 1
Counterfactual Data Augmentation for Neural Machine Translation | Code | 1
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech | Code | 1
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Code | 1
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla | Code | 1
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution | Code | 1
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training | Code | 1
CPLLM: Clinical Prediction with Large Language Models | Code | 1
Human Language Modeling | Code | 1
Hydra: A System for Large Multi-Model Deep Learning | Code | 1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | Code | 1
DocSCAN: Unsupervised Text Classification via Learning from Neighbors | Code | 1
Distilling Linguistic Context for Language Model Compression | Code | 1
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Code | 1
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | Code | 1
HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification | Code | 1
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression | Code | 1
Batch Prompting: Efficient Inference with Large Language Model APIs | Code | 1
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Code | 1
cosFormer: Rethinking Softmax in Attention | Code | 1
Distributed Deep Learning in Open Collaborations | Code | 1
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | Code | 1
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Code | 1
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Code | 1
Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Code | 1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | Code | 1
Knowledge Graph Generation From Text | Code | 1
How well can a large language model explain business processes as perceived by users? | Code | 1
Copy Is All You Need | Code | 1
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Code | 1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Code | 1

No leaderboard results yet.