SOTAVerified

Language Modeling

Papers

Showing 8101-8150 of 14182 papers

Title | Status | Hype
Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints | - | 0
Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions | - | 0
Into the crossfire: evaluating the use of a language model to crowdsource gun violence reports | Code | 0
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World | - | 0
Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | - | 0
When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment | - | 0
On the importance of Data Scale in Pretraining Arabic Language Models | - | 0
Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance | - | 0
SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT | Code | 0
Your Instructions Are Not Always Helpful: Assessing the Efficacy of Instruction Fine-tuning for Software Vulnerability Detection | - | 0
Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization | - | 0
A character-based steganography using masked language modeling | Code | 0
Activations and Gradients Compression for Model-Parallel Training | Code | 0
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering | - | 0
Distilling Event Sequence Knowledge From Large Language Models | - | 0
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation | - | 0
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization | - | 0
Small Language Model Can Self-correct | - | 0
Parameter-Efficient Detoxification with Contrastive Decoding | - | 0
Tracing the Genealogies of Ideas with Large Language Model Embeddings | - | 0
Evolving Code with A Large Language Model | - | 0
Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints | - | 0
Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model | Code | 0
InRanker: Distilled Rankers for Zero-shot Information Retrieval | Code | 0
A systematic review of geospatial location embedding approaches in large language models: A path to spatial AI systems | - | 0
PersianMind: A Cross-Lingual Persian-English Large Language Model | - | 0
XLS-R Deep Learning Model for Multilingual ASR on Low-Resource Languages: Indonesian, Javanese, and Sundanese | - | 0
xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein | - | 0
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems | - | 0
EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge | - | 0
Investigating Data Contamination for Pre-training Language Models | - | 0
How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes | - | 0
Distilling Vision-Language Models on Millions of Videos | - | 0
LEGOBench: Scientific Leaderboard Generation Benchmark | Code | 0
Combating Adversarial Attacks with Multi-Agent Debate | Code | 0
AugSumm: towards generalizable speech summarization using synthetic labels from large language model | Code | 0
Less is More: A Closer Look at Semantic-based Few-Shot Learning | - | 0
Hierarchical Classification of Transversal Skills in Job Ads Based on Sentence Embeddings | - | 0
ChatGPT, Let us Chat Sign Language: Experiments, Architectural Elements, Challenges and Research Directions | - | 0
Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking | - | 0
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding | Code | 0
How predictable is language model benchmark performance? | - | 0
Exploring Prompt-Based Methods for Zero-Shot Hypernym Prediction with Large Language Models | - | 0
TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property Prediction | Code | 0
The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance | Code | 0
Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet | - | 0
Sparse Meets Dense: A Hybrid Approach to Enhance Scientific Document Retrieval | - | 0
IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification | - | 0
FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | - | 0
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | - | 0

No leaderboard results yet.