SOTAVerified

Language Modeling

Papers

Showing 28512900 of 14182 papers

TitleStatusHype
LLMSTEP: LLM proofstep suggestions in LeanCode1
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and AugmentationCode1
LMBot: Distilling Graph Knowledge into Language Model for Graph-less Deployment in Twitter Bot DetectionCode1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy PreservationCode1
Data Efficient Masked Language Modeling for Vision and LanguageCode1
AraGPT2: Pre-Trained Transformer for Arabic Language GenerationCode1
LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language ModelsCode1
GePpeTto Carves Italian into a Language ModelCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUsCode1
AraELECTRA: Pre-Training Text Discriminators for Arabic Language UnderstandingCode1
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language ModelingCode1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
LLM-in-the-loop: Leveraging Large Language Model for Thematic AnalysisCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and TextCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language ModelCode1
GLADIS: A General and Large Acronym Disambiguation BenchmarkCode1
LLMBind: A Unified Modality-Task Integration FrameworkCode1
DARTS: Differentiable Architecture SearchCode1
LLMCBench: Benchmarking Large Language Model Compression for Efficient DeploymentCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
3D Visual Illusion Depth EstimationCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
Contrastive Chain-of-Thought PromptingCode1
LLMs Can Simulate Standardized Patients via Agent CoevolutionCode1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage RetrievalCode1
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model EvaluationCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
Residual Energy-Based Models for Text GenerationCode1
GNN-LM: Language Modeling based on Global Contexts via GNNCode1
Contrastive Learning for Prompt-Based Few-Shot Language LearnersCode1
ArabicMMLU: Assessing Massive Multitask Language Understanding in ArabicCode1
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial RelationsCode1
CXR-LLAVA: a multimodal large language model for interpreting chest X-ray imagesCode1
CycleFormer : TSP Solver Based on Language ModelingCode1
DALE: Generative Data Augmentation for Low-Resource Legal NLPCode1
Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series ForecastingCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
Golos: Russian Dataset for Speech ResearchCode1
Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language ModelCode1
LLaRA: Large Language-Recommendation AssistantCode1
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language TechnologiesCode1
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language ModelsCode1
CTRAN: CNN-Transformer-based Network for Natural Language UnderstandingCode1
CTRL: A Conditional Transformer Language Model for Controllable GenerationCode1
Show:102550
← PrevPage 58 of 284Next →

No leaderboard results yet.