SOTAVerified

Language Modeling

Papers

Showing 2901–2925 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Code | 1 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Code | 1 |
| ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic | Code | 1 |
| Controllable Sentence Simplification with a Unified Text-to-Text Transfer Transformer | Code | 1 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Code | 1 |
| DARTS: Differentiable Architecture Search | Code | 1 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Code | 1 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | Code | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Code | 1 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Code | 1 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Code | 1 |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Code | 1 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | Code | 1 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Code | 1 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | Code | 1 |
| Great Memory, Shallow Reasoning: Limits of kNN-LMs | Code | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Code | 1 |
| GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs | Code | 1 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Code | 1 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Code | 1 |
| CycleFormer : TSP Solver Based on Language Modeling | Code | 1 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Code | 1 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Code | 1 |
| Dealing with Typos for BERT-based Passage Retrieval and Ranking | Code | 1 |

No leaderboard results yet.