SOTAVerified

Language Modeling

Papers

Showing 29012950 of 14182 papers

TitleStatusHype
GraphLLM: Boosting Graph Reasoning Ability of Large Language ModelCode1
LML-DAP: Language Model Learning a Dataset for Data-Augmented PredictionCode1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent CollaborationCode1
Controllable Sentence Simplification with a Unified Text-to-Text Transfer TransformerCode1
Localizing Paragraph Memorization in Language ModelsCode1
Logic.py: Bridging the Gap between LLMs and Constraint SolversCode1
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage RetrievalCode1
GraPPa: Grammar-Augmented Pre-Training for Table Semantic ParsingCode1
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse GradientsCode1
Great Memory, Shallow Reasoning: Limits of kNN-LMsCode1
LLMSTEP: LLM proofstep suggestions in LeanCode1
ArabicMMLU: Assessing Massive Multitask Language Understanding in ArabicCode1
LLMs Can Simulate Standardized Patients via Agent CoevolutionCode1
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary CaptioningCode1
Data Efficient Masked Language Modeling for Vision and LanguageCode1
Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series ForecastingCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
LLMZip: Lossless Text Compression using Large Language ModelsCode1
Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language ModelCode1
DARTS: Differentiable Architecture SearchCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree SearchCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Guiding Attention for Self-Supervised Learning with TransformersCode1
LLM-in-the-loop: Leveraging Large Language Model for Thematic AnalysisCode1
LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language ModelsCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
Control Prefixes for Parameter-Efficient Text GenerationCode1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
RoBERTa: A Robustly Optimized BERT Pretraining ApproachCode1
Hallucinations in Large Multilingual Translation ModelsCode1
LOGO -- Long cOntext aliGnment via efficient preference OptimizationCode1
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attentionCode1
Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales DialogueCode1
HARDMath: A Benchmark Dataset for Challenging Problems in Applied MathematicsCode1
Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic FidelityCode1
Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent SpaceCode1
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model MergingCode1
RoChBert: Towards Robust BERT Fine-tuning for ChineseCode1
Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?Code1
CycleFormer : TSP Solver Based on Language ModelingCode1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19Code1
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)Code1
HYTREL: Hypergraph-enhanced Tabular Data Representation LearningCode1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
Show:102550
← PrevPage 59 of 284Next →

No leaderboard results yet.