SOTAVerified

Language Modeling

Papers

Showing 27512800 of 14182 papers

TitleStatusHype
GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery0
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model0
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension0
Training Compute-Optimal Protein Language ModelsCode1
Exploring the Landscape for Generative Sequence Models for Specialized Data SynthesisCode0
Context Parallelism for Scalable Million-Token Inference0
RAGViz: Diagnose and Visualize Retrieval-Augmented GenerationCode2
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language ModelsCode1
High-performance automated abstract screening with large language model ensembles0
A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?0
GraphXForm: Graph transformer for computer-aided molecular designCode1
Large Language Model Supply Chain: Open Problems From the Security Perspective0
Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers0
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks0
Can Multimodal Large Language Model Think Analogically?0
Interacting Large Language Model Agents. Interpretable Models and Social Learning0
A Mechanistic Explanatory Strategy for XAI0
Rule Based Rewards for Language Model SafetyCode3
PRIMO: Progressive Induction for Multi-hop Open Rule Generation0
Can Large Language Model Predict Employee Attrition?0
Privacy Leakage Overshadowed by Views of AI: A Study on Human Oversight of Privacy in Language Model Agent0
Improving Few-Shot Cross-Domain Named Entity Recognition by Instruction Tuning a Word-Embedding based Retrieval Augmented Large Language Model0
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language ModelsCode1
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations0
Leveraging Large Language Models for Code-Mixed Data Augmentation in Sentiment AnalysisCode0
Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in TransformersCode0
RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models0
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents0
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software ImprovementCode3
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback0
LLM-KT: A Versatile Framework for Knowledge Transfer from Large Language Models to Collaborative Filtering0
Unified Generative and Discriminative Training for Multi-modal Large Language Models0
Randomized Autoregressive Visual GenerationCode5
LLaMo: Large Language Model-based Molecular Graph AssistantCode1
DEREC-SIMPRO: unlock Language Model benefits to advance Synthesis in Data Clean Room0
MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees0
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning0
Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking0
GPT or BERT: why not both?Code2
EchoNarrator: Generating natural text explanations for ejection fraction predictionsCode0
Morphological Typology in BPE Subword Productivity and Language Modeling0
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach0
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility PredictionCode1
Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning0
Interpretable Language Modeling via Induction-head Ngram ModelsCode1
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts0
π_0: A Vision-Language-Action Flow Model for General Robot Control0
What is Wrong with Perplexity for Long-context Language Modeling?Code2
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching0
The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge0
Show:102550
← PrevPage 56 of 284Next →

No leaderboard results yet.