SOTAVerified

Language Modeling

Papers

Showing 17511800 of 14182 papers

TitleStatusHype
ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learningCode1
Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model SupportCode1
Endowing Protein Language Models with Structural KnowledgeCode1
Parameter-Efficient Conversational Recommender System as a Language Processing TaskCode1
Fluent dreaming for language modelsCode1
How well can a large language model explain business processes as perceived by users?Code1
Can Large Language Models Write Parallel Code?Code1
MolTailor: Tailoring Chemical Molecular Representation to Specific Tasks via Text PromptsCode1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image SequencesCode1
Excuse me, sir? Your language model is leaking (information)Code1
Self-Rewarding Language ModelsCode1
ADCNet: a unified framework for predicting the activity of antibody-drug conjugatesCode1
Asynchronous Local-SGD Training for Language ModelingCode1
TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in ConversationCode1
Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered ChatbotCode1
Multi-Task Learning for Front-End Text Processing in TTSCode1
ModaVerse: Efficiently Transforming Modalities with LLMsCode1
Rewriting the Code: A Simple Method for Large Language Model Augmented Code SearchCode1
Language Models Encode the Value of Numbers LinearlyCode1
VLLaVO: Mitigating Visual Gap through LLMsCode1
Multi-modal vision-language model for generalizable annotation-free pathology localization and clinical diagnosisCode1
PLLaMa: An Open-source Large Language Model for Plant ScienceCode1
Quokka: An Open-source Large Language Model ChatBot for Material ScienceCode1
GeoGalactica: A Scientific Large Language Model in GeoscienceCode1
SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent DetectionCode1
Open-TI: Open Traffic Intelligence with Augmented Language ModelCode1
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model EvaluationCode1
DrugAssist: A Large Language Model for Molecule OptimizationCode1
RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k RecommendationCode1
Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical StudyCode1
Exploiting Novel GPT-4 APIsCode1
Time is Encoded in the Weights of Finetuned Language ModelsCode1
Cached Transformers: Improving Transformers with Differentiable Memory CacheCode1
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-trainingCode1
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test ConstructionCode1
Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender SystemsCode1
Cascade Speculative Drafting for Even Faster LLM InferenceCode1
RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language ModelsCode1
Catwalk: A Unified Language Model Evaluation Framework for Many DatasetsCode1
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document GenerationCode1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgentCode1
Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language modelCode1
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model ReasoningCode1
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward HackingCode1
ViLA: Efficient Video-Language Alignment for Video Question AnsweringCode1
SwitchHead: Accelerating Transformers with Mixture-of-Experts AttentionCode1
On Diversified Preferences of Large Language Model AlignmentCode1
READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language ModelingCode1
Hallucination Augmented Contrastive Learning for Multimodal Large Language ModelCode1
Gated Linear Attention Transformers with Hardware-Efficient TrainingCode1
Show:102550
← PrevPage 36 of 284Next →

No leaderboard results yet.