SOTAVerified

Language Modeling

Papers

Showing 23512400 of 14182 papers

TitleStatusHype
Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model0
BatchTopK Sparse AutoencodersCode3
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance0
MAVias: Mitigate any Visual Bias0
Simulating Human-like Daily Activities with Desire-driven Autonomy0
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial RelationsCode1
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Pre-trained protein language model for codon optimization0
GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model0
Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling0
Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents0
Trust No AI: Prompt Injection Along The CIA Security Triad0
LVP-CLIP:Revisiting CLIP for Continual Learning with Label Vector Pool0
Confidence Diagram of Nonparametric Ranking for Uncertainty Assessment in Large Language Models Evaluation0
ULMRec: User-centric Large Language Model for Sequential Recommendation0
SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision0
Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation0
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of ExpertsCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System0
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model EvaluationCode2
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora0
PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning0
From Voice to Value: Leveraging AI to Enhance Spoken Online Reviews on the Go0
CigTime: Corrective Instruction Generation Through Inverse Motion Editing0
Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report GenerationCode0
Transformers Can Navigate Mazes With Multi-Step PredictionCode1
Generative Humanization for Therapeutic Antibodies0
Smoothie: Label Free Language Model RoutingCode1
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling0
Adaptive Optimization for Enhanced Efficiency in Large-Scale Language Model Training0
Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation0
Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model0
QueEn: A Large Language Model for Quechua-English Translation0
A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges0
KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning0
Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference0
LinVT: Empower Your Image-level Large Language Model to Understand VideosCode2
A Practical Examination of AI-Generated Text Detectors for Large Language Models0
Understanding Hidden Computations in Chain-of-Thought ReasoningCode0
Establishing Task Scaling Laws via Compute-Efficient Model Ladders0
EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios0
MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language ModelCode1
MISR: Measuring Instrumental Self-Reasoning in Frontier ModelsCode1
ALMA: Alignment with Minimal Annotation0
Aligned Music Notation and Lyrics TranscriptionCode0
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios0
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM0
A large language model-type architecture for high-dimensional molecular potential energy surfaces0
Show:102550
← PrevPage 48 of 284Next →

No leaderboard results yet.