SOTAVerified

Language Modeling

Papers

Showing 27012750 of 14182 papers

TitleStatusHype
The Empirical Impact of Data Sanitization on Language Models0
Recycled Attention: Efficient inference for long-context language modelsCode0
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation0
Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model0
An Early FIRST Reproduction and Improvements to Single-Token Decoding for Fast Listwise Reranking0
SSSD: Simply-Scalable Speculative Decoding0
LBPE: Long-token-first Tokenization to Improve Large Language Models0
Assessing the Answerability of Queries in Retrieval-Augmented Code Generation0
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent0
Unmasking the Shadows: Pinpoint the Implementations of Anti-Dynamic Analysis Techniques in Malware Using LLM0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback0
A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion DiagnosisCode0
AgentOps: Enabling Observability of LLM Agents0
Aioli: A Unified Optimization Framework for Language Model Data MixingCode1
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-AnsweringCode2
Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learning0
Watermarking Language Models through Language Models0
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models0
DELIFT: Data Efficient Language model Instruction Fine TuningCode1
LLM2CLIP: Powerful Language Model Unlocks Richer Visual RepresentationCode4
BendVLM: Test-Time Debiasing of Vision-Language EmbeddingsCode0
Benchmarking Large Language Models with Integer Sequence Generation Tasks0
Scaling Laws for Pre-training Agents and World Models0
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-trainingCode2
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein EngineeringCode1
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language ModelCode0
When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and KanbunCode0
A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model0
Deploying Multi-task Online Server with Large Language Model0
Large Generative Model-assisted Talking-face Semantic Communication System0
The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation0
Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-AwarenessCode0
Fine-Tuning Vision-Language Model for Automated Engineering Drawing Information Extraction0
Unified Pathological Speech Analysis with Prompt Tuning0
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity DatasetCode1
AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution0
Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status0
The Evolution of RWKV: Advancements in Efficient Language Modeling0
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning0
Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities0
ChatGPT in Research and Education: Exploring Benefits and Threats0
HumanVLM: Foundation for Human-Scene Vision-Language Model0
PersianRAG: A Retrieval-Augmented Generation System for Persian Language0
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference OptimizationCode2
[Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI0
AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis0
Wave Network: An Ultra-Small Language Model0
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease KnowledgeCode1
Show:102550
← PrevPage 55 of 284Next →

No leaderboard results yet.