Language Modeling

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1651–1700 of 14182 papers

Title	Date	Tasks	Status	Hype
Reinforced Large Language Model is a formal theorem prover	Feb 13, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Logical forms complement probability in understanding language model (and human) performance	Feb 13, 2025	Language ModelingLanguage Modelling	—Unverified	0
AIDE: Agentically Improve Visual Language Model with Domain Experts	Feb 13, 2025	Knowledge DistillationLanguage Modeling	—Unverified	0
On Mechanistic Circuits for Extractive Question-Answering	Feb 12, 2025	Extractive Question-AnsweringLanguage Modeling	—Unverified	0
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search	Feb 12, 2025	Feature EngineeringGraph Learning	—Unverified	0
E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection	Feb 12, 2025	Instruction FollowingLanguage Modeling	—Unverified	0
Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation	Feb 12, 2025	Language ModelingLanguage Modelling	—Unverified	0
TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion	Feb 12, 2025	Computational EfficiencyLanguage Modeling	—Unverified	0
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification	Feb 12, 2025	DecoderDescriptive	CodeCode Available	2
SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence	Feb 12, 2025	Computational EfficiencyLanguage Modeling	CodeCode Available	1
Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations	Feb 12, 2025	Language ModelingLanguage Modelling	—Unverified	0
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model	Feb 12, 2025	Language ModelingLanguage Modelling	—Unverified	0
Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples	Feb 12, 2025	Distractor GenerationInformation Retrieval	—Unverified	0
LLM Pretraining with Continuous Concepts	Feb 12, 2025	Knowledge DistillationLanguage Modeling	—Unverified	0
QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval	Feb 12, 2025	Answer GenerationInformation Retrieval	—Unverified	0
AI-VERDE: A Gateway for Egalitarian Access to Large Language Model-Based Resources For Educational Institutions	Feb 11, 2025	Language ModelingLanguage Modelling	—Unverified	0
MetaSC: Test-Time Safety Specification Optimization for Language Models	Feb 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification	Feb 11, 2025	Contrastive LearningData Augmentation	CodeCode Available	1
ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model	Feb 11, 2025	ArticlesLanguage Modeling	—Unverified	0
Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems	Feb 11, 2025	Language ModelingLanguage Modelling	—Unverified	0
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata	Feb 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Small Language Model Makes an Effective Long Text Extractor	Feb 11, 2025	GPULanguage Modeling	CodeCode Available	1
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization	Feb 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
RomanLens: Latent Romanization and its role in Multilinguality in LLMs	Feb 11, 2025	Language ModelingLanguage Modelling	—Unverified	0
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More	Feb 11, 2025	DecoderInformation Retrieval	CodeCode Available	0
Auditing Prompt Caching in Language Model APIs	Feb 11, 2025	DecoderLanguage Modeling	CodeCode Available	0
Implicit Language Models are RNNs: Balancing Parallelization and Expressivity	Feb 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
AppVLM: A Lightweight Vision Language Model for Online App Control	Feb 10, 2025	Language ModelingLanguage Modelling	—Unverified	0
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM	Feb 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	4
K-ON: Stacking Knowledge On the Head Layer of Large Language Model	Feb 10, 2025	Language ModelingLanguage Modelling	—Unverified	0
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates	Feb 10, 2025	Hierarchical Reinforcement LearningLanguage Modeling	CodeCode Available	4
Recent Advances in Discrete Speech Tokens: A Review	Feb 10, 2025	Language ModelingLanguage Modelling	—Unverified	0
Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation	Feb 10, 2025	Decision MakingLanguage Modeling	—Unverified	0
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	Feb 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE	Feb 10, 2025	DiversityLanguage Modeling	CodeCode Available	1
Rationalization Models for Text-to-SQL	Feb 10, 2025	Knowledge DistillationLanguage Modeling	—Unverified	0
μnit Scaling: Simple and Scalable FP8 LLM Training	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models	Feb 9, 2025	Answer GenerationLanguage Modeling	CodeCode Available	0
Investigating Compositional Reasoning in Time Series Foundation Models	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform	Feb 9, 2025	Decision MakingLanguage Modeling	—Unverified	0
Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
Enabling Autoregressive Models to Fill In Masked Tokens	Feb 9, 2025	DecoderLanguage Modeling	—Unverified	0
Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education	Feb 9, 2025	Image RetrievalLanguage Modeling	—Unverified	0
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control	Feb 9, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care	Feb 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding	Feb 8, 2025	DenoisingImage Generation	CodeCode Available	1
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	Feb 8, 2025	DecoderLanguage Modeling	CodeCode Available	11
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging	Feb 8, 2025	Language ModelingLanguage Modelling	—Unverified	0

Show:10 25 50

← PrevPage 34 of 284Next →

No leaderboard results yet.