SOTAVerified

Language Modeling

Papers

Showing 851900 of 14182 papers

TitleStatusHype
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training0
Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep ClassificationCode0
Mixer Metaphors: audio interfaces for non-musical applications0
Higher-Order Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions0
BitNet b1.58 2B4T Technical Report0
Generative Recommendation with Continuous-Token Diffusion0
Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach0
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning0
DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment0
Interpreting the linear structure of vision-language model embedding spaces0
Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions0
Towards Conversational AI for Human-Machine Collaborative MLOps0
Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence0
Co-STAR: Collaborative Curriculum Self-Training with Adaptive Regularization for Source-Free Video Domain AdaptationCode0
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports0
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image SegmentationCode0
Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content0
ReZero: Enhancing LLM search ability by trying one-more-time0
ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings0
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning0
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis0
Looking beyond the next token0
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance0
A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science0
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model0
SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model0
Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families0
α-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models0
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models0
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical ImagingCode1
MorphTok: Morphologically Grounded Tokenization for Indian Languages0
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data0
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement0
Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis0
Automated Testing of COBOL to Java Transformation0
GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction0
RealHarm: A Collection of Real-World Language Model Application FailuresCode0
Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design0
SegEarth-R1: Geospatial Pixel Reasoning via Large Language ModelCode2
Domain-Adaptive Continued Pre-Training of Small Language Models0
Kongzi: A Historical Large Language Model with Fact Enhancement0
Vision-Language Model for Object Detection and Segmentation: A Review and EvaluationCode2
Structure-Accurate Medical Image Translation via Dynamic Frequency Balance and Knowledge Guidance0
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents0
AgentDynEx: Nudging the Mechanics and Dynamics of Multi-Agent Simulations0
AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents0
Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics SimulationsCode1
Show:102550
← PrevPage 18 of 284Next →

No leaderboard results yet.