SOTAVerified

Large Language Model

Papers

Showing 751800 of 6097 papers

TitleStatusHype
SAGA: A Security Architecture for Governing AI Agentic Systems0
MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind0
An Empirical Study of Evaluating Long-form Question AnsweringCode0
LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling MethodCode1
The Big Send-off: High Performance Collectives on GPU-based Supercomputers0
Exploring a Large Language Model for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development0
Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providersCode1
Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation0
Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?0
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation0
Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code0
Automatically Generating Rules of Malicious Software Packages via Large Language Model0
Toward Personalizing Quantum Computing Education: An Evolutionary LLM-Powered Approach0
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model0
Monte Carlo Planning with Large Language Model for Text-Based Game Agents0
ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost0
Improving Significant Wave Height Prediction Using Chronos Models0
Exploring human-SAV interaction using large language models: The impact of psychological ownership and anthropomorphism on user experience0
FaceInsight: A Multimodal Large Language Model for Face Perception0
Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning0
Enhancing TCR-Peptide Interaction Prediction with Pretrained Language Models and Molecular Representations0
Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V30
Automated Bug Report Prioritization in Large Open-Source ProjectsCode0
DATETIME: A new benchmark to measure LLM translation and reasoning capabilitiesCode0
Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models0
LLMs meet Federated Learning for Scalable and Secure IoT Management0
Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software0
Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes0
LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning0
Speculative Sampling via Exponential Races0
Virology Capabilities Test (VCT): A Multimodal Virology Q&A BenchmarkCode0
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling EvaluatorsCode0
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models0
Kuwain 1.5B: An Arabic SLM via Language Injection0
Automated Duplicate Bug Report Detection in Large Open Bug Repositories0
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval0
Causal Disentanglement for Robust Long-tail Medical Image Generation0
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines0
ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task0
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts0
Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management0
SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation0
Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory AcceleratorCode0
FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference0
Manipulating Multimodal Agents via Cross-Modal Prompt Injection0
Walk the Talk? Measuring the Faithfulness of Large Language Model ExplanationsCode1
Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models0
High-Throughput LLM inference on Heterogeneous Clusters0
Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods0
Large Language Bayes0
Show:102550
← PrevPage 16 of 122Next →

No leaderboard results yet.