SOTAVerified

Large Language Model

Papers

Showing 60516097 of 6097 papers

TitleStatusHype
BAR: A Backward Reasoning based Agent for Complex Minecraft TasksCode0
A dynamical clipping approach with task feedback for Proximal Policy OptimizationCode0
Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge GraphsCode0
Unveiling Environmental Impacts of Large Language Model Serving: A Functional Unit ViewCode0
Unveiling Gender Bias in Large Language Models: Using Teacher's Evaluation in Higher Education As an ExampleCode0
Surpassing Cosine Similarity for Multidimensional Comparisons: Dimension Insensitive Euclidean MetricCode0
HeavyWater and SimplexWater: Watermarking Low-Entropy Text DistributionsCode0
Are Generative AI Agents Effective Personalized Financial Advisors?Code0
A Quick, trustworthy spectral knowledge Q&A system leveraging retrieval-augmented generation on LLMCode0
Vision Meets Language: A RAG-Augmented YOLOv8 Framework for Coffee Disease Diagnosis and Farmer AssistanceCode0
AI-University: An LLM-based platform for instructional alignment to scientific classroomsCode0
Zero-Shot Multi-modal Large Language Model v.s. Supervised Deep Learning: A Comparative Study on CT-Based Intracranial Hemorrhage SubtypingCode0
Measuring the Influence of Incorrect Code on Test GenerationCode0
Heaps' Law in GPT-Neo Large Language Model Emulated CorporaCode0
Suspected Undeclared Use of Artificial Intelligence in the Academic Literature: An Analysis of the Academ-AI DatasetCode0
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement LearningCode0
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between ActionsCode0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
Harnessing the Power of Large Language Model for Uncertainty Aware Graph ProcessingCode0
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization GenerationCode0
De-jargonizing Science for Journalists with GPT-4: A Pilot StudyCode0
DeepTextMark: A Deep Learning-Driven Text Watermarking Approach for Identifying Large Language Model Generated TextCode0
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model AgentsCode0
SweCTRL-Mini: a data-transparent Transformer-based large language model for controllable text generation in SwedishCode0
Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive StudyCode0
Deep Natural Language Feature Learning for Interpretable PredictionCode0
Deep Learning and Data Augmentation for Detecting Self-Admitted Technical DebtCode0
Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent SetupCode0
HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model TrainingCode0
Guarded Query Routing for Large Language ModelsCode0
DeepArt: A Benchmark to Advance Fidelity Research in AI-Generated ContentCode0
SyllabusQA: A Course Logistics Question Answering DatasetCode0
Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response ForecastingCode0
Zero-shot Translation of Attention Patterns in VQA Models to Natural LanguageCode0
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in GermanCode0
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language ModelCode0
Towards Ontology-Enhanced Representation Learning for Large Language ModelsCode0
CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal RepresentationsCode0
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language ModelCode0
Towards Personalized Evaluation of Large Language Models with An Anonymous Crowd-Sourcing PlatformCode0
Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical TeamsCode0
G-Safeguard: A Topology-Guided Security Lens and Treatment on LLM-based Multi-agent SystemsCode0
Revealing and Mitigating the Challenge of Detecting Character Knowledge Errors in LLM Role-PlayingCode0
Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite AttacksCode0
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table ExtractionCode0
Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged AnnotationsCode0
SynSUM -- Synthetic Benchmark with Structured and Unstructured Medical RecordsCode0
Show:102550
← PrevPage 122 of 122Next →

No leaderboard results yet.