SOTAVerified

Large Language Model

Papers

Showing 901-950 of 6097 papers

Title | Status | Hype
Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Code | 1
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents | Code | 1
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Code | 1
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models | Code | 1
LADDER: Language Driven Slice Discovery and Error Rectification | Code | 1
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Code | 1
Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Code | 1
Cost-effective Instruction Learning for Pathology Vision and Language Analysis | Code | 1
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Code | 1
DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Code | 1
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Code | 1
ViLLa: Video Reasoning Segmentation with Large Language Model | Code | 1
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Code | 1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Code | 1
On Large Language Model Continual Unlearning | Code | 1
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Code | 1
Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility | Code | 1
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Code | 1
Open-world Multi-label Text Classification with Extremely Weak Supervision | Code | 1
DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Code | 1
Large language models are good medical coders, if provided with tools | Code | 1
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking | Code | 1
Crafting Large Language Models for Enhanced Interpretability | Code | 1
Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Code | 1
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System | Code | 1
Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models | Code | 1
SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model | Code | 1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Code | 1
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment | Code | 1
A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Code | 1
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Code | 1
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Code | 1
The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators | Code | 1
CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Code | 1
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Code | 1
C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Code | 1
DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Code | 1
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Code | 1
Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Code | 1
InternLM-Law: An Open Source Chinese Legal Large Language Model | Code | 1
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors | Code | 1
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Code | 1
CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Code | 1
LiveMind: Low-latency Large Language Models with Simultaneous Inference | Code | 1
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs | Code | 1
BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Code | 1
On AI-Inspired UI-Design | Code | 1
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Code | 1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Code | 1
MolecularGPT: Open Large Language Model (LLM) for Few-Shot Molecular Property Prediction | Code | 1
Page 19 of 122
