SOTAVerified

In-Context Learning

Papers

Showing 226-250 of 2297 papers

Title | Status | Hype
Zero-shot Model-based Reinforcement Learning using Large Language Models | Code | 1
ELICIT: LLM Augmentation via External In-Context Capability | Code | 1
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction | Code | 1
Metalic: Meta-Learning In-Context with Protein Language Models | Code | 1
Steering Large Language Models using Conceptors: Improving Addition-Based Activation Engineering | Code | 1
Tree of Problems: Improving structured problem solving with compositionality | Code | 1
Retrieval-Augmented Decision Transformer: External Memory for In-context RL | Code | 1
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning | Code | 1
Vector-ICL: In-context Learning with Continuous Vector Representations | Code | 1
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning | Code | 1
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Code | 1
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Code | 1
PersonalLLM: Tailoring LLMs to Individual Preferences | Code | 1
Text Clustering as Classification with LLMs | Code | 1
T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition | Code | 1
In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding | Code | 1
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models | Code | 1
StateAct: State Tracking and Reasoning for Acting and Planning with Large Language Models | Code | 1
Fine-tuning Large Language Models for Entity Matching | Code | 1
The Compressor-Retriever Architecture for Language Model OS | Code | 1
Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models | Code | 1
Causal-Guided Active Learning for Debiasing Large Language Models | Code | 1
V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard? | Code | 1
MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL | Code | 1
ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models | Code | 1
Page 10 of 92

No leaderboard results yet.