SOTAVerified

parameter-efficient fine-tuning

Parameter-Efficient Fine-Tuning (PEFT) is a technique used to adapt pre-trained models to new tasks with minimal changes to the model's parameters. This approach is particularly useful in scenarios where computational resources are limited or when it is desirable to maintain the original model's performance on the initial task.

Papers

Showing 176200 of 935 papers

TitleStatusHype
CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning0
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models0
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization AlignmentCode2
ELBA-Bench: An Efficient Learning Backdoor Attacks Benchmark for Large Language Models0
R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task LearningCode1
Sparsity May Be All You Need: Sparse Random Parameter AdaptationCode0
R^3Mem: Bridging Memory Retention and Retrieval via Reversible Compression0
Generative Modeling of Individual Behavior at Scale0
Sculpting [CLS] Features for Pre-Trained Model-Based Class-Incremental Learning0
LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation OptimizationCode0
NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language ModelsCode0
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition0
Token Adaptation via Side Graph Convolution for Temporally and Spatially Efficient Fine-tuning of 3D Point Cloud TransformersCode0
LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation0
BeamLoRA: Beam-Constraint Low-Rank Adaptation0
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning0
Revisiting Privacy, Utility, and Efficiency Trade-offs when Fine-Tuning Large Language Models0
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMsCode2
Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRACode0
Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent0
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading0
CiteCheck: Towards Accurate Citation Faithfulness DetectionCode0
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-TuningCode1
Hallucinations and Truth: A Comprehensive Accuracy Evaluation of RAG, LoRA and DoRA0
DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation0
Show:102550
← PrevPage 8 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LLaMA2-7bAccuracy (% )82.63Unverified
2LLaMA2-7bAccuracy (% )82.63Unverified
3LLaMA2-7bAccuracy (% )81.93Unverified
4LLaMA2-7bAccuracy (% )80.28Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA2-7bAccuracy (% )76.68Unverified
2LLaMA2-7bAccuracy (% )76.67Unverified
3LLaMA2-7bAccuracy (% )76.27Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA2-7bAccuracy (% )70.8Unverified
2LLaMA2-7bAccuracy (% )70.09Unverified
3LLaMA2-7bAccuracy (% )69.85Unverified