Chien, Jen-Tzung;
Chen, Ming-Yen;
Lee, Ching-hsien;
Xue, Jing-Hao;
(2024)
Meta Soft Prompting and Learning.
APSIPA Transactions on Signal and Information Processing
, 13
(5)
, Article e402. 10.1561/116.20240010.
Preview |
Text
OA-116.20240010.pdf - Published Version Download (1MB) | Preview |
Abstract
Traditionally, either applying the hard prompt for sentences by handcrafting the prompt templates or directly optimizing the soft or continuous prompt may not sufficiently generalize for unseen domain data. This paper presents a parameter efficient learning for domain-agnostic soft prompt which is developed for few-shot unsupervised domain adaptation. A pre-trained language model (PLM) is frozen and utilized to extract knowledge for unseen domains in various language understanding tasks. The meta learning and optimization over a set of trainable soft tokens is performed by minimizing the cross-entropy loss for masked language model from support and query data in source and target domains, respectively, where the masked tokens for text category and random masking are predicted. The meta soft prompt is learned through a doubly-looped optimization for individual learners and a meta learner when implementing the unsupervised domain adaptation. The PLM is then closely adapted to compensate the domain shift in a target domain. The domain adaptation loss and the prompt-based classification loss are jointly minimized through meta learning. The experiments on multi-domain natural language understanding illustrate the merit of the proposed meta soft prompt in pre-trained language modeling under few-shot setting.
Type: | Article |
---|---|
Title: | Meta Soft Prompting and Learning |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1561/116.20240010 |
Publisher version: | https://doi.org/10.1561/116.20240010 |
Language: | English |
Additional information: | This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http:// creativecommons.org/ licenses/ by-nc/ 4.0/ ), which permits unrestricted re-use, distribution, and reproduction in any medium, for non-commercial use, provided the original work is properly cited. |
Keywords: | Meta learning, few-shot learning, soft prompt, domain adaptation, language model |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10205777 |
Archive Staff Only
![]() |
View Item |