arXiv Analytics

Sign in

arXiv:1909.08531 [cs.LG]AbstractReferencesReviewsResources

Transfer Learning with Dynamic Distribution Adaptation

Jindong Wang, Yiqiang Chen, Wenjie Feng, Han Yu, Meiyu Huang, Qiang Yang

Published 2019-09-17Version 1

Transfer learning aims to learn robust classifiers for the target domain by leveraging knowledge from a source domain. Since the source and the target domains are usually from different distributions, existing methods mainly focus on adapting the cross-domain marginal or conditional distributions. However, in real applications, the marginal and conditional distributions usually have different contributions to the domain discrepancy. Existing methods fail to quantitatively evaluate the different importance of these two distributions, which will result in unsatisfactory transfer performance. In this paper, we propose a novel concept called Dynamic Distribution Adaptation (DDA), which is capable of quantitatively evaluating the relative importance of each distribution. DDA can be easily incorporated into the framework of structural risk minimization to solve transfer learning problems. On the basis of DDA, we propose two novel learning algorithms: (1) Manifold Dynamic Distribution Adaptation (MDDA) for traditional transfer learning, and (2) Dynamic Distribution Adaptation Network (DDAN) for deep transfer learning. Extensive experiments demonstrate that MDDA and DDAN significantly improve the transfer learning performance and setup a strong baseline over the latest deep and adversarial methods on digits recognition, sentiment analysis, and image classification. More importantly, it is shown that marginal and conditional distributions have different contributions to the domain divergence, and our DDA is able to provide good quantitative evaluation of their relative importance which leads to better performance. We believe this observation can be helpful for future research in transfer learning.

Comments: Accepted to ACM Transactions on Intelligent Systems and Technology (ACM TIST) 2019, 25 pages. arXiv admin note: text overlap with arXiv:1807.07258
Journal: ACM Transactions on Intelligent Systems and Technology (ACM TIST) 2019
Categories: cs.LG, stat.ML
Related articles: Most relevant | Search more
arXiv:1212.2466 [cs.LG] (Published 2012-10-19)
On Information Regularization
arXiv:1912.00895 [cs.LG] (Published 2019-12-02)
Learning to smell for wellness
arXiv:2005.02196 [cs.LG] (Published 2020-05-05)
Measuring the Discrepancy between Conditional Distributions: Methods, Properties and Applications