1 Introduction

Convolutional Neural Networks (CNNs) are among the representative algorithms of deep learning. However, CNNs often rely on large amounts of labeled training data in practical applications. Although rich labels can be provided for some fields with many categories, doing so incurs a high time cost. To address this problem, Unsupervised Domain Adaptation (UDA) transfers the knowledge learned from a labeled source domain to an unlabeled target domain, and has attracted considerable attention from academia [5, 18] and industry [22].

Unsupervised domain adaptation has made impressive progress so far, and the vast majority of methods align the distributions of the source and target domains by reducing a domain discrepancy measure, such as Maximum Mean Discrepancy (MMD) [1] or Joint Maximum Mean Discrepancy (JMMD) [2]. Another predominant stream in UDA builds on Generative Adversarial Networks [25], maximizing the error of a domain discriminator to confuse the source and target domains (Fig. 1).

Fig. 1. We achieve instance-level alignment by leveraging the ability of contrastive learning to pull positive samples closer and push negative samples away.

However, direct alignment in the feature space may lead to the following problems. First, due to sampling variability, the label spaces of the source and target samples in each mini-batch differ, which inevitably produces outliers and degrades generalization performance. Second, this direct approach to reducing the domain gap does not take instance-level alignment into account. Domain adaptation therefore needs a solution that considers both the distribution discrepancy and the category discrepancy.

To tackle the aforementioned problems, this paper proposes an energy representation-based contrastive learning algorithm. First, we improve contrastive learning and apply it to the UDA task for instance-level alignment. Second, we look at the UDA problem from another perspective, treating the target domain data as out-of-distribution (OOD) data that shares the label space of the source domain. Because the target and source domains follow different data distributions, their energy values differ to some extent [28, 32, 33]. We exploit this difference to encourage the classifier to fit the energy values of the target domain to the vicinity of those of the source domain, thereby mitigating the effect of domain shift. Since the energy is a non-probabilistic scalar, it can be regarded as a kind of norm of the output vector; it is less affected by the label space of the mini-batch, so reducing the domain gap through energy better avoids the negative optimization caused by sampling randomness.

We conduct experiments on several datasets to compare with state-of-the-art methods, and the experimental results demonstrate the effectiveness of our method. Furthermore, we comprehensively investigate the impact of the different components of our approach, aiming to provide insights for future research. The contributions of this article are summarized as follows:

  1.

    We provide a new perspective that treats the target domain as out-of-distribution data sharing the label space of the source domain, and achieve unsupervised domain adaptation by narrowing the gap between the OOD (target) and in-distribution (source) data.

  2.

    We improve the contrastive learning paradigm, using it to pull positive pairs closer and push negative pairs farther apart, enabling instance-level alignment.

  3.

    To verify the effectiveness of our method, we conduct extensive experiments on two UDA datasets and select multiple state-of-the-art methods as baselines. Experiments show that our method performs consistently well across UDA tasks. We further conduct comprehensive ablation experiments to verify the effectiveness of our method under different settings.

2 Related Work

2.1 Contrastive Learning

Self-supervised learning aims to improve the feature extraction ability of models by designing auxiliary tasks that mine the representational structure of data as supervisory signal for unlabeled data [6, 16, 17]. Thanks to the emergence of contrastive learning, many methods further improve unsupervised learning performance by reducing the distance between positive samples. SimCLR [8] generates contrast pairs for the data in the current mini-batch through data augmentation and cosine similarity, which improves the generalization ability of the model; MoCo v1 [9] updates the stored historical features of samples through a momentum update, so that contrastive learning can exploit historical information to obtain better feature representations. Recent research extends contrastive learning further as a paradigm, and several methods approach it from the perspective of clustering [23, 27, 31]. Inspired by this, we aim to achieve instance-level alignment in UDA by exploiting the ability of contrastive learning to narrow the distance between positive samples.

2.2 Energy Based Model

The main purpose of an energy-based model (EBM), first proposed by LeCun et al. [29], is to construct a function that maps every point in the input space to a non-probabilistic scalar called energy. Through this non-probabilistic scalar, the optimization difficulties and instability caused by modeling probability densities can be largely avoided. Liu et al. [28] used energy to detect out-of-distribution (OOD) data, and [32] employs a formal connection between machine learning and thermodynamics to characterize the quality of learned representations for transfer learning; energy-based models have also been explored in domain adaptation. Similarly, we approach the UDA problem from another perspective: taking energy as a domain-specific representation, and completing knowledge transfer in unsupervised domain adaptation through energy transfer.

2.3 Unsupervised Domain Adaptation

The main purpose of unsupervised domain adaptation (UDA) is to transfer knowledge from a labeled source domain to an unlabeled target domain. Ben-David et al. [15] theoretically showed that reducing the domain gap during training makes the classifier more suitable for the target domain. Based on this, reducing the domain gap [1, 3, 35] is a classic approach to the UDA problem. Without handling instance-level information in each domain, knowledge transfer can be accomplished by mapping the data distributions into a Reproducing Kernel Hilbert Space (RKHS) and using a convolutional neural network to reduce the domain discrepancy [2, 13].

Fig. 2. We use sample features of different dimensions to generate energy, and complete the knowledge transfer between the source and target domains through energy transfer. Meanwhile, to achieve cross-domain instance-level alignment, we pull the positive samples of the source and target domains closer through contrastive learning.

3 Proposed Method

3.1 Basic Definition

Given a well-annotated source domain \(D_{s} = \left\{ (x^{s}_{i},y^{s}_{i})\right\} ^{n_{s}}_{i=1}\), unlabeled target instances \(D_{t} = \left\{ x^{t}_{j}\right\} ^{n_{t}}_{j=1}\), and augmented target domain data \(D_{a} = \left\{ x^{a}_{j}\right\} ^{n_{a}}_{j=1}\), where N denotes the number of classes, our aim is to transfer the knowledge learned from the labeled source domain to the unlabeled target domain.

The overall structure of our network is shown in Fig. 2. We extract features with a feature extractor F and denote them by \(f_{i}\). To obtain better feature embeddings, we use a projection head P to map the features into the latent contrast space, yielding \(p_{i}\). Finally, the classifier C produces a class probability distribution for each sample.
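For concreteness, the following PyTorch sketch shows one way the three components could be wired together; the class name, projection dimension, and head sizes are illustrative assumptions rather than the exact configuration used in our experiments.

```python
# Minimal sketch of the network in Fig. 2 (hypothetical dimensions; the backbone
# is ResNet-50 as in our experiments, here taken from torchvision).
import torch
import torch.nn as nn
from torchvision import models

class ETNet(nn.Module):  # class name is illustrative, not from the paper
    def __init__(self, num_classes: int, proj_dim: int = 128):
        super().__init__()
        backbone = models.resnet50(weights=None)
        feat_dim = backbone.fc.in_features          # 2048 for ResNet-50
        backbone.fc = nn.Identity()
        self.F = backbone                           # feature extractor F -> f_i
        self.P = nn.Sequential(                     # projection head P -> p_i
            nn.Linear(feat_dim, feat_dim), nn.ReLU(inplace=True),
            nn.Linear(feat_dim, proj_dim))
        self.C = nn.Linear(feat_dim, num_classes)   # classifier C -> class logits

    def forward(self, x):
        f = self.F(x)                                  # backbone features f_i
        p = nn.functional.normalize(self.P(f), dim=1)  # contrast-space embedding p_i
        logits = self.C(f)                             # class scores per sample
        return f, p, logits
```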

3.2 Contrastive Learning at the Instance Level

Contrastive learning is a framework that typically uses the context of the same instance to learn representations by discriminating a positive query from a collection of negative examples in an embedding space. We hope to accomplish cross-domain instance-level alignment through its representation learning ability. However, contrastive learning methods designed for unsupervised learning do not involve knowledge transfer between domains, and they tend to fail when there is not enough contrast, e.g. when the number of samples in a mini-batch is insufficient.

After exploring recent work on contrastive learning, we found that a memory bank and data augmentation can be used to reduce the risk of contrastive learning failure. The memory bank [9] compensates for the shortage of samples in a mini-batch, while data augmentation widens the gap between sample representations and helps the model learn instance-level invariant features.

$$\begin{aligned} P^{t}_{i} = momentum \times p^{hs}_{i} + (1-momentum) \times p^{ns}_{i} \end{aligned}$$
(1)

where \(p^{ns}_{i}\) is the contrast feature obtained through the projection head, and \(p^{hs}_{i}\) is the historical feature already stored in the memory bank. After obtaining the corresponding features, we compute the feature similarity in the contrast space.
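A minimal sketch of the momentum update in Eq. (1) is given below; the bank layout, indexing scheme, and momentum value are illustrative assumptions, and the re-normalization step is our addition for numerical convenience.

```python
import torch

def update_memory_bank(bank: torch.Tensor, indices: torch.Tensor,
                       p_new: torch.Tensor, momentum: float = 0.9) -> torch.Tensor:
    """Eq. (1): blend the historical features p^{hs} stored in the bank with the
    freshly projected features p^{ns} of the current mini-batch."""
    p_hist = bank[indices]                                # p^{hs}_i
    p_mix = momentum * p_hist + (1.0 - momentum) * p_new  # momentum blend, Eq. (1)
    # re-normalize so cosine similarity stays well defined (our addition)
    bank[indices] = torch.nn.functional.normalize(p_mix, dim=1)
    return bank
```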

$$\begin{aligned} \begin{aligned} sim = \frac{\sum _{i=1}^{N}(P^{s}_{i} \times \sum _{i=1}^{n}proj(f^{t}_{i}))}{\sqrt{\sum _{i=1}^{N}(P^{s}_{i})^{2}} \times \sqrt{\sum _{i=1}^{n}(proj(f^{t}_{i}))^{2}} } \end{aligned} \end{aligned}$$
(2)

Note that in Eq. 2 we involve samples from both source and target domains and complete instance-level alignment. We use contrastive learning to relate samples of the same class whether they are in the same domain or not, which enables knowledge transfer across domains.

We take the sample with the largest feature similarity as the positive sample. We do the same operation on the augmented target domain samples to meet the requirement of discriminating positive and negative samples.

$$\begin{aligned} \begin{aligned} Sim_{p1} = \mathop {\arg \max }\limits _{D} \ sim(P^{s}, f_{Au}) \end{aligned} \end{aligned}$$
(3)
$$\begin{aligned} \begin{aligned} Sim_{p2} = \mathop {\arg \max }\limits _{D} \ sim(P^{s}, f_{t}) \end{aligned} \end{aligned}$$
(4)
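The sketch below illustrates how Eqs. (2)-(4) could be realized, computing cosine similarities between the source features stored in the memory bank and the (augmented) target features and keeping the most similar entry as the positive; tensor shapes and helper names are assumptions.

```python
import torch
import torch.nn.functional as F

def select_positives(bank_src: torch.Tensor, feat_tgt: torch.Tensor,
                     feat_aug: torch.Tensor):
    """Sketch of Eqs. (2)-(4): cosine similarity between the source features in the
    memory bank (shape [M, D]) and the target / augmented-target features
    (shape [B, D]); the most similar bank entry is taken as the positive."""
    sim_t = F.cosine_similarity(feat_tgt.unsqueeze(1), bank_src.unsqueeze(0), dim=2)
    sim_a = F.cosine_similarity(feat_aug.unsqueeze(1), bank_src.unsqueeze(0), dim=2)
    # arg max over the bank dimension picks the positive pair
    sim_p2, idx_t = sim_t.max(dim=1)   # Eq. (4): target vs. source bank
    sim_p1, idx_a = sim_a.max(dim=1)   # Eq. (3): augmented target vs. source bank
    return sim_p1, sim_p2, idx_a, idx_t
```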

Based on the above, we can get the final cross-domain contrastive learning loss.

$$\begin{aligned} \begin{aligned} L_{CD} = -\log \frac{\exp (S_{p1}/\tau ) + \exp (S_{p2}/\tau )}{\exp ((S_{p1} + S_{p2})/\tau ) + \sum _{j = 1}^{2N-1}{\exp (N_{2n}/\tau )}} \end{aligned} \end{aligned}$$
(5)

where \(S_{p1}\) and \(S_{p2}\) are the positive sample pairs and \(N_{2n}\) denotes a negative sample pair. Compared with the N-pair loss, our contrastive formulation takes knowledge transfer for the UDA task into account. At the same time, the source domain samples in the memory bank are used as anchors to realize instance-level alignment between the source and target domains. Through Eq. (5), we obtain a compact representation.
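A hedged sketch of Eq. (5) follows; it assumes the similarities of the remaining bank entries are collected as the negative terms, and the temperature value is illustrative.

```python
import torch

def cross_domain_contrastive_loss(sim_p1: torch.Tensor, sim_p2: torch.Tensor,
                                  sim_neg: torch.Tensor, tau: float = 0.07):
    """Sketch of Eq. (5). sim_p1 / sim_p2 are the positive similarities from
    Eqs. (3)-(4) (shape [B]); sim_neg holds the similarities to the negative bank
    entries (shape [B, 2N-1]); tau is the temperature (value is an assumption)."""
    pos = torch.exp(sim_p1 / tau) + torch.exp(sim_p2 / tau)                  # numerator
    denom = torch.exp((sim_p1 + sim_p2) / tau) + torch.exp(sim_neg / tau).sum(dim=1)
    return (-torch.log(pos / denom)).mean()
```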

3.3 Energy Transfer

We explicitly aim to address the problem of domain distribution alignment in UDA from another perspective. Studies on out-of-distribution detection have clearly indicated that out-of-distribution samples have higher energy [28]; the purpose of our energy transfer is to encourage the classifier to bring the target domain data closer to the source domain distribution.

We first introduce the definition of the energy model. The essence of energy-based models (EBMs) is to construct a function E(x) that maps each point in the input space to a non-probabilistic scalar called energy [15].

With the Gibbs distribution we can convert the energy into a probability density and get the Gibbs free energy E(x) for any point as:

$$\begin{aligned} \begin{aligned} E(x) = -T\log \int _{y'} e^{-E(x,y')/T}\,dy' \end{aligned} \end{aligned}$$
(6)

where T is the temperature parameter. We can easily associate the classification model with the energy model, and get the free energy for x as:

$$\begin{aligned} \begin{aligned} E(x,f) = -T\log \sum _{i=1}^{N}e^{f_{i}(x)/T} \end{aligned} \end{aligned}$$
(7)

Note that the energy here has nothing to do with the label of the data; it can be regarded as a kind of norm of the output vector \(f_{i}(x)\). Following the thermodynamic definition of internal energy, which combines entropy and free energy [32], the internal energy of a system can be expressed as:

$$\begin{aligned} \begin{aligned} U = E + T^{'}G \end{aligned} \end{aligned}$$
(8)

where \(T^{'}\) is a temperature hyperparameter, E is the free energy, G is the entropy, and U is the internal energy of the system. Since the classifier and feature extractor share parameters and weights, we assume that the internal energy of the two systems, the source and target domains, is not affected by external factors.
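The following sketch computes the free energy of Eq. (7) via a log-sum-exp over classifier logits and the internal energy of Eq. (8); treating G as the entropy of the softmax distribution is our reading of the text, and the temperature values are placeholders.

```python
import torch

def free_energy(logits: torch.Tensor, T: float = 1.0) -> torch.Tensor:
    """Eq. (7): E(x, f) = -T * log sum_i exp(f_i(x) / T), computed per sample."""
    return -T * torch.logsumexp(logits / T, dim=1)

def internal_energy(logits: torch.Tensor, T: float = 1.0,
                    T_prime: float = 1.0) -> torch.Tensor:
    """Eq. (8): U = E + T' * G, with G taken as the entropy of the softmax
    distribution (our reading; T and T' values are assumptions)."""
    E = free_energy(logits, T)
    probs = torch.softmax(logits, dim=1)
    G = -(probs * torch.log(probs.clamp_min(1e-8))).sum(dim=1)  # entropy per sample
    return E + T_prime * G
```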

However, if energy transfer between the source and target domains is performed directly through the internal energy of the system, two problems arise. First, we cannot guarantee that the energy transfer directions of the source and target domains are consistent. Second, the energy discrepancy between the source and target mini-batches may be too large, making it difficult for the loss function to converge. We therefore constrain the energy transfer loss as follows:

$$\begin{aligned} \begin{aligned} L_{1} = -\mathbb {E}_{D_{s}}X_{s} + \mathbb {E}_{D_{t}}X_{t} \end{aligned} \end{aligned}$$
(9)
$$\begin{aligned} \begin{aligned} L_{2} = \mathbbm {1}(|U_{s} - U_{t}| < \beta ) \end{aligned} \end{aligned}$$
(10)

To address the above issues, we use the \(L_{1}\) proposed in [7] to reduce the range represented by the probability distribution in each mini-batch. The purpose of \(L_{2}\) is to alleviate the negative optimization caused by the large energy discrepancy.

In SimCLR [8], Hinton et al. effectively improved the performance of unsupervised learning through a simple projection head, and in [30], Wang et al. demonstrate in detail that an MLP can effectively improve the representation ability of samples. To let the internal energy function U better represent the two different systems, the source domain and the target domain, we combine features of different dimensions through MLP layers to obtain a better expression. It can be expressed as:

$$\begin{aligned} \begin{aligned} x_{i} = \sum _{i=1}^{K}MLP_{i}(f_{i}) \end{aligned} \end{aligned}$$
(11)

where the \(f_{i}\) are features of different dimensions. Integrating features of different dimensions yields a better distributional representation and focuses more effectively on domain-invariant features [7]. The energy expression and the energy transfer loss function can then be summarized as follows:

$$\begin{aligned} \begin{aligned} E(x,x_{i}) = -T\log \sum _{i=1}^{N}e^{x_{i}/T} \end{aligned} \end{aligned}$$
(12)
$$\begin{aligned} \begin{aligned} L_{Trans} = \mathbbm {1}(|U_{s}-U_{t}|<\beta ) + L_{1} \end{aligned} \end{aligned}$$
(13)
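A possible implementation of the energy transfer loss is sketched below, reusing the free_energy and internal_energy helpers from the previous sketch. We read the indicator in Eqs. (10) and (13) as a gate that suppresses the free-energy gap \(L_{1}\) when the internal-energy discrepancy exceeds \(\beta \); the values of \(\beta \), T, and T' are illustrative, and the sketch is applied to classifier logits for simplicity, whereas Eq. (12) uses the fused features \(x_{i}\).

```python
import torch
# free_energy and internal_energy are the helpers sketched after Eq. (8).

def energy_transfer_loss(logits_src: torch.Tensor, logits_tgt: torch.Tensor,
                         beta: float = 1.0, T: float = 1.0,
                         T_prime: float = 1.0) -> torch.Tensor:
    """Sketch of Eqs. (9), (10), and (13) under our gating interpretation."""
    E_s = free_energy(logits_src, T)                       # per-sample free energy
    E_t = free_energy(logits_tgt, T)
    L1 = -E_s.mean() + E_t.mean()                          # Eq. (9)
    U_s = internal_energy(logits_src, T, T_prime).mean()   # Eq. (8), source system
    U_t = internal_energy(logits_tgt, T, T_prime).mean()   # Eq. (8), target system
    gate = (torch.abs(U_s - U_t) < beta).float()           # indicator of Eq. (10)
    return gate * L1                                       # gated transfer loss
```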

Finally, we use a simple cross-entropy loss \(L_{cls}\) to guarantee classification accuracy in the source domain, while using the domain adversarial loss \(L_{Da}\) [4] as a preliminary transfer objective on top of the classification loss.

Our energy transfer contrast network effectively learns a distinctive feature representation and uses it to achieve knowledge transfer between the source and target domains. Based on this, our overall loss function is as follows:

$$ L = L_{cls} + L_{Da} + a \times L_{CD} +b \times L_{Trans} $$
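Assembling the overall objective is then a weighted sum of the four terms; in the sketch below the trade-off weights a and b are hyperparameters whose values are not fixed here.

```python
import torch

def total_objective(L_cls: torch.Tensor, L_Da: torch.Tensor, L_CD: torch.Tensor,
                    L_Trans: torch.Tensor, a: float = 1.0, b: float = 1.0):
    """Weighted sum of the four loss terms; a and b are assumed trade-off weights."""
    return L_cls + L_Da + a * L_CD + b * L_Trans
```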

4 Experiments

4.1 Datasets and Criteria

OfficeHome.  [10] contains 4 domains, each with 65 categories, and is among the most commonly used datasets for UDA tasks.

Office31.  [11] contains 3 domains, each with 31 categories, and about 4,100 images in total.

Criteria.  Following [1, 20], we select a pair of domains (e.g. A2P, P2C) from the dataset for each training run and use classification accuracy to judge the quality of the model. Finally, we use the average accuracy over all domain pairs as the criterion for evaluating the algorithm.

4.2 Implementation Details

For a fair comparison, following [7, 13, 19], we use a ResNet-50 [12] pretrained on ImageNet [34] as the backbone for UDA. An SGD optimizer with momentum 0.9 is used to train all UDA tasks. The learning rate is adjusted by \(l = l_{1}(1+\alpha \beta )^{\gamma }\), where \(l_{1}=0.01\), \(\alpha =10\), \(\gamma =0.75\), and \(\beta \) varies linearly from 0 to 1 over the training epochs.

4.3 Comparison with State of the Art

Table 2 shows the performance of different methods on the Office-Home dataset under the UDA setting. The experiments cover 12 domain pairs, and we list the average score on this dataset in the rightmost column. From the table we observe that our accuracy is at least 0.7% higher than the other baselines; even compared with MetaAlign, which builds further on GVB, our average accuracy is still improved by 0.3%. Similarly, as shown in Table 1, on the Office31 dataset our average accuracy is on par with the state of the art.

Table 1. Accuracy(%) on Office-31 for unsupervised domain adaptation (ResNet-50).
Table 2. Accuracy (%) of different UDA methods on Office-Home with ResNet-50 as backbone. Best in bold.

4.4 Ablation Studies

Feature Visualization. To demonstrate that our approach achieves intra-class compactness for samples across domains, we use t-SNE [24] to reduce the sample dimensionality and visualize our method and HDA. We randomly selected 11 categories from the P-R domain pair of Office-Home, where the same color represents the same label. As shown in Fig. 3, we reach the effect that HDA attains after 7500 iterations with only 1500 iterations, and the intra-class compactness improves with more iterations.

Fig. 3. t-SNE visualization results of HDA and our method, which demonstrate that we achieve intra-class compactness.

The Effect of the Number of MLP Modules. The energy transfer network is the most important part of our model. By setting up multiple MLP modules for the energy transfer network, we can effectively represent features of different dimensions as energy, providing better generalization ability for the whole model. In this ablation experiment, we select four groups of UDA tasks with different source domains to explore the effect of the number of energy transfer networks on the model. The experimental results are shown in Fig. 4. Based on these experiments, we choose N = 3 as the default number of MLP modules.

Fig. 4. Influence of the number of energy transfer networks on knowledge transfer from the source domain to the target domain.

Selecting Positive Samples by Thresholding in Contrastive Learning. In the course of our contrastive learning experiments, a natural alternative is to determine positive and negative sample pairs through a threshold \(\tau \): when the similarity is greater than \(\tau \) the pair is treated as positive, otherwise as negative. However, thresholding has a fatal problem: the optimal threshold differs across domain pairs. Figure 5 shows the effect of different thresholds on two UDA tasks.

Fig. 5. (a) The probability of learning true positive pairs under different threshold settings; (b) the accuracy obtained with different thresholds. As can be seen, different domain pairs call for different thresholds.

5 Conclusion

In this paper, we propose an energy representation to further improve the accuracy of UDA tasks. Specifically, we extract information from features of different dimensions through multiple MLP layers, represent the difference between the source and target domains through a combination of entropy and free energy, and mitigate the effect of domain shift by reducing this gap. At the same time, we achieve instance-level alignment across domains through contrastive learning. Furthermore, our method is compared with many previous state-of-the-art methods on two datasets, which demonstrates its effectiveness.