【DA】 Self-Loss methods in DA & DG

IAST: Instance Adaptive Self-Training for UDAS -ECCV20

Paper: Instance Adaptive Self-Training for Unsupervised Domain Adaptation
핵심 정리
- 최적의 방법으로 Threshold per class 를 만드는 방법을 제안한다.
- TopK는 Instance specific한 Threshold라면, IAST는 Momentum으로 Threshold를 update하여 Global한 Threshold를 만들고자 한다.
- Threshold per class 보다 높은 곳은 Confident region 아닌 곳은 Ignored region이다.
- Confident region은 Entropy maximization 같은 Smoothing Loss를 걸어준다.
- Ignored region은 Entropy minimization을 적용한 Sharpening Loss를 걸어준다.
초반 learning
- 처음에 Adversarial learning 으로 warn up한다
- 그 다음 Figure4의 Constant Threshold Self-loss로 다시 warn up

ISAT의 주요 Method 2개
1. (아래 그림 왼쪽) pseudo-label generation strategy with an instance adaptive selector
2. (아래 그림 오른쪽) the region-guided regularization
3. 학습을 반복하면서 M을 학습시키고, G(pseudo label generator)를 붙여넣음으로써 Pseudo Label 생성 모델을 Step-by-step으로 갱신했다.

Entropy Loss 문제점: 위 그래프와 같이 Easy sample에 대해서 훨씬 강한 gradient를 만든다. 이를 probability imbalance라고 한다. (the entropy minimization method will allow for adequate training of samples that are easy to transfer, which hinders the training process of sam- ples that are difficult to transfer)
Scaled H는 (논문 주황색 형광 필기) 추가 파라미터가 필요하다. 이것은 tricky to select 이다.
Maximum squas Loss: a more balanced gradient for different classes 를 만들어 낸다.
Image-wise Class-balanced Weighting Factor: 이미지 안에 Class 갯수를 사용해서 Loss를 Regularize한다.
Multi-level Self-produced Guidance: ResNet 중간에 ASPP를 하나 더 달고, Low-level output을 만들어 낸다. 그리고 Pseudo label은 high-level output + 자신의 output의 Ensemble 결과를 사용해 만들어 낸다. high-level 이 low-level 보다 더 정확할 것이라는 점을 이용해서, Low-level feature representation 능력을 향상시켜 high final performance를 유도한다.