!2677 metric_learn_loss

Merge pull request !2677 from gitsYu/master

!2677 metric_learn_loss
Merge pull request !2677 from gitsYu/master
5cd93e4d · i-robot · Gitee · 2300b506 · 7db5ff4c · 5cd93e4d
Unverified Commit 5cd93e4d authored 3 years ago by i-robot Committed by Gitee 3 years ago
--- a/research/cv/metric_learn/README_CN.md
+++ b/research/cv/metric_learn/README_CN.md
@@ -30,7 +30,7 @@
 如下为MindSpore使用Triplet loss和Quadruptlet loss在SOP数据集调优ResNet50的示例，Triplet loss可参考[论文1](https://arxiv.org/abs/1503.03832)，Quadruptlet loss是Triplet loss的一个变体，可参考[论文2](https://arxiv.org/abs/1704.01719)。
-为了训练度量学习模型，我们需要一个神经网络模型作为骨架模型（ResNet50）和度量学习代价函数来进行优化。残差神经网络（ResNet）由微软研究院何凯明等五位华人提出，效果非常显著。整个网络只需要学习输入和输出的差异部分，简化了学习目标和难度。ResNet的结构大幅提高了神经网络训练的速度，并且大大提高了模型的准确率。正因如此，ResNet十分受欢迎，经常被各个领域用作backbone网络，在这选择ResNet-50结构作为度量学习的主干网络。我们首先使用softmax来进行预训练，然后使用其它的代价函数来进行微调，例如：triplet，quadruplet。下面就是先在SOP数据集上预训练个pretrain模型，然后用triplet，quadruplet代价函数来微调从softmax得到的pretrain模型，使用8卡Ascend 910训练网络模型，仅需30个周期，就可以在SOP数据集的5184种类别上，TOP1准确率达到了73.9%和74.3%。
+为了训练度量学习模型，我们需要一个神经网络模型作为骨架模型（ResNet50）和度量学习代价函数来进行优化。残差神经网络（ResNet）由微软研究院何凯明等五位华人提出，效果非常显著。整个网络只需要学习输入和输出的差异部分，简化了学习目标和难度。ResNet的结构大幅提高了神经网络训练的速度，并且大大提高了模型的准确率。正因如此，ResNet十分受欢迎，经常被各个领域用作backbone网络，在这选择ResNet-50结构作为度量学习的主干网络。我们首先加载ResNet-50-ImageNet[模型权重](https://www.mindspore.cn/resources/hub/details/en?MindSpore/ascend/1.3/resnet50_v1.3_imagenet2012)作为预训练模型，然后修改分类层使用softmax函数在SOP数据集上对模型进行微调，最后利用度量学习损失（例如：triplet，quadruplet）进一步finetune模型。下面就是在SOP数据集上分别使用softmax、triplet和quadruplet代价函数微调的结果，使用8卡Ascend 910训练网络模型，仅需30个周期，就可以在SOP数据集的5184种类别上，TOP1准确率达到了73.9%和74.3%。
 ## 论文

--- a/research/cv/metric_learn/src/loss.py
+++ b/research/cv/metric_learn/src/loss.py
@@ -18,10 +18,11 @@ import mindspore
 import mindspore.nn as nn
 from mindspore import Tensor
 from mindspore.common import dtype as mstype
+from mindspore.nn.loss.loss import LossBase
 from mindspore.ops import operations as P
 from mindspore.ops import functional as F
-class Softmaxloss():
+class Softmaxloss(LossBase):
    """Softmaxloss"""
    def __init__(self, sparse=True, smooth_factor=0.1, num_classes=5184):
        super(Softmaxloss, self).__init__()
@@ -37,7 +38,7 @@ class Softmaxloss():
        loss = self.ce(logit, label)
        return loss
-class Tripletloss():
+class Tripletloss(LossBase):
    """Tripletloss"""
    def __init__(self, margin=0.1):
        super(Tripletloss, self).__init__()
@@ -92,7 +93,7 @@ def generate_index(batch_size, samples_each_class):
    res = np.array(res).astype(np.int32)
    return res
-class Quadrupletloss():
+class Quadrupletloss(LossBase):
    """Quadrupletloss"""
    def __init__(self, train_batch_size=30, samples_each_class=2, margin=0.1):
        super(Quadrupletloss, self).__init__()