max pooling in caffe

我们来看max pooling 在caffe 中怎么实现的吧

reshape

首先 reshap的时候：

  // If max pooling, we will initialize the vector index part.
  if (this->layer_param_.pooling_param().pool() ==
      PoolingParameter_PoolMethod_MAX && top.size() == 1) {
    max_idx_.Reshape(bottom[0]->num(), channels_, pooled_height_,
        pooled_width_);
  }

如是max pooling 则需要reshape max_idx 用来记录每次max pooling是提取哪个地方的位置。
大小为num×channel×pooled_height×pooled_width

forward

再看forward：

case PoolingParameter_PoolMethod_MAX:
    // Initialize 如果top有两个分支，就有top_mask 没研究这个。遇到再说，目前是进else分支
    if (use_top_mask) {
      top_mask = top[1]->mutable_cpu_data();
      caffe_set(top_count, Dtype(-1), top_mask);
    } else {
    //get 到 max_idx_的指针
      mask = max_idx_.mutable_cpu_data();
      caffe_set(top_count, -1, mask);
    }
    //top_data 全部变成大浮点数的相反数。方便后面的取max运算
    caffe_set(top_count, Dtype(-FLT_MAX), top_data);
    // The main loop 找最大值
    for (int n = 0; n < bottom[0]->num(); ++n) {
      for (int c = 0; c < channels_; ++c) {
        for (int ph = 0; ph < pooled_height_; ++ph) {
          for (int pw = 0; pw < pooled_width_; ++pw) {
            int hstart = ph * stride_h_ - pad_h_;
            int wstart = pw * stride_w_ - pad_w_;
            int hend = min(hstart + kernel_h_, height_);
            int wend = min(wstart + kernel_w_, width_);
            hstart = max(hstart, 0);
            wstart = max(wstart, 0);
            const int pool_index = ph * pooled_width_ + pw;
            for (int h = hstart; h < hend; ++h) {
              for (int w = wstart; w < wend; ++w) {
                const int index = h * width_ + w;
                if (bottom_data[index] > top_data[pool_index]) {
                  top_data[pool_index] = bottom_data[index];
                  if (use_top_mask) {
                    top_mask[pool_index] = static_cast<Dtype>(index);
                  } else {
                    mask[pool_index] = index;
                  }
                }
              }
            }
          }
        }
        // compute offset 移动指针位置
        bottom_data += bottom[0]->offset(0, 1);
        top_data += top[0]->offset(0, 1);
        if (use_top_mask) {
          top_mask += top[0]->offset(0, 1);
        } else {
          mask += top[0]->offset(0, 1);
        }
      }
    }
    break;

其中offset函数是这样定义的：

  inline int offset(const int n, const int c = 0, const int h = 0,
      const int w = 0) const {
    CHECK_GE(n, 0);
    CHECK_LE(n, num());
    CHECK_GE(channels(), 0);
    CHECK_LE(c, channels());
    CHECK_GE(height(), 0);
    CHECK_LE(h, height());
    CHECK_GE(width(), 0);
    CHECK_LE(w, width());
    return ((n * channels() + c) * height() + h) * width() + w;
  }

带入的都是0，1 也就是平移height timeswidth大小

backward

case PoolingParameter_PoolMethod_MAX:
    // The main loop
    if (use_top_mask) {
      top_mask = top[1]->cpu_data();
    } else {
      mask = max_idx_.cpu_data();
    }
    for (int n = 0; n < top[0]->num(); ++n) {
      for (int c = 0; c < channels_; ++c) {
        for (int ph = 0; ph < pooled_height_; ++ph) {
          for (int pw = 0; pw < pooled_width_; ++pw) {
            const int index = ph * pooled_width_ + pw;
            //找到对应位置 把上层的梯度加上去就好了
            const int bottom_index =
                use_top_mask ? top_mask[index] : mask[index];
            bottom_diff[bottom_index] += top_diff[index];
          }
        }
        bottom_diff += bottom[0]->offset(0, 1);
        top_diff += top[0]->offset(0, 1);
        if (use_top_mask) {
          top_mask += top[0]->offset(0, 1);
        } else {
          mask += top[0]->offset(0, 1);
        }
      }
    }
    break;

本文链接：https://blog.csdn.net/Love_wanling/article/details/78588898

智能推荐

矩阵max_pooling 二维矩阵滑动窗口

题目链接题目大意给定M×N矩阵，求经过给定size为A×B的最大池化处理后结果 M, N <= 2e3, A <= M, B <= N 直接上二维线段树可能被卡常了。这里用滑动窗口，先对每行使用滑动窗口，再对得到的数组的每列使用滑动窗口即可。...

笔试题——max pooling滑动窗口实现(python 代码)

题目输入：从控制台获取n,m,a,b;其中n*m为矩阵大小，a*b为滑动窗口大小矩阵中的值，通过(i*j)mod 10 得到，在滑动过程中，需要获得每次滑动窗口中的最大值，并存储下来输出：所有最大值的和要求及思路纯暴力求解法，时间复杂度过高，需要使用滑动窗口方法求解题目为2维矩阵，所以需要对行和列依次使用滑动窗口方法即可不了解滑动窗口的可以参考一维滑动窗口这篇文章源码 ...

tensorflow 池化操作实例 tf.nn.max_pooling

输出： The shape of x: (1, 4, 4, 1) [[ 4. 3. 1. 8.] [ 7. 2. 6. 3.] [ 2. 0. 1. 1.] [ 3. 4. 2. 5.]] The shape pf y:...

category-wise max-pooling 操作案例理解

案例：输出结果：...

Caffe源码精读 - 4 - Caffe Layers之pooling_layer(池化层)

Class_4 Caffe Layers之pooling_layer(池化层) 1. 概述池化是卷积神经网络中较为常用的一种操作，根本目的是实现降采样，简化计算。目前池化层从作用面区分，可分为全局池化和局部池化。全局池化是相当于在整张图上做池化，每一张特征图最终得到一个池化值，即H*W*C的特征层，经过全局池化以后得到的是1*1*C的池化输出。局部池化就是指定Feature ma...

代码先锋网代码片段及技术文章聚合

max pooling in caffe

reshape

forward

backward

智能推荐

矩阵max_pooling 二维矩阵滑动窗口

笔试题——max pooling滑动窗口实现(python 代码)

tensorflow 池化操作实例 tf.nn.max_pooling

category-wise max-pooling 操作案例理解

Caffe源码精读 - 4 - Caffe Layers之pooling_layer(池化层)

猜你喜欢

dynamic k-max pooling 动态k-max 池化

Install caffe in Ubuntu

lstm in caffe

Normalize Layer in Caffe

caffe in python ---Classification

相关文章

热门文章

推荐文章

相关标签

代码先锋网 代码片段及技术文章聚合

max pooling in caffe

reshape

forward

backward

智能推荐

猜你喜欢

相关文章

热门文章

推荐文章

相关标签

代码先锋网代码片段及技术文章聚合