编程 Python

对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解

Posted in Python onFebruary 11, 2020

在用tensorflow做一维的卷积神经网络的时候会遇到tf.nn.conv1d和layers.conv1d这两个函数，但是这两个函数有什么区别呢，通过计算得到一些规律。

1.关于tf.nn.conv1d的解释，以下是Tensor Flow中关于tf.nn.conv1d的API注解：

Computes a 1-D convolution given 3-D input and filter tensors.

Given an input tensor of shape [batch, in_width, in_channels] if data_format is "NHWC", or [batch, in_channels, in_width] if data_format is "NCHW", and a filter / kernel tensor of shape [filter_width, in_channels, out_channels], this op reshapes the arguments to pass them to conv2d to perform the equivalent convolution operation.

Internally, this op reshapes the input tensors and invokes `tf.nn.conv2d`. For example, if `data_format` does not start with "NC", a tensor of shape [batch, in_width, in_channels] is reshaped to [batch, 1, in_width, in_channels], and the filter is reshaped to [1, filter_width, in_channels, out_channels]. The result is then reshaped back to [batch, out_width, out_channels] whereoutwidthisafunctionofthestrideandpaddingasinconv2dwhereoutwidthisafunctionofthestrideandpaddingasinconv2d and returned to the caller.

Args: value: A 3D `Tensor`. Must be of type `float32` or `float64`. filters: A 3D `Tensor`. Must have the same type as `input`. stride: An `integer`. The number of entries by which the filter is moved right at each step. padding: 'SAME' or 'VALID' use_cudnn_on_gpu: An optional `bool`. Defaults to `True`. data_format: An optional `string` from `"NHWC", "NCHW"`. Defaults to `"NHWC"`, the data is stored in the order of [batch, in_width, in_channels]. The `"NCHW"` format stores data as [batch, in_channels, in_width]. name: A name for the operation (optional).

Returns:

A `Tensor`. Has the same type as input.

Raises:

ValueError: if `data_format` is invalid.

什么意思呢？就是说conv1d的参数含义：(以NHWC格式为例，即，通道维在最后)

1、value：在注释中，value的格式为：[batch, in_width, in_channels]，batch为样本维，表示多少个样本，in_width为宽度维，表示样本的宽度，in_channels维通道维，表示样本有多少个通道。事实上，也可以把格式看作如下:[batch, 行数, 列数]，把每一个样本看作一个平铺开的二维数组。这样的话可以方便理解。

2、filters：在注释中，filters的格式为：[filter_width, in_channels, out_channels]。按照value的第二种看法，filter_width可以看作每次与value进行卷积的行数，in_channels表示value一共有多少列（与value中的in_channels相对应）。out_channels表示输出通道，可以理解为一共有多少个卷积核，即卷积核的数目。

3、stride：一个整数，表示步长，每次（向下）移动的距离（TensorFlow中解释是向右移动的距离，这里可以看作向下移动的距离）。

4、padding：同conv2d，value是否需要在下方填补0。

5、name：名称。可省略。

首先从参数列表可以看出value指的输入的数据，stride就是卷积的步长，这里我们最有疑问的就是filters这个参数，那么我们对filter进行简单的说明。从上面可以看到filters的格式为：[filter_width, in_channels, out_channels],这是一个数组的维度，对应的是卷积核的大小，输入的channel的格式，和卷积核的个数，下面我们用例子说明问题：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])
  # filter width, filter channels and out channels(number of kernels)
  cov1 = tf.nn.conv1d(inputs, w, stride=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov1)
    print(out)

其输出为：

[[[ 5.],
    [ 8.],
    [11.],
    [14.]]]

我们分析一下，输入的数据为[[[1],[2],[3],[4],[5]]],有5个特征，分别对应的数值为1，2，3，4，5，那么经过卷积的结果为5，8，11，14，那么这个结果是怎么来的呢，我们根据卷积的计算，可以得到5 = 1*1 + 2*2， 8=2*1+ 3*2， 11 = 3*1+4*2， 14=4*1+5*2，也就是W1=1， W2=2，正好和我们先面filters设置的数值相等，

w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])

所以可以看到这个filtes设置的是是卷积核矩阵的，换句话说，卷积核矩阵我们是可以设置的。

2. 1.关于tf.layers.conv1d,函数的定义如下

tf.layers.conv1d(
 
inputs,
 
filters,
 
kernel_size,
 
strides=1,
 
padding='valid',
 
data_format='channels_last',
 
dilation_rate=1,
 
activation=None,
 
use_bias=True,
 
kernel_initializer=None,
 
bias_initializer=tf.zeros_initializer(),
 
kernel_regularizer=None,
 
bias_regularizer=None,
 
activity_regularizer=None,
 
kernel_constraint=None,
 
bias_constraint=None,
 
trainable=True,
 
name=None,
 
reuse=None
 
)

比较重要的几个参数是inputs, filters, kernel_size，下面分别说明

inputs : 输入tensor，维度(None, a, b) 是一个三维的tensor

None ：一般是填充样本的个数，batch_size

a ：句子中的词数或者字数

b : 字或者词的向量维度

filters : 过滤器的个数

kernel_size : 卷积核的大小，卷积核其实应该是一个二维的，这里只需要指定一维，是因为卷积核的第二维与输入的词向量维度是一致的，因为对于句子而言，卷积的移动方向只能是沿着词的方向，即只能在列维度移动。一个例子：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  cov2 = tf.layers.conv1d(inputs, filters=1, kernel_size=2, strides=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov2)
    print(out)

输出结果：

[[[-1.9953331]
 [-3.5520997]
 [-5.108866 ]
 [-6.6656327]]]

也许你得到的结果和我得到的结果不同，因为在这个函数里面只是设置了卷积核的尺寸和步长，没有设置具体的卷积核矩阵，所以这个卷积核矩阵是随机生成的，就会出现可能运行上面的程序出现不同结果的情况。

以上这篇对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持三水点靠木。

对tensorflow中tf.nn.conv1d和layers.conv1d的区别详解

- Author -

孤独的猿行客

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

Django中实现一个高性能计数器(Counter)实例

Jul 09 Python

python中pygame模块用法实例

Oct 09 Python

python安装教程 Pycharm安装详细教程

May 02 Python

Python之str操作方法(详解)

Jun 19 Python

机器学习python实战之决策树

Nov 01 Python

Python Requests模拟登录实现图书馆座位自动预约

Apr 27 Python

python3获取当前目录的实现方法

Jul 29 Python

python实现翻转棋游戏（othello）

Jul 29 Python

tensorflow通过模型文件,使用tensorboard查看其模型图Graph方式

Jan 23 Python

jupyter 实现notebook中显示完整的行和列

Apr 09 Python

Python爬虫基础之简单说一下scrapy的框架结构

Jun 26 Python

用Python实现屏幕截图详解

Jan 22 Python

python中文分词库jieba使用方法详解

Feb 11 #Python

Transpose 数组行列转置的限制方式

Feb 11 #Python

Tensorflow:转置函数 transpose的使用详解

Feb 11 #Python

tensorflow多维张量计算实例

Feb 11 #Python

python误差棒图errorbar()函数实例解析

Feb 11 #Python

解决Python3.8用pip安装turtle-0.0.2出现错误问题

Feb 11 #Python

python scatter函数用法实例详解

Feb 11 #Python

You might like

php仿ZOL分页类代码

2008/10/02 PHP

php中3种方法统计字符串中每种字符的个数并排序

2012/08/27 PHP

简单介绍PHP的责任链编程模式

2015/08/11 PHP

php将数组存储为文本文件方法汇总

2015/10/28 PHP

PHP实现小偷程序实例

2016/10/31 PHP

使用jQuery获得内容以及内容的属性

2015/02/26 Javascript

Node.js本地文件操作之文件拷贝与目录遍历的方法

2016/02/16 Javascript

js基本算法:冒泡排序,二分查找的简单实例

2016/10/08 Javascript

浅析javaScript中的浅拷贝和深拷贝

2017/02/15 Javascript

老生常谈Bootstrap媒体对象

2017/07/06 Javascript

React应用中使用Bootstrap的方法

2017/08/15 Javascript

Vuex modules模式下mapState/mapMutations的操作实例

2019/10/17 Javascript

Vue 修改网站图标的方法

2020/12/31 Vue.js

Python sklearn KFold 生成交叉验证数据集的方法

2018/12/11 Python

Python中三元表达式的几种写法介绍

2019/03/04 Python

Django框架视图介绍与使用详解

2019/07/18 Python

Python实现手机号自动判断男女性别(实例解析)

2019/12/22 Python

Python 读取xml数据,cv2裁剪图片实例

2020/03/10 Python

Django中Aggregation聚合的基本使用方法

2020/07/09 Python

咖啡为什么会有酸味？你喝到的咖啡為什麼是酸的？

2021/03/17 冲泡冲煮

澳大利亚天然护肤品、化妆品和健康产品一站式商店：Nourished Life

2018/12/02 全球购物

Myprotein亚太地区：欧洲第一在线运动营养品牌

2020/12/20 全球购物

介绍一下内联、左联、右联

2013/12/31 面试题

写好自荐信的几个要点

2013/12/26 职场文书

教学大赛获奖感言

2014/01/15 职场文书

导师评语大全

2014/04/26 职场文书

体育课外活动总结

2014/07/08 职场文书

2014年办公室个人工作总结

2014/11/12 职场文书

慈善献爱心倡议书

2015/04/27 职场文书

紫日观后感

2015/06/05 职场文书

遗愿清单观后感

2015/06/09 职场文书

同事欢送会致辞

2015/07/31 职场文书

你为什么是穷人？可能是这5个缺点造成

2019/07/11 职场文书

MySql 8.0及对应驱动包匹配的注意点说明

2021/06/23 MySQL

Python+Tkinter打造签名设计工具

2022/04/01 Python

Vue+Flask实现图片传输功能

2022/04/01 Vue.js