编程 Python

Pytorch中Softmax和LogSoftmax的使用详解

Posted in Python onJune 05, 2021

一、函数解释

1.Softmax函数常用的用法是指定参数dim就可以：

（1）dim=0：对每一列的所有元素进行softmax运算，并使得每一列所有元素和为1。

（2）dim=1：对每一行的所有元素进行softmax运算，并使得每一行所有元素和为1。

class Softmax(Module):
    r"""Applies the Softmax function to an n-dimensional input Tensor
    rescaling them so that the elements of the n-dimensional output Tensor
    lie in the range [0,1] and sum to 1.
    Softmax is defined as:
    .. math::
        \text{Softmax}(x_{i}) = \frac{\exp(x_i)}{\sum_j \exp(x_j)}
    Shape:
        - Input: :math:`(*)` where `*` means, any number of additional
          dimensions
        - Output: :math:`(*)`, same shape as the input
    Returns:
        a Tensor of the same dimension and shape as the input with
        values in the range [0, 1]
    Arguments:
        dim (int): A dimension along which Softmax will be computed (so every slice
            along dim will sum to 1).
    .. note::
        This module doesn't work directly with NLLLoss,
        which expects the Log to be computed between the Softmax and itself.
        Use `LogSoftmax` instead (it's faster and has better numerical properties).
    Examples::
        >>> m = nn.Softmax(dim=1)
        >>> input = torch.randn(2, 3)
        >>> output = m(input)
    """
    __constants__ = ['dim']
 
    def __init__(self, dim=None):
        super(Softmax, self).__init__()
        self.dim = dim
 
    def __setstate__(self, state):
        self.__dict__.update(state)
        if not hasattr(self, 'dim'):
            self.dim = None
 
    def forward(self, input):
        return F.softmax(input, self.dim, _stacklevel=5)
 
    def extra_repr(self):
        return 'dim={dim}'.format(dim=self.dim)

2.LogSoftmax其实就是对softmax的结果进行log，即Log(Softmax(x))

class LogSoftmax(Module):
    r"""Applies the :math:`\log(\text{Softmax}(x))` function to an n-dimensional
    input Tensor. The LogSoftmax formulation can be simplified as:
    .. math::
        \text{LogSoftmax}(x_{i}) = \log\left(\frac{\exp(x_i) }{ \sum_j \exp(x_j)} \right)
    Shape:
        - Input: :math:`(*)` where `*` means, any number of additional
          dimensions
        - Output: :math:`(*)`, same shape as the input
    Arguments:
        dim (int): A dimension along which LogSoftmax will be computed.
    Returns:
        a Tensor of the same dimension and shape as the input with
        values in the range [-inf, 0)
    Examples::
        >>> m = nn.LogSoftmax()
        >>> input = torch.randn(2, 3)
        >>> output = m(input)
    """
    __constants__ = ['dim']
 
    def __init__(self, dim=None):
        super(LogSoftmax, self).__init__()
        self.dim = dim
 
    def __setstate__(self, state):
        self.__dict__.update(state)
        if not hasattr(self, 'dim'):
            self.dim = None
 
    def forward(self, input):
        return F.log_softmax(input, self.dim, _stacklevel=5)

二、代码示例

输入代码

import torch
import torch.nn as nn
import numpy as np
 
batch_size = 4
class_num = 6
inputs = torch.randn(batch_size, class_num)
for i in range(batch_size):
    for j in range(class_num):
        inputs[i][j] = (i + 1) * (j + 1)
 
print("inputs:", inputs)

得到大小batch_size为4，类别数为6的向量（可以理解为经过最后一层得到）

tensor([[ 1., 2., 3., 4., 5., 6.],
[ 2., 4., 6., 8., 10., 12.],
[ 3., 6., 9., 12., 15., 18.],
[ 4., 8., 12., 16., 20., 24.]])

接着我们对该向量每一行进行Softmax

Softmax = nn.Softmax(dim=1)
probs = Softmax(inputs)
print("probs:\n", probs)

得到

tensor([[4.2698e-03, 1.1606e-02, 3.1550e-02, 8.5761e-02, 2.3312e-01, 6.3369e-01],
[3.9256e-05, 2.9006e-04, 2.1433e-03, 1.5837e-02, 1.1702e-01, 8.6467e-01],
[2.9067e-07, 5.8383e-06, 1.1727e-04, 2.3553e-03, 4.7308e-02, 9.5021e-01],
[2.0234e-09, 1.1047e-07, 6.0317e-06, 3.2932e-04, 1.7980e-02, 9.8168e-01]])

此外，我们对该向量每一行进行LogSoftmax

LogSoftmax = nn.LogSoftmax(dim=1)
log_probs = LogSoftmax(inputs)
print("log_probs:\n", log_probs)

得到

tensor([[-5.4562e+00, -4.4562e+00, -3.4562e+00, -2.4562e+00, -1.4562e+00, -4.5619e-01],
[-1.0145e+01, -8.1454e+00, -6.1454e+00, -4.1454e+00, -2.1454e+00, -1.4541e-01],
[-1.5051e+01, -1.2051e+01, -9.0511e+00, -6.0511e+00, -3.0511e+00, -5.1069e-02],
[-2.0018e+01, -1.6018e+01, -1.2018e+01, -8.0185e+00, -4.0185e+00, -1.8485e-02]])

验证每一行元素和是否为1

# probs_sum in dim=1
probs_sum = [0 for i in range(batch_size)]
 
for i in range(batch_size):
    for j in range(class_num):
        probs_sum[i] += probs[i][j]
    print(i, "row probs sum:", probs_sum[i])

得到每一行的和，看到确实为1

0 row probs sum: tensor(1.)
1 row probs sum: tensor(1.0000)
2 row probs sum: tensor(1.)
3 row probs sum: tensor(1.)

验证LogSoftmax是对Softmax的结果进行Log

# to numpy
np_probs = probs.data.numpy()
print("numpy probs:\n", np_probs)
 
# np.log()
log_np_probs = np.log(np_probs)
print("log numpy probs:\n", log_np_probs)

得到

numpy probs:
[[4.26977826e-03 1.16064614e-02 3.15496325e-02 8.57607946e-02 2.33122006e-01 6.33691311e-01]
[3.92559559e-05 2.90064461e-04 2.14330270e-03 1.58369839e-02 1.17020354e-01 8.64669979e-01]
[2.90672347e-07 5.83831024e-06 1.17265590e-04 2.35534250e-03 4.73083146e-02 9.50212955e-01]
[2.02340233e-09 1.10474026e-07 6.03167746e-06 3.29318427e-04 1.79801770e-02 9.81684387e-01]]
log numpy probs:
[[-5.4561934e+00 -4.4561934e+00 -3.4561934e+00 -2.4561932e+00 -1.4561933e+00 -4.5619333e-01]
[-1.0145408e+01 -8.1454077e+00 -6.1454072e+00 -4.1454072e+00 -2.1454074e+00 -1.4540738e-01]
[-1.5051069e+01 -1.2051069e+01 -9.0510693e+00 -6.0510693e+00 -3.0510693e+00 -5.1069155e-02]
[-2.0018486e+01 -1.6018486e+01 -1.2018485e+01 -8.0184851e+00 -4.0184855e+00 -1.8485421e-02]]

验证完毕

三、整体代码

import torch
import torch.nn as nn
import numpy as np
 
batch_size = 4
class_num = 6
inputs = torch.randn(batch_size, class_num)
for i in range(batch_size):
    for j in range(class_num):
        inputs[i][j] = (i + 1) * (j + 1)
 
print("inputs:", inputs)
Softmax = nn.Softmax(dim=1)
probs = Softmax(inputs)
print("probs:\n", probs)
 
LogSoftmax = nn.LogSoftmax(dim=1)
log_probs = LogSoftmax(inputs)
print("log_probs:\n", log_probs)
 
# probs_sum in dim=1
probs_sum = [0 for i in range(batch_size)]
 
for i in range(batch_size):
    for j in range(class_num):
        probs_sum[i] += probs[i][j]
    print(i, "row probs sum:", probs_sum[i])
 
# to numpy
np_probs = probs.data.numpy()
print("numpy probs:\n", np_probs)
 
# np.log()
log_np_probs = np.log(np_probs)
print("log numpy probs:\n", log_np_probs)

基于pytorch softmax,logsoftmax 表达

import torch
import numpy as np
input = torch.autograd.Variable(torch.rand(1, 3))

print(input)
print('softmax={}'.format(torch.nn.functional.softmax(input, dim=1)))
print('logsoftmax={}'.format(np.log(torch.nn.functional.softmax(input, dim=1))))

以上为个人经验，希望能给大家一个参考，也希望大家多多支持三水点靠木。

Pytorch中Softmax和LogSoftmax的使用详解

- Author -

悲恋花丶无心之人

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

使用beaker让Facebook的Bottle框架支持session功能

Apr 23 Python

使用Python来编写HTTP服务器的超级指南

Feb 18 Python

Python开发中爬虫使用代理proxy抓取网页的方法示例

Sep 26 Python

python使用pandas实现数据分割实例代码

Jan 25 Python

python opencv之SIFT算法示例

Feb 24 Python

使用Python爬了4400条淘宝商品数据,竟发现了这些“潜规则”

Mar 23 Python

Numpy数据类型转换astype,dtype的方法

Jun 09 Python

Python爬虫实现验证码登录代码实例

May 10 Python

python绘制地震散点图

Jun 18 Python

通过python实现随机交换礼物程序详解

Jul 10 Python

Python接口自动化判断元素原理解析

Feb 24 Python

python json.dumps中文乱码问题解决

Apr 01 Python

Pytorch中Softmax与LogSigmoid的对比分析

Jun 05 #Python

Pytorch反向传播中的细节-计算梯度时的默认累加操作

pytorch 梯度NAN异常值的解决方案

Jun 05 #Python

pytorch 权重weight 与梯度grad 可视化操作

PyTorch 如何检查模型梯度是否可导

python-opencv 中值滤波{cv2.medianBlur(src, ksize)}的用法

解决Pytorch修改预训练模型时遇到key不匹配的情况

Jun 05 #Python

You might like

php学习之变量的使用

2011/05/29 PHP

深入理解php的MySQL连接类

2013/06/07 PHP

php面向对象的用户登录身份验证

2017/06/08 PHP

php微信开发之音乐回复功能

2018/06/14 PHP

php的lavarel框架中join和orWhere的用法

2020/12/28 PHP

jQuery基于当前元素进行下一步的遍历

2014/05/20 Javascript

jQuery的text()方法用法分析

2014/12/20 Javascript

js全选实现和判断是否有复选框选中的方法

2015/02/17 Javascript

简述Jquery与DOM对象

2015/07/10 Javascript

Angular发布1.5正式版，专注于向Angular 2的过渡

2016/02/18 Javascript

JavaScript常用本地对象小结

2016/03/28 Javascript

js实现(全选)多选按钮的方法【附实例】

2016/03/30 Javascript

让你一句话理解闭包(简单易懂)

2016/06/03 Javascript

Javascript中document.referrer隐藏来源的方法

2017/01/16 Javascript

JS高仿抛物线加入购物车特效实现代码

2017/02/20 Javascript

简单实现js拖拽效果

2017/07/25 Javascript

浅谈Vuejs Prop基本用法

2017/08/17 Javascript

微信小程序实现星星评价效果

2018/11/02 Javascript

一步步教你利用Docker设置Node.js

2018/11/20 Javascript

javascript中undefined的本质解析

2019/07/31 Javascript

[09:34]2018DOTA2国际邀请赛寻真——永不放弃的iG

2018/08/14 DOTA

[33:33]完美世界DOTA2联赛PWL S2 FTD.C vs SZ 第二场 11.27

2020/11/30 DOTA

python使用htmllib分析网页内容的方法

2015/05/08 Python

python开发之thread实现布朗运动的方法

2015/11/11 Python

python中requests使用代理proxies方法介绍

2017/10/25 Python

Python：Scrapy框架中Item Pipeline组件使用详解

2017/12/27 Python

python中for用来遍历range函数的方法

2018/06/08 Python

Python文件循环写入行时防止覆盖的解决方法

2018/11/09 Python

python中property和setter装饰器用法

2019/12/19 Python

前端面试必备之html5的新特性

2017/09/05 HTML / CSS

德国高尔夫商店：Golfshop.de

2019/06/22 全球购物

部队学习十八大感言

2014/01/11 职场文书

2015年七夕情人节感言

2015/08/03 职场文书

2016年清明节寄语

2015/12/04 职场文书

创业计划书之寿司

2019/07/19 职场文书

Java实战之课程信息管理系统的实现

2022/04/01 Java/Android