Interpreting PyTorch shapes

 
import torch 
a=torch.linspace(start=1,end=6,steps=6).reshape(2,3)
a
tensor([[1., 2., 3.],
        [4., 5., 6.]])

a.shape
torch.Size([2, 3])
In PyTorch's view, the shape of a is (2, 3), which can be read as 3 vectors of dimension 2 each.

PyTorch counts dimensions from 0. When a 2-D table is printed for humans, dimension 0 runs along the column direction (vertically, down the rows).

In PyTorch, shape = (dim 0, dim 1, ..., dim n)
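A quick check of both readings with the tensor a from above: one index along dim 0 picks a row of length 3, and one index along dim 1 picks a column of length 2.

```python
import torch

a = torch.linspace(start=1, end=6, steps=6).reshape(2, 3)

print(a[0])     # tensor([1., 2., 3.]) -- index along dim 0, shape (3,)
print(a[:, 0])  # tensor([1., 4.])     -- index along dim 1, shape (2,)
```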

Dimension operations

 
a = torch.randn(2, 3, 4)

# permute reorders all the dimensions at once
torch.permute(a, (2, 1, 0)).shape  # torch.Size([4, 3, 2])
a.permute(2, 1, 0).shape           # torch.Size([4, 3, 2])

# transpose swaps exactly two dimensions, and only two
torch.transpose(a, 0, 1).shape     # torch.Size([3, 2, 4])

Transpose

 
import torch
a=torch.tensor([[1,2,3]])
a.shape
torch.Size([1, 3])


b=a.t()
b.shape
torch.Size([3, 1])
    

With only 2 dimensions, you can still think in terms of rows and columns.


A shape of (2, 3) reads as 3 vectors of dimension 2, but that is only a reading,
or how PyTorch happens to present it;
a given business meaning may call for a different reading.

Numerically it is just as much 2 vectors of dimension 3, and the two readings are equivalent.
Once there are more dimensions, you can no longer state the meaning of each one at a glance,
so take the plain math/numbers/matrix view:
indexing along one dimension removes only that dimension and leaves the sizes of all the others unchanged.

 
(2, 3): 2 rows and 3 columns; take one row, and its shape is (3,)
(n1, n2): n1 rows and n2 columns; one index along the 1st dimension (index 0) gives shape (n2,)
(n1, n2): n1 rows and n2 columns; one index along the 2nd dimension (index 1) gives shape (n1,)

(n1, n2, ..., n(m-1), n(m), n(m+1), ..., nk):
rows and columns no longer apply here, but the logic stays the same

one index along the dimension of size n(m) gives shape (n1, n2, ..., n(m-1), n(m+1), ..., nk)

Dimension indices in PyTorch start at 0.
    

Indexing along the 2nd dimension (index 1)

 
```python
import torch

torch.manual_seed(73)
a = torch.ones(1,3,2,2)
print(a)

a = a[:, 1, :, :]
print(a.shape)  # torch.Size([1, 2, 2])  -- the indexed dimension disappears

a = torch.unsqueeze(input=a, dim=1)  # add a dimension back at dim 1
print(a.shape)  # torch.Size([1, 1, 2, 2])
print(a)

"""
tensor([[[[1., 1.],
          [1., 1.]],

         [[1., 1.],
          [1., 1.]],

         [[1., 1.],
          [1., 1.]]]])
torch.Size([1, 2, 2])
torch.Size([1, 1, 2, 2])
tensor([[[[1., 1.],
          [1., 1.]]]])
"""

```

Taking a slice along a dimension

 
a[:,n:m,:]
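A concrete check, reusing a (1, 3, 2, 2) tensor of ones: unlike indexing, slicing keeps the dimension but shrinks its size.

```python
import torch

a = torch.ones(1, 3, 2, 2)

print(a[:, 1:3, :, :].shape)  # torch.Size([1, 2, 2, 2]) -- dim 1 shrinks from 3 to 2
print(a[:, 1:2, :, :].shape)  # torch.Size([1, 1, 2, 2]) -- a length-1 slice keeps the dim
```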

Mean along a dimension

 
```python 
import torch

torch.manual_seed(73)
a = torch.ones(1,3,2,2)

# By default, taking the mean removes the reduced dimension --
# the mean of a list of numbers is a single number
print(a.mean(dim=1).shape)  # torch.Size([1, 2, 2])
print(a.mean(dim=1, keepdim=True).shape)  # torch.Size([1, 1, 2, 2])
print(a)

"""
torch.Size([1, 2, 2])
torch.Size([1, 1, 2, 2])
tensor([[[[1., 1.],
          [1., 1.]],

         [[1., 1.],
          [1., 1.]],

         [[1., 1.],
          [1., 1.]]]])

"""
```

Max along a dimension

 
import torch
a = torch.tensor([[1., 2., 3.],
                  [4., 5., 6.]])

# max along dim 0
values, indices = a.max(dim=0)
values
tensor([4., 5., 6.])

indices
tensor([1, 1, 1])

# max along dim 1
values, indices = a.max(dim=1)
values
tensor([3., 6.])

indices
tensor([2, 2])

 


unsqueeze: add a dimension at the given dim

 
import torch
probs = torch.tensor([[0.9, 0.1], [0.1, 0.9], [0.8, 0.2]]).unsqueeze(0)
probs
tensor([[[0.9000, 0.1000],
         [0.1000, 0.9000],
         [0.8000, 0.2000]]])

probs.shape
torch.Size([1, 3, 2])

unsqueeze(0) turns the original shape (3, 2) into (1, 3, 2).

 

  

x.view: compatible with ONNX export

 
import torch

x = torch.tensor([[1, 2, 3], [4, 5, 6]])
x_flatten = x.view(2, -1)  # -1 means this dimension's size is inferred automatically
print(x_flatten)

 
tensor([[1, 2, 3],
        [4, 5, 6]])
     

 
x = torch.randn(3,2,3)
x 
tensor([[[-1.8351, -0.6714, -1.3090],
         [ 1.0558,  1.2539, -0.6256]],

        [[-0.8478,  1.4329, -1.0031],
         [-1.2600,  0.6706,  1.2318]],

        [[ 0.9180,  0.0779, -1.1079],
         [-2.0011, -0.6397, -1.0539]]])

 
x.view(3,-1)
tensor([[-1.8351, -0.6714, -1.3090,  1.0558,  1.2539, -0.6256],
        [-0.8478,  1.4329, -1.0031, -1.2600,  0.6706,  1.2318],
        [ 0.9180,  0.0779, -1.1079, -2.0011, -0.6397, -1.0539]])
        

x.reshape: ONNX-compatible

 
x.reshape(3,-1)
tensor([[-1.8351, -0.6714, -1.3090,  1.0558,  1.2539, -0.6256],
        [-0.8478,  1.4329, -1.0031, -1.2600,  0.6706,  1.2318],
        [ 0.9180,  0.0779, -1.1079, -2.0011, -0.6397, -1.0539]])
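The practical difference between the two: view never copies and therefore needs a compatible memory layout, while reshape falls back to copying when it must. A quick way to see this is with a transposed (hence non-contiguous) tensor:

```python
import torch

x = torch.randn(2, 3)
t = x.t()  # transposing makes the tensor non-contiguous

print(t.is_contiguous())    # False
print(t.reshape(-1).shape)  # torch.Size([6]) -- reshape copies when it has to

try:
    t.view(-1)  # view cannot express this layout without a copy
except RuntimeError as e:
    print(e)    # the error suggests using .reshape(...) instead
```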

nn.Flatten(): ONNX-compatible

 
# flatten layer
self.flatten = nn.Flatten()

x = self.flatten(x)
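For context, a minimal sketch of the module those two fragments usually live in (TinyNet and its layer sizes are illustrative, not from the original):

```python
import torch
import torch.nn as nn

class TinyNet(nn.Module):  # hypothetical module, for illustration only
    def __init__(self):
        super().__init__()
        self.flatten = nn.Flatten()        # flattens dims 1..end, keeps the batch dim
        self.fc = nn.Linear(3 * 2 * 2, 5)

    def forward(self, x):
        x = self.flatten(x)  # (N, 3, 2, 2) -> (N, 12)
        return self.fc(x)

net = TinyNet()
print(net(torch.ones(4, 3, 2, 2)).shape)  # torch.Size([4, 5])
```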

PyTorch matrix multiplication

matmul

 
import torch
a = torch.rand(3,3,7)
b = torch.rand(3,7,3)
torch.matmul(a,b)
tensor([[[1.4016, 1.6263, 1.0781],
         [1.0632, 1.0419, 0.7032],
         [0.9459, 1.0396, 0.6318]],

        [[1.0026, 1.4292, 0.8176],
         [1.2285, 1.4016, 1.1455],
         [0.8004, 1.0875, 0.7757]],

        [[0.7738, 1.4511, 1.5687],
         [1.5427, 2.4797, 2.1463],
         [1.3152, 1.6549, 2.1859]]])

@

 
a@b
tensor([[[1.4016, 1.6263, 1.0781],
         [1.0632, 1.0419, 0.7032],
         [0.9459, 1.0396, 0.6318]],

        [[1.0026, 1.4292, 0.8176],
         [1.2285, 1.4016, 1.1455],
         [0.8004, 1.0875, 0.7757]],

        [[0.7738, 1.4511, 1.5687],
         [1.5427, 2.4797, 2.1463],
         [1.3152, 1.6549, 2.1859]]])

torch.mm: 2-D matrices only

 
import torch
torch.manual_seed(73) 
a = torch.rand(3,7)
b = torch.rand(7,3)
torch.mm(a,b)
tensor([[1.2050, 1.7727, 1.8881],
        [2.2234, 2.1534, 2.0374],
        [1.4380, 1.7234, 1.8297]])

a = a.unsqueeze(dim=0)
b = b.unsqueeze(dim=0)
torch.matmul(a,b)
tensor([[[1.2050, 1.7727, 1.8881],
         [2.2234, 2.1534, 2.0374],
         [1.4380, 1.7234, 1.8297]]])

torch.mm(a,b)
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
Cell In[3], line 1
----> 1 torch.mm(a,b)

RuntimeError: self must be a matrix

torch.bmm

 
import torch
torch.manual_seed(73) 
a = torch.rand(3,3,7)
b = torch.rand(3,7,3)
torch.matmul(a,b)
tensor([[[0.7847, 1.2145, 1.3052],
         [0.8276, 2.3949, 1.8271],
         [0.8289, 1.2427, 1.0018]],

        [[2.5169, 1.9156, 2.4408],
         [2.0237, 1.2965, 1.1980],
         [2.3762, 1.9028, 2.2781]],

        [[0.9137, 1.8393, 1.1406],
         [0.8456, 1.5484, 1.5531],
         [1.2973, 2.3349, 2.3504]]])

torch.bmm(a,b)
tensor([[[0.7847, 1.2145, 1.3052],
         [0.8276, 2.3949, 1.8271],
         [0.8289, 1.2427, 1.0018]],

        [[2.5169, 1.9156, 2.4408],
         [2.0237, 1.2965, 1.1980],
         [2.3762, 1.9028, 2.2781]],

        [[0.9137, 1.8393, 1.1406],
         [0.8456, 1.5484, 1.5531],
         [1.2973, 2.3349, 2.3504]]])

mm is restricted to 2-D matrices.
bmm requires both inputs to be exactly 3-D (a batch of matrices) and does not broadcast.
matmul and @ are the general forms: they handle 2-D, batched, and broadcast cases alike.
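One case that separates them, sketched below: matmul broadcasts a 2-D operand across the batch, while bmm insists on two 3-D inputs.

```python
import torch

torch.manual_seed(73)
a = torch.rand(3, 3, 7)  # a batch of three 3x7 matrices
b = torch.rand(7, 3)     # a single 7x3 matrix

print(torch.matmul(a, b).shape)  # torch.Size([3, 3, 3]) -- b is broadcast over the batch
print((a @ b).shape)             # torch.Size([3, 3, 3]) -- same as matmul

try:
    torch.bmm(a, b)  # bmm requires b to be 3-D as well
except RuntimeError as e:
    print(e)
```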

Saving and loading PyTorch models

Saving the state dict

 
torch.save(model.state_dict(), model_param_path)

Loading the state dict

 
model = MyModel()
model.load_state_dict(torch.load(PATH))

Before resuming training, call
model.train()

Before inference, call
model.eval()
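An end-to-end sketch of the round trip; MyModel and the file name here are placeholders:

```python
import torch
import torch.nn as nn

class MyModel(nn.Module):  # stand-in model, for illustration only
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(3, 2)

    def forward(self, x):
        return self.fc(x)

model_param_path = "model.pth"  # hypothetical path

model = MyModel()
torch.save(model.state_dict(), model_param_path)  # saves only the parameters

restored = MyModel()  # the architecture must be rebuilt in code
restored.load_state_dict(torch.load(model_param_path))
restored.eval()       # switch to inference mode before predicting
```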

PyTorch on CPU

Using torch.set_num_threads

 
import torch

# set the global number of threads
torch.set_num_threads(4)

# now import your model and the rest of your PyTorch code...
    

 
Note: torch.set_num_threads sets the global number of threads PyTorch uses for
intra-op parallelism on the CPU, so it affects operations such as the forward pass.
Data-loading parallelism is controlled separately via the DataLoader's num_workers.
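torch.get_num_threads reports the current value, which makes for a quick sanity check:

```python
import torch

torch.set_num_threads(4)
print(torch.get_num_threads())  # 4
```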
    

Setting environment variables:

 
import os

# cap the thread count via environment variables (set these before importing torch)
os.environ['OMP_NUM_THREADS'] = '4'
os.environ['MKL_NUM_THREADS'] = '4'

# now import PyTorch and your model
import torch
import torch.nn as nn

# your model definition and loading code...
    

Controlling workers during data loading:

 
If your bottleneck is data loading rather than model inference,
adjust the DataLoader's num_workers parameter to control how many worker processes load data.
    

 
from torch.utils.data import DataLoader, Dataset

# assume you have a custom Dataset class
dataset = MyDataset()

# create the DataLoader with num_workers set
dataloader = DataLoader(dataset, batch_size=32, num_workers=4)
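MyDataset above is a placeholder; a minimal runnable stand-in using TensorDataset:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# 100 samples with 3 features each, plus integer labels
dataset = TensorDataset(torch.randn(100, 3), torch.randint(0, 2, (100,)))
dataloader = DataLoader(dataset, batch_size=32, num_workers=4)

for xb, yb in dataloader:
    print(xb.shape, yb.shape)  # torch.Size([32, 3]) torch.Size([32])
    break
```

On platforms that spawn worker processes (e.g. Windows), run this under an if __name__ == "__main__": guard when num_workers > 0.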
    

PyTorch on GPU

Moving computation to the GPU


device = "cuda:0" if torch.cuda.is_available() else "cpu"
X = X.to(device=device)
y = y.to(device=device)


USE_CUDA = torch.cuda.is_available()
device = torch.device("cuda" if USE_CUDA else "cpu")

 
import torch
use_gpu = torch.cuda.is_available()
if use_gpu:
    model.cuda()
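Putting the pieces together as a minimal sketch: the model and every tensor it touches must sit on the same device, so move both.

```python
import torch
import torch.nn as nn

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

model = nn.Linear(3, 2).to(device)  # move the parameters
X = torch.randn(4, 3).to(device)    # move the inputs

out = model(X)  # works because everything is on the same device
print(out.device)
```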

 

 
import torch
x = torch.randn(2, 3)
x
tensor([[ 1.3599,  1.0726, -1.0500],
        [-0.1350, -1.6993, -1.1693]])
    

 
x.device
device(type='cpu')
    

 
Tensors are created on the CPU by default; on a machine without a GPU, that is the only option.
    

Moving from CPU to GPU

 
# Returns a copy of this object in CUDA memory.
x = x.cuda()  # .cuda() returns a copy, so reassign the result

device = "cuda:0" if torch.cuda.is_available() else "cpu"
X = X.to(device=device)


Fixing random seeds in PyTorch

CPU

 
torch.manual_seed(seed)  # seed PyTorch on the CPU

np.random.seed(seed)  # NumPy
random.seed(seed)  # Python's random module

GPU

 
torch.cuda.manual_seed(seed)  # seed the current GPU
torch.cuda.manual_seed_all(seed)  # seed all GPUs when using multi-GPU
torch.backends.cudnn.benchmark = False
torch.backends.cudnn.deterministic = True
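Collected into one helper, a common pattern (the name seed_everything is a convention, not a PyTorch API):

```python
import random

import numpy as np
import torch

def seed_everything(seed: int) -> None:
    """Fix the common sources of randomness for reproducibility."""
    random.seed(seed)                       # Python's random module
    np.random.seed(seed)                    # NumPy
    torch.manual_seed(seed)                 # PyTorch CPU RNG
    torch.cuda.manual_seed_all(seed)        # all GPU RNGs (no-op without CUDA)
    torch.backends.cudnn.benchmark = False  # stop autotuning from picking varying kernels
    torch.backends.cudnn.deterministic = True

seed_everything(73)
```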
