SRMD Pytorch

(English version is down below)

Python库依赖(建议在Anaconda环境下操作)

本仓库改编于原作者项目cszn/KAIR

相关论文：CVPR 2018 Learning a Single Convolutional Super-Resolution Network for Multiple Degradations

模型结构：

SRMD已经训练的模型保存在model_zoo中。(模型来源于原作者 https://drive.google.com/drive/folders/13kfr3qny7S2xwG9h7v95F5mkWs0OmU0D)

SRMD模型：

srmd_x2.pth
srmd_x3.pth
srmd_x4.pth

输入：19维的数据，其中15维为经过PCA降维后进行维度拉伸的模糊核，还有1个维度为图片噪声维度，另外3个维度分别为图片的RGB通道，即模型输入为：(19,图片宽,图片高)

图片经过GAN网络的处理，最后进行PixelShuffle，放大指定的倍数。

输出：放大了的图片的RGB通道，即模型输出为：(3,图片宽,图片高)

SRMD Noise Free模型：

srmdnf_x2.pth
srmdnf_x3.pth
srmdnf_x4.pth

输入：18维的数据，其中15维为经过PCA降维后进行维度拉伸的模糊核，另外3个维度分别为图片的RGB通道，即模型输入为：(18,图片宽,图片高)

图片经过GAN网络的处理，最后进行PixelShuffle，放大指定的倍数。

输出：图片的RGB通道，即模型输出为：(3,图片宽,图片高)

使用模型

本脚本使用原作者训练的模型进行预测，如有需要请自行训练模型，自行定义模糊核和PCA降维数据。

使用双三次插值方法实现Alpha（透明）通道的放大，弥补了源模型不支持透明通道的缺憾。

使用方法：

main_srmd.py -i 输入路径 -o 输出路径 [选项]...

  -h help              显示这个帮助
  -i input-path        输入图片路径 (jpg/png/...) 或文件夹 (默认=运行文件夹)
  -o output-path       输出图片路径 (jpg/png/...) 或文件夹 (默认=运行文件夹)
  -n noise-level       降噪等级 (-1/0/1/2/3/4/5/6/7/8/9/10, 默认=3)
  -s scale             放大比例 (2/3/4, 默认=2)
  -m model-path        srmd 模型路径 (默认='model_zoo')
  -p srmd-pca-path     srmd 模糊核 pca 数据路径 (默认='kernels/srmd_pca_matlab.mat')
  -x tta               开启 x8 性能模式 (默认关闭)
  -c cuda              开启 CUDA GPU 计算 (默认关闭)
  -f format            输出图片格式 (jpg/png/..., 默认=png)

支持的图片格式 'jpg', 'JPG', 'jpeg', 'JPEG', 'png', 'PNG', 'ppm', 'PPM', 'bmp', 'BMP', 'tif'

注：X8性能模式（又称TTA模式）为讲图片另外进行7种不同的旋转等数据增强操作，然后取放大后各个像素点平均值，从而使得图片质量更佳，但是会比原来慢8倍

input-path和output-path接受文件路径或目录路径
noise-level=降噪等级，值越大表示去噪效果越强，-1=不降噪
scale=放大比例，2=放大2x，3=放大3x，4=放大4x

直接运行main_srmd.py即可进行模型的使用来对图片进行放大。

运行过程中出现 RuntimeError: CUDA error: unspecified launch failure

此问题在开启-c参数 CUDA GPU 计算时在处理分辨率较大的图片会出现此问题，一般为显存溢出导致，建议通过运行时不加-c关闭 CUDA GPU 计算。

训练模型

options/train_srmd.json中定义了模型的参数，可以进行修改调参。同时其中也给定了训练集和测试集路径的定义等。

按照默认配置，请向trainsets/trainH中添加高质量图片来进行模型的训练。

testsets/set5为测试集。

直接运行main_train_srmd.py即可进行训练。

训练日志保存在superresolution/srmd/train.log，每次训练开始时的配置保存在superresolution/srmd/options文件夹中。

options/train_srmd.json中checkpoint_test配置每进行5个epoch训练后进行测试，并将测试生成的图片保存在superresolution/srmd/images中;checkpoint_save配置每进行5个epoch训练后保存当前模型。

训练产生的模型保存在superresolution/srmd/models文件夹中。

温馨提示：如果设置了batch_size>1，请保证训练集中图片分辨率相同，且个数超过一个batch_size大小，否则建议batch_size值设为1。

测试模型

直接运行main_test_srmd.py即可对指定模型进行测试。

按照默认配置，默认testsets/set5为测试集。

请确保PCA降维数据文件kernels/srmd_pca_matlab.mat存在（使用该文件预定义的参数对模糊核进行PCA降维，准备处理输入数据）。

测试结果和记录存放在results中。

查看模型描述

直接运行describe_model.py即可查看每个SRMD模型(6个)的网络结构和输出。

转换为ONNX/NCNN模型

确保SRMD已经训练的模型保存在model_zoo中，直接运行pytorch2onnx.py即可将pytorch模型转为onnx模型。

转换后的模型存放在onnx_models中。

随后步骤参考：https://github.com/Tencent/ncnn/blob/master/docs/how-to-use-and-FAQ/use-ncnn-with-pytorch-or-onnx.md

使用onnx-simplifier简化生成后的ONNX模型：

cd onnx_models
pip install onnx-simplifier
python -m onnxsim srmd_x2.onnx srmd_x2-sim.onnx
python -m onnxsim srmd_x3.onnx srmd_x3-sim.onnx
python -m onnxsim srmd_x4.onnx srmd_x4-sim.onnx
python -m onnxsim srmdnf_x2.onnx srmdnf_x2-sim.onnx
python -m onnxsim srmdnf_x3.onnx srmdnf_x3-sim.onnx
python -m onnxsim srmdnf_x4.onnx srmdnf_x4-sim.onnx

然后就可以使用编译好的NCNN工具将ONNX模型转化成NCNN模型。

这里我在onnx2ncnn文件夹下准备了在Win64环境下通过使用windows-vs2019-avx2 CI MSVC 19.27.29112.0环境编译NCNN #36a591d仓库得到的NCNN工具可执行文件onnx2ncnn.exe。如果你的系统为Win64，在确保上述步骤已经完成的情况下，可以直接双击执行脚本convert.cmd，会直接在onnx2ncnn/srmd_ncnn_models文件夹下自动生成SRMD NCNN模型文件。

否则请自行编译NCNN，得到onnx2ncnn工具，然后手动执行命令:

onnx2ncnn srmd_x2-sim.onnx srmd_x2.param srmd_x2.bin
onnx2ncnn srmd_x3-sim.onnx srmd_x3.param srmd_x3.bin
onnx2ncnn srmd_x4-sim.onnx srmd_x4.param srmd_x4.bin
onnx2ncnn srmdnf_x2-sim.onnx srmdnf_x2.param srmdnf_x2.bin
onnx2ncnn srmdnf_x3-sim.onnx srmdnf_x3.param srmdnf_x3.bin
onnx2ncnn srmdnf_x4-sim.onnx srmdnf_x4.param srmdnf_x4.bin

可选

你还可以使用ncnnoptimize进行模型优化：(参考 https://github.com/Tencent/ncnn/wiki/use-ncnnoptimize-to-optimize-model)

当然，同onnx2ncnn，我也提供了Win64版本的ncnnoptimize.exe。如果你的系统为Win64，在确保上述步骤已经完成的情况下，可以直接双击执行脚本optimize.cmd，会直接在onnx2ncnn/srmd_ncnn_models文件夹下自动生成优化后的SRMD NCNN模型文件并覆盖。

否则请自行编译NCNN，得到ncnnoptimize工具。

下面命令将会将优化后得到的模型覆盖原模型:
ncnnoptimize srmd_x2.param srmd_x2.bin srmd_x2.param srmd_x2.bin 65536
ncnnoptimize srmd_x3.param srmd_x3.bin srmd_x3.param srmd_x3.bin 65536
ncnnoptimize srmd_x4.param srmd_x4.bin srmd_x4.param srmd_x4.bin 65536
ncnnoptimize srmdnf_x2.param srmdnf_x2.bin srmdnf_x2.param srmdnf_x2.bin 65536
ncnnoptimize srmdnf_x3.param srmdnf_x3.bin srmdnf_x3.param srmdnf_x3.bin 65536
ncnnoptimize srmdnf_x4.param srmdnf_x4.bin srmdnf_x4.param srmdnf_x4.bin 65536

最后，由于某些未知原因，你需要修改*.param中的参数，使得模型能够被srmd-ncnn-vulkan使用。

例如:

经过模型优化的srmd_x2.param修改前：

7767517
14 14
Input                    input.1                  0 1 input.1
Convolution              Conv_0                   1 1 input.1 26 0=128 1=3 4=1 5=1 6=21888 9=1
Convolution              Conv_2                   1 1 26 28 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_4                   1 1 28 30 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_6                   1 1 30 32 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_8                   1 1 32 34 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_10                  1 1 34 36 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_12                  1 1 36 38 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_14                  1 1 38 40 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_16                  1 1 40 42 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_18                  1 1 42 44 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_20                  1 1 44 46 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_22                  1 1 46 47 0=12 1=3 4=1 5=1 6=13824
PixelShuffle             Reshape_27               1 1 47 52 0=2

修改后：

7767517
14 14
Input                    input                    0 1 input
Convolution              Conv_0                   1 1 input 26 0=128 1=3 4=1 5=1 6=21888 9=1
Convolution              Conv_2                   1 1 26 28 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_4                   1 1 28 30 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_6                   1 1 30 32 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_8                   1 1 32 34 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_10                  1 1 34 36 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_12                  1 1 36 38 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_14                  1 1 38 40 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_16                  1 1 40 42 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_18                  1 1 42 44 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_20                  1 1 44 46 0=128 1=3 4=1 5=1 6=147456 9=1
Convolution              Conv_22                  1 1 46 47 0=12 1=3 4=1 5=1 6=13824
PixelShuffle             output                   1 1 47 output 0=2

你需要将最后一行的Reshape_27与52替换为output来指定输出，并且把第三行和第四行input.1替换为input.

你可以直接使用Python运行fix_input_output.py来进行修改。

SRMD Pytorch

Python Dependencies (Recommend running under Anaconda)

This repository is derived from the original author project cszn/KAIR.

Related papers: CVPR 2018 Learning a Single Convolutional Super-Resolution Network for Multiple Degradations

Model structure:

SRMD trained models are saved in model_zoo. (The model comes from the original author https://drive.google.com/drive/folders/13kfr3qny7S2xwG9h7v95F5mkWs0OmU0D )

SRMD model：

srmd_x2.pth
srmd_x3.pth
srmd_x4.pth

Input: 19 dimensional data, of which 15 dimensions are blur kernels after dimension reduction by PCA, 1 dimension is image noise dimension, and the other 3 dimensions are RGB channels of images, so the model input is: (19, image width, image height)

The image is processed by GAN network, and finally PixelSuffle is performed to enlarge the specified pictures.

Output: the RGB channel of the enlarged image, so the model output is: (3, picture width, image height)

SRMD Noise Free model：

srmdnf_x2.pth
srmdnf_x3.pth
srmdnf_x4.pth

Input: 18 dimensional data, of which 15 dimensions are blur kernels after dimension reduction by PCA, and the other 3 dimensions are RGB channels of images, so the model input is: (18, image width, image height)

The image is processed by GAN network, and finally PixelSuffle is performed to enlarge the specified pictures.

Output: the RGB channel of the enlarged image, so the model output is: (3, picture width, image height)

Use Model

This script uses the model trained by the original author for prediction. If necessary, please train the model, define blur kernel and PCA dimension reduction data by yourself.

The bicubic interpolation method is used to enlarge the alpha channel, which makes up for the defect that the source model does not support transparent (Alpha) channel.

Usage：

< 8000 div class="snippet-clipboard-content notranslate position-relative overflow-auto" data-snippet-clipboard-copy-content="main_srmd.py -i infile -o outfile [options]... -h help show this help -i input-path input image path (jpg/png/...) or directory (default=running directory) -o output-path output image path (jpg/png/...) or directory (default=running directory) -n noise-level denoise level (-1/0/1/2/3/4/5/6/7/8/9/10, default=3) -s scale upscale ratio (2/3/4, default=2) -m model-path srmd model path (default='model_zoo') -p srmd-pca-path srmd blur kernel pca data path (default='kernels/srmd_pca_matlab.mat') -x tta enable x8 performance mode (default disabled) -c cuda enable CUDA GPU caculating (default disabled) -f format output image format (jpg/png/..., default=png) ">

main_srmd.py -i infile -o outfile [options]...

  -h help              show this help
  -i input-path        input image path (jpg/png/...) or directory (default=running directory)
  -o output-path       output image path (jpg/png/...) or directory (default=running directory)
  -n noise-level       denoise level (-1/0/1/2/3/4/5/6/7/8/9/10, default=3)
  -s scale             upscale ratio (2/3/4, default=2)
  -m model-path        srmd model path (default='model_zoo')
  -p srmd-pca-path     srmd blur kernel pca data path (default='kernels/srmd_pca_matlab.mat')
  -x tta               enable x8 performance mode (default disabled)
  -c cuda              enable CUDA GPU caculating (default disabled)
  -f format            output image format (jpg/png/..., default=png)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SRMD Pytorch

使用模型

运行过程中出现 RuntimeError: CUDA error: unspecified launch failure

训练模型

测试模型

查看模型描述

转换为ONNX/NCNN模型

可选

SRMD Pytorch

Use Model

Encounter `RuntimeError: CUDA error: unspecified launch failure`

Training Models

Testing Models

View Models description

Convert to ONNX / NCNN model

Optimal

About

Releases

Sponsor this project

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
kernels		kernels
model_zoo		model_zoo
onnx2ncnn		onnx2ncnn
onnx_models		onnx_models
options		options
results/set5_srmdnf_x4		results/set5_srmdnf_x4
superresolution/srmd		superresolution/srmd
testsets/set5		testsets/set5
trainsets/trainH		trainsets/trainH
utils		utils
LICENSE		LICENSE
README.md		README.md
describe_model.py		describe_model.py
main_srmd.py		main_srmd.py
main_test_srmd.py		main_test_srmd.py
main_train_srmd.py		main_train_srmd.py
pytorch2onnx.py		pytorch2onnx.py
requirements.txt		requirements.txt

License

HollowMan6/SRMD-Pytorch

Folders and files

Latest commit

History

Repository files navigation

SRMD Pytorch

使用模型

运行过程中出现 RuntimeError: CUDA error: unspecified launch failure

训练模型

测试模型

查看模型描述

转换为ONNX/NCNN模型

可选

SRMD Pytorch

Use Model

Encounter RuntimeError: CUDA error: unspecified launch failure

Training Models

Testing Models

View Models description

Convert to ONNX / NCNN model

Optimal

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Encounter `RuntimeError: CUDA error: unspecified launch failure`

Packages