python - 将图像另存为numpy数组

标签 python image numpy opencv multidimensional-array

我无法将图像加载到numpy数组中,并得到这样的错误...

ValueError: could not broadcast input array from shape (175,217,3) into shape (100,100,3)



功能代码:
import cv2
import numpy as np
import os

train_data_dir = '/home/ec2-user/SageMaker/malaria-detection-model/malaria/training'
valid_data_dir = '/home/ec2-user/SageMaker/malaria-detection-model/malaria/validation'

# declare the number of samples in each category
nb_train_samples = 22045 #  training samples
nb_valid_samples = 5513#  validation samples
num_classes = 2
img_rows_orig = 100
img_cols_orig = 100

def load_training_data():
    labels = os.listdir(train_data_dir)
    total = len(labels)

    X_train = np.ndarray((nb_train_samples, img_rows_orig, img_cols_orig, 3), dtype=np.uint8)
    Y_train = np.zeros((nb_train_samples,), dtype='uint8')

    i = 0
    j = 0
    for label in labels:
        image_names_train = os.listdir(os.path.join(train_data_dir, label))
        total = len(image_names_train)
        print(label, total)
        for image_name in image_names_train:
            img = cv2.imread(os.path.join(train_data_dir, label, image_name), cv2.IMREAD_COLOR)
            img = np.array([img])
            X_train[i] = img
            Y_train[i] = j

            if i % 100 == 0:
                print('Done: {0}/{1} images'.format(i, total))
            i += 1
        j += 1    
    print(i)                
    print('Loading done.')

    np.save('imgs_train.npy', X_train, Y_train)
    return X_train, Y_train

该功能是文件load_data.py的一部分,该文件可以从以下位置的malaria_cell_classification_code.zip文件中找到:

https://ceb.nlm.nih.gov/repositories/malaria-datasets/

我试图将X_train和Y_train更改为列表而不是numpy数组。该函数在np.save方法处停止。
X_train = Y_train = list()
        X_train.append(img)
        Y_train.append(j)

在numpy中保存图像的正确和标准方法是什么?

调整图像大小后,出现不同的错误:
Done: 19400/9887 images
Done: 19500/9887 images
Done: 19600/9887 images
Done: 19700/9887 images
Done: 19800/9887 images
19842
Loading done.
Transform targets to keras compatible format.
Done: 19800/9887 images
19842
Loading done.
Transform targets to keras compatible format.
------------------------------
Creating validation images...
------------------------------
Parasitized 1098
---------------------------------------------------------------------------
error                                     Traceback (most recent call last)
<ipython-input-6-8008be74f482> in <module>()
      2 #load data for training
      3 X_train, Y_train = load_resized_training_data(img_rows, img_cols)
----> 4 X_valid, Y_valid = load_resized_validation_data(img_rows, img_cols)
      5 #print the shape of the data
      6 print(X_train.shape, Y_train.shape, X_valid.shape, Y_valid.shape)

~/SageMaker/malaria-detection-model/malaria_cell_classification_code/load_data.py in load_resized_validation_data(img_rows, img_cols)
    103 def load_resized_validation_data(img_rows, img_cols):
    104 
--> 105     X_valid, Y_valid = load_validation_data()
    106 
    107     # Resize images

~/SageMaker/malaria-detection-model/malaria_cell_classification_code/load_data.py in load_validation_data()
     75 
     76             img = np.array([img])
---> 77             img2 = cv2.resize(img, (100, 100))
     78             X_valid[i] = img2
     79             Y_valid[i] = j

error: OpenCV(4.0.0) /io/opencv/modules/imgproc/src/resize.cpp:3427: error: (-215:Assertion failed) !dsize.empty() in function 'resize'


------------------------------
Creating validation images...
------------------------------
Parasitized 1098
---------------------------------------------------------------------------
error                                     Traceback (most recent call last)
<ipython-input-6-8008be74f482> in <module>()
      2 #load data for training
      3 X_train, Y_train = load_resized_training_data(img_rows, img_cols)
----> 4 X_valid, Y_valid = load_resized_validation_data(img_rows, img_cols)
      5 #print the shape of the data
      6 print(X_train.shape, Y_train.shape, X_valid.shape, Y_valid.shape)

~/SageMaker/malaria-detection-model/malaria_cell_classification_code/load_data.py in load_resized_validation_data(img_rows, img_cols)
    103 def load_resized_validation_data(img_rows, img_cols):
    104 
--> 105     X_valid, Y_valid = load_validation_data()
    106 
    107     # Resize images

~/SageMaker/malaria-detection-model/malaria_cell_classification_code/load_data.py in load_validation_data()
     75 
     76             img = np.array([img])
---> 77             img2 = cv2.resize(img, (100, 100))
     78             X_valid[i] = img2
     79             Y_valid[i] = j

error: OpenCV(4.0.0) /io/opencv/modules/imgproc/src/resize.cpp:3427: error: (-215:Assertion failed) !dsize.empty() in function 'resize'

完整的脚本可以在这里找到...

https://gist.github.com/shantanuo/cfe0913b367647890451f5ae3f6fb691

最佳答案

opencv2已经返回一个numpy数组。不要创建新的嵌套,尤其是不要嵌套更多的嵌套:

img = cv2.imread(os.path.join(train_data_dir, label, image_name), cv2.IMREAD_COLOR)
img = cv2.resize(img, (100, 100))

关于python - 将图像另存为numpy数组,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/54301388/

相关文章:

python - 如何使用 Python 在 Linux 中获取总运行进程数和线程数?

php - 如何计算给定图像的比例尺寸不超过

java - 带有 base64 图像的 HTML 到 PDF 抛出 FileNotFoundException

python - 快速、干净地查找 Numpy 数组索引的方法

python - 数据均值/方差的变化会影响svm分类器吗?

python - 有条件删除 Pandas 中的重复条目

Python setattr() 找不到明显存在的属性

html - 如何在不将其设置为背景的情况下将图像垂直和水平居中?

python - 来自 numpy 数组的 RawArray?

python - 矩阵每行包含特定数字