Python: PGM parsing and format conversion

阿新 • Published: 2018-12-19

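After downloading the ORL face database, I found that its images are stored in the PGM format. I had run into this format before, so this time I looked closely at how it works and wrote a script to convert it to other image formats.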

Reference: pgm

`PGM` parsing

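PGM (Portable Gray Map) is an image format designed by the open-source Netpbm project. Its sibling formats are PBM and PPM.

A PGM file can hold one or more PGM images. Each image is laid out as follows: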

1. A "magic number" for identifying the file type. A pgm image's magic number is the two characters "P5".
2. Whitespace (blanks, TABs, CRs, LFs).
3. A width, formatted as ASCII characters in decimal.
4. Whitespace.
5. A height, again in ASCII decimal.
6. Whitespace.
7. The maximum gray value (Maxval), again in ASCII decimal. Must be less than 65536, and more than zero.
8. A single whitespace character (usually a newline).
9. A raster of Height rows, in order from top to bottom. Each row consists of Width gray values, in order from left to right. Each gray value is a number from 0 through Maxval, with 0 being black and Maxval being white. Each gray value is represented in pure binary by either 1 or 2 bytes. If the Maxval is less than 256, it is 1 byte. Otherwise, it is 2 bytes. The most significant byte is first.

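In short: the header consists of the magic number "P5", the width, the height, and Maxval, each written in ASCII decimal and separated by whitespace, with 0 < Maxval < 65536. A single whitespace character (usually a newline) follows, and then the raster: Height rows from top to bottom, each holding Width gray values from left to right, where 0 is black and Maxval is white. Each value takes 1 byte if Maxval is less than 256 and 2 bytes otherwise, most significant byte first.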

Opening a PGM file in Notepad++ shows:

P5
92 112
255
01-/19'*515<L[c_PKB6/12+.5=FTi厒n^Qk_P9...

Its magic number is P5, its width is 92, its height is 112, and its maximum gray value is 255.
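
The header layout listed above can be read without assuming that each field sits on its own line. The following is a minimal sketch, not the script's own method: `read_pgm_header` is a hypothetical helper name, and it also skips `#` comment lines, which the Netpbm format allows in the header.

```python
import re

# Any run of whitespace and '#' comment lines
_SKIP = re.compile(rb"(?:\s+|#[^\r\n]*)*")
_NUMBER = re.compile(rb"\d+")


def read_pgm_header(data):
    """Parse the header of a binary (P5) PGM held in `data` (bytes).

    Returns (width, height, maxval, offset), where offset is the index
    of the first raster byte. Hypothetical helper for illustration.
    """
    if data[:2] != b"P5":
        raise ValueError("not a binary PGM: magic number is not P5")
    pos = 2
    values = []
    while len(values) < 3:
        # Skip whitespace and comments in front of each number
        pos = _SKIP.match(data, pos).end()
        m = _NUMBER.match(data, pos)
        if m is None:
            raise ValueError("malformed PGM header")
        values.append(int(m.group()))
        pos = m.end()
    width, height, maxval = values
    if not 0 < maxval < 65536:
        raise ValueError("Maxval must be between 1 and 65535")
    # Exactly one whitespace character separates Maxval from the raster
    return width, height, maxval, pos + 1


# Same header as the file shown above, followed by an all-black raster
data = b"P5\n92 112\n255\n" + bytes(92 * 112)
print(read_pgm_header(data))
```

The returned offset tells the caller where the raster begins, so the pixel bytes can be sliced as `data[offset:]`.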

`Plain PGM`

There is another variant of the PGM format, called Plain PGM, whose magic number is P2. It differs in the following ways:

1. There is exactly one image in a file.
2. The magic number is P2 instead of P5. 
3. Each pixel in the raster is represented as an ASCII decimal number (of arbitrary size).
4. Each pixel in the raster has white space before and after it. There must be at least one character of white space between any two pixels, but there is no maximum.
5. No line should be longer than 70 characters.

In other words, a Plain PGM file holds a single image whose pixels are written as whitespace-separated ASCII decimal numbers, and no line should be longer than 70 characters.

An example:

P2
# feep.pgm
24 7
15
0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
0  3  3  3  3  0  0  7  7  7  7  0  0 11 11 11 11  0  0 15 15 15 15  0
0  3  0  0  0  0  0  7  0  0  0  0  0 11  0  0  0  0  0 15  0  0 15  0
0  3  3  3  0  0  0  7  7  7  0  0  0 11 11 11  0  0  0 15 15 15 15  0
0  3  0  0  0  0  0  7  0  0  0  0  0 11  0  0  0  0  0 15  0  0  0  0
0  3  0  0  0  0  0  7  7  7  7  0  0 11 11 11 11  0  0 15  0  0  0  0
0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
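
Because a Plain PGM file is plain text, it can be parsed by splitting it into tokens. The sketch below is only an illustration: `parse_plain_pgm` is a hypothetical helper, and the 4 x 2 sample is a made-up cut-down image rather than the full file above.

```python
def parse_plain_pgm(text):
    """Parse a Plain (P2) PGM given as a string.

    Returns (width, height, maxval, rows), where rows is a list of
    `height` lists, each holding `width` integer gray values.
    """
    tokens = []
    for line in text.splitlines():
        # Drop everything from '#' to the end of the line (a comment)
        tokens.extend(line.split("#", 1)[0].split())
    if not tokens or tokens[0] != "P2":
        raise ValueError("not a Plain PGM: magic number is not P2")
    width, height, maxval = (int(t) for t in tokens[1:4])
    pixels = [int(t) for t in tokens[4:]]
    if len(pixels) != width * height:
        raise ValueError("expected %d pixels, found %d"
                         % (width * height, len(pixels)))
    # Cut the flat pixel list into rows, top to bottom
    rows = [pixels[r * width:(r + 1) * width] for r in range(height)]
    return width, height, maxval, rows


sample = """P2
# tiny made-up sample
4 2
15
0 3 7 11
15 0 0 15
"""
width, height, maxval, rows = parse_plain_pgm(sample)
print(width, height, maxval)
print(rows)
```

Splitting on any whitespace means the parser does not care how the pixels are spread across lines, which matches the rule that only a minimum of one whitespace character between pixels is required.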

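Why would a single gray value take two bytes?

We usually work with 8-bit-per-channel images, where each value lies in [0, 255], so one byte is enough.

However, deeper gray scales exist, such as 16-bit grayscale, and those need 2 bytes per value.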
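
To make the 1-byte versus 2-byte distinction concrete, here is a small sketch using the standard `struct` module. `decode_samples` is a hypothetical helper name, and the byte strings are made-up inputs.

```python
import struct


def decode_samples(raster, count, maxval):
    """Decode `count` gray values from raw PGM raster bytes."""
    if maxval < 256:
        # One byte per sample: each byte is already the gray value
        return list(raster[:count])
    # Two bytes per sample: '>' means big-endian (most significant
    # byte first), 'H' means unsigned 16-bit integer
    return list(struct.unpack(">%dH" % count, raster[:2 * count]))


print(decode_samples(b"\x00\x80\xff", 3, 255))        # [0, 128, 255]
print(decode_samples(b"\x01\x00\xff\xff", 2, 65535))  # [256, 65535]
```

In the second call the bytes `01 00` decode to 256 because the first byte is the most significant one.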

Format conversion

I wrote a Python program that can convert either a single image or a whole batch.

# -*- coding: utf-8 -*-
from __future__ import print_function

import cv2
import time
import os
import operator
import numpy as np
import argparse
from PIL import Image

__author__ = 'zj'

image_formats = ['jpg', 'JPG', 'jpeg', 'JPEG', 'png', 'PNG']


def is_pgm_file(in_path):
    # The path must be a string ending in '.pgm' ...
    if not isinstance(in_path, str) or not in_path.endswith('.pgm'):
        return False
    # ... and it must point to an existing regular file
    return os.path.isfile(in_path)


def convert_pgm_by_PIL(in_path, out_path):
    if not is_pgm_file(in_path):
        raise Exception("%s is not a PGM file" % in_path)
    # Read the file; PIL picks the output format from the extension of out_path
    im = Image.open(in_path)
    im.save(out_path)


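def convert_pgm_P5(in_path, out_path):
    """
    Convert a binary (P5) PGM file to another image format.
    Reads the file in binary mode: first the magic number, then the width and height, then the maximum gray value.
    Assumes the magic number, the dimensions, and the maximum value each sit on their own line, with no comment lines.
    :param in_path: path to the input PGM file
    :param out_path: path to the output file
    """
    if not is_pgm_file(in_path):
        raise Exception("%s is not a PGM file" % in_path)
    with open(in_path, 'rb') as f:
        # Read the first line (the magic number) and decode it to a string
        magic_number = f.readline().strip().decode('utf-8')
        if magic_number != "P5":
            raise Exception("Invalid image: the magic number is not P5")
        # Read the width and the height (in that order)
        width, height = f.readline().strip().decode('utf-8').split()
        width = int(width)
        height = int(height)
        # Read the maximum gray value
        maxval = int(f.readline().strip())
        # Each gray value takes 1 byte if maxval < 256, otherwise 2 bytes, most significant byte first
        dtype = np.dtype('u1') if maxval < 256 else np.dtype('>u2')
        # Read the raster into an array of shape (rows, columns) = (height, width)
        raster = f.read(width * height * dtype.itemsize)
        img = np.frombuffer(raster, dtype=dtype).reshape(height, width)
        # Convert to a native-endian array; 16-bit data needs an output format that supports it, such as PNG
        cv2.imwrite(out_path, img.astype(np.uint8 if maxval < 256 else np.uint16))
        print('%s save ok' % out_path)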


def convert_pgm_P5_batch(in_dir, out_dir, res_format):
    """
    Convert PGM files in batch.
    :param in_dir: path to the folder containing the PGM files
    :param out_dir: path to the output folder
    :param res_format: output image format
    """
    if not os.path.isdir(in_dir):
        raise Exception('%s is not a directory' % in_dir)
    if not os.path.isdir(out_dir):
        raise Exception('%s is not a directory' % out_dir)
    if res_format not in image_formats:
        raise Exception('%s is not supported yet' % res_format)
    file_list = os.listdir(in_dir)
    for file_name in file_list:
        file_path = os.path.join(in_dir, file_name)
        # If the path is a PGM file, convert it
        if is_pgm_file(file_path):
            file_out_path = os.path.join(out_dir, os.path.basename(file_name) + '.' + res_format)
            convert_pgm_P5(file_path, file_out_path)
        # If it is a directory, create the matching output directory and recurse
        elif os.path.isdir(file_path):
            file_out_dir = os.path.join(out_dir, file_name)
            if not os.path.exists(file_out_dir):
                os.mkdir(file_out_dir)
            convert_pgm_P5_batch(file_path, file_out_dir, res_format)
        else:
            pass
    print('batch operation over')


if __name__ == '__main__':
    script_start_time = time.time()

    parser = argparse.ArgumentParser(description='Format Converter - PGM')

    ### Positional arguments

    ### Optional arguments

    parser.add_argument('-i', '--input', type=str, help='Path to the pgm file')
    parser.add_argument('-o', '--output', type=str, help='Path to the result file')
    parser.add_argument('--input_dir', type=str, help='Dir to the pgm files')
    parser.add_argument('--output_dir', type=str, help='Dir to the result files')
    parser.add_argument('-f', '--format', default='png', type=str, help='result image format')
    parser.add_argument('-b', '--batch', action="store_true", default=False, help='Batch processing')

    args = vars(parser.parse_args())
    # print(args)
    in_path = args['input']
    out_path = args['output']

    isbatch = args['batch']
    in_dir = args['input_dir']
    out_dir = args['output_dir']
    res_format = args['format']

    if in_path is not None and out_path is not None:
        # Convert a single PGM file
        convert_pgm_P5(in_path, out_path)
        # convert_pgm_by_PIL(in_path, out_path)
    elif isbatch:
        # Batch conversion
        convert_pgm_P5_batch(in_dir, out_dir, res_format)
    else:
        print('Please provide the required arguments')

    print('Script took %s seconds.' % (time.time() - script_start_time,))

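PIL can also read a PGM file and save it in another image format. I wrote my own binary parser instead, which runs about 2.5 times faster than calling PIL.

Converting a single `PGM` file: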

python PGMConverter.py -i INPUT -o OUTPUT

For example:

python PGMConverter.py -i 1.pgm -o 3.png

Converting a whole `PGM` folder:

python PGMConverter.py --batch --input_dir INPUT_DIR --output_dir OUTPUT_DIR -f FORMAT

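Replace INPUT_DIR with the path of the folder containing the PGM files, OUTPUT_DIR with the path of the output folder (which must be created beforehand), and FORMAT with the output image format.

For example: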

python PGMConverter.py --batch --input_dir c:\\face\\att_faces --output_dir c:\\face\\att_face_png -f png
