1. 程式人生 > >基於python3.6.6的scrapy環境部署+圖像識別插件安裝

基於python3.6.6的scrapy環境部署+圖像識別插件安裝

軟件 b- ont -o cal text bit 添加 all

一、Python3.6.6安裝
1、安裝依賴的二進制軟件包
yum -y install zlib zlib-devel bzip2 bzip2-devel ncurses ncurses-devel readline readline-devel openssl openssl-devel openssl-static xz lzma xz-devel sqlite sqlite-devel gdbm gdbm-devel tk tk-devel gcc gcc-c++
2、將Python-3.6.6.tgz放到/usr/local/src目錄下
3、解壓軟件包
tar xf Python-3.6.6.tgz

4、進入Python-3.6.6
cd Python-3.6.6
5、檢查環境配置
./configure --prefix=/usr/local/python3
6、編譯安裝
make && make install
ln -s /usr/local/python3/bin/python3 /usr/bin/python3
7、添加python3的環境變量
vim /etc/profile.d/python3.sh加入如下內容
export PATH=$PATH:$HOME/bin:/usr/local/python3/bin
重讀配置文件
source /etc/profile.d/python3.sh
查看下版本:python3 --version

# python3 --version
Python 3.6.6

二、安裝scrapy框架
pip3 install lxml
pip3 install wheel
pip3 install scrapy

三、安裝selenium, PhantomJS
wget https://bitbucket.org/ariya/phantomjs/downloads/phantomjs-2.1.1-linux-x86_64.tar.bz2
tar -xvjf phantomjs-2.1.1-linux-x86_64.tar.bz2
cp -R phantomjs-2.1.1-linux-x86_64 /usr/local/share/
ln -sf /usr/local/share/phantomjs-2.1.1-linux-x86_64/bin/phantomjs /usr/local/bin/

pip3 install selenium

四、安裝tesserocr, PIL
yum install -y tesseract tesseract-devel leptonica-devel
git clone https://github.com/tesseract-ocr/tessdata.git
mv tessdata/* /usr/share/tesseract/tessdata
pip3 install tesserocr pillow

五、測試一下
>>> import PIL import Image
>>> from PIL import Image
>>> import tesserocr
>>> p=Image.open(‘/opt/20180823090940.png‘)
>>> s=tesserocr.image_to_text(p)
>>> print(s)
5890

基於python3.6.6的scrapy環境部署+圖像識別插件安裝