python學習筆記 Day 18 下載資料及 Web API

阿新 • • 發佈：2018-12-21

Day 18 下載資料及 Web API

python常用模組小結

CSV資料檔案訪問分析

使用CSV

import csv

filename = 'sitka_weather_07-2014.csv'
with open(filename) as f:
	reaer = csv.reader(f)
	header_row = next(reader)

enumerate()函式：enumerate() 函式用於將一個可遍歷的資料物件(如列表、元組或字串)組合為一個索引序列，同時列出資料和資料下標，一般用在 for 迴圈當中。

enumerate 
(sequence, [start=0])

Sample:

	with open(filename) as f:
		reader = csv.reader(f)
		header_row = next(reader)
		for index, column_header in enumerate(header_row):
			print (index, column_header)

遍歷csv檔案並提取資料：for + append

with open(filename) as f:
reader = csv.reader(f)
header_row = next(reader) 


dates, highs, lows = [], [], []
for row in reader:
	current_date = datetime.strptime(row[0], "%Y-%m-%d")
	high = int(row[1])
	low = int(row[3])
	dates.append(current_date)
	highs .append(high)
	lows.append(low)

錯誤處理

with open(filename) as f:
reader = csv.reader(f)
header_row = next(reader)

dates, 
 highs, lows = [], [], []
for row in reader:
	try:
		current_date = datetime.strptime(row[0], "%Y-%m-%d")
		high = int(row[1])
		low = int(row[3])
	except ValueError:
		print (current_date, 'missing data')
	else:
		dates.append(current_date)
		highs .append(high)
		lows.append(low)

JSON格式
- pygal.i18n 不存在，No module named 'pygal.i18n’錯誤：
  - 改用pygal_maps_world.i18n：
    - OS X
```
$ pip install pygal_maps_world
```
    - Windows
```
\> python -m pip install pygal_maps_world
```
  - 將’ from pygal.i18n import COUNTRIES '改為
```
from pygal_maps_world.i18n import COUNTRIES		```
```
- module ‘pygal’ has no attribute ‘Worldmap’ 錯誤
  - 改用‘pygal_maps_world’
```
import pygal_maps_world.maps

wm = pygal_maps_world.maps.World()
```

Web API

Web API用於與網站進行互動，請求資料（以JSON或CSV返回）。
requests包，讓python能向網站請求資訊以及檢查返回的響應。
- 安裝requests包
  - OS X
```
$ pip install --user requests
```
```
  - Windows
```
```
$ python -m pip install --user requests
```

處理並響應字典

	import requests
	
	#執行API呼叫並存儲響應
	url = "https://api.github.com/search/repositories?q=language:python&sort=stars"
	r = requests.get(url)
	print ("Status code: ", r.status_code)
	
	#將API響應儲存在一個字典變數中
	response_dict = r.json()
	print ("Total repositories: ", response_dict['total_count'])
	
	#探索有關倉庫的資訊
	repo_dicts = response_dict['items']
	print ("Repositories returned: " , len(repo_dicts))
	
	#研究第一個倉庫
	repo_dict = repo_dicts[0]
	print ("\nKeys:", len(repo_dict))
	for key in repo_dict.keys():
		print (key)

進一步研究‘倉庫’

	#研究第一個倉庫
	for repo_dict in repo_dicts:
		print ("\nSelcted information about first repository: ")
		print ('Name: ' + repo_dict['name'])
		print ('Owner: ' , repo_dict['owner']['login'])
		print ('Start: ' , repo_dict['stargazers_count'])
		print ('Repository: ', repo_dict['html_url'])
		print ('Created: ', repo_dict['created_at'])
		print ('Updated: ', repo_dict['updated_at'])
		print ('Description: ', repo_dict['description'])

‘NoneType’ object has no attribute ‘decode’ 錯誤：執行下面的程式碼時出現上述錯誤：

	names, plot_dicts = [], []
	for repo_dict in repo_dicts:
		names.append(repo_dict['name'])
		plot_dict = {
			'value': repo_dict['stargazers_count'],
			'label': repo_dict['description'] ,
			}
		plot_dicts.append(plot_dict)
		
	#視覺化
	my_style = LS('#333366', base_style = LCS)
	
	my_config = pygal.Config()
	my_config.x_label_rotation = 45
	my_config.show_legend = False
	my_config.title_font_size = 24
	my_config.label_font_size = 14
	my_config.major_label_font_size = 18
	my_config.truncate_label = 15
	my_config_show_y_guides = False
	my_config.width = 1000
	
	chart = pygal.Bar(my_config, style = my_style)
	chart.title = 'Most-starred Python Projects on GitHub'
	chart.x_labels = names
	
	chart.add('', plot_dicts)
	chart.render_to_file('python_repos.svg')

參考下面兩種解決辦法：

第一種方法，即：

'label': str(repo_dict['description']),

改為：

'label': str(repo_dict['description']),

既簡單又方便。

Hacker News API，學習以下三個知識點：

根據Web API呼叫返回的列表，動態生成WEB API呼叫網址，並再次呼叫WEB API訪問並獲取資料；
字典的dict.get()函式，不確定某個鍵是否包含在字典中時，可使用方法dict.get()，它在指定的鍵存在時返回與之相關的值，在指定的鍵不存在時返回第二個實參指定的值
模組operator中的函式item getter()，以及與sorted()函式的配合使用。這個函式傳遞鍵’comments’，它將從這個列表中的每個字典中提取與鍵’comments’相關的值，函式sorted()將根據這種值對列表進行排序

import requests
from operator import itemgetter

#執行API呼叫並存儲響應
url = 'https://hacker-news.firebaseio.com/v0/topstories.json'
r = requests.get(url)
print ('Status code: ', r.status_code)

#處理有關每篇文章的資訊
submission_ids = r.json()
#建立submission_dicts空列表，用於儲存熱門文章字典
submission_dicts = []

#取前30個熱門文章ID
for submission_id in submission_ids[:30]:
	#對於每篇文章，都執行一個API呼叫
	#根據儲存在submission_ids列表中的ID生成URL
	url = ('https://hacker-news.firebaseio.com/v0/item/' + 
		str(submission_id) + '.json')
	submission_r = requests.get(url)
	print(submission_r.status_code)

	response_dict = submission_r.json()

	#為當前處理的文章生成一個字典	
	submission_dict = {
	'title': response_dict['title'],
	'link': 'http://news.ycombinator.com/item?id=' + str(submission_id),
	'comments': response_dict.get('descendants', 0)
	}
	submission_dicts.append(submission_dict)

submission_dicts = sorted(submission_dicts, key = 
	itemgetter('comments'),reverse = True)

for submission_dict in submission_dicts:
	print ('\nTitle: ', submission_dict['title'])
	print ('Discussion link: ', submission_dict['link'])
	print ('Comments: ', submission_dict['comments'])

上面這段程式碼返回的資料結果：

[{"title": "Glitter bomb tricks parcel thieves", 
"link": "http://news.ycombinator.com/item?id=18706193", 
"comments": 304}, 
{"title": "Stop Learning Frameworks", 
"link": "http://news.ycombinator.com/item?id=18706785", 
"comments": 175}, 
{"title": "Reasons Python Sucks", 
"link": "http://news.ycombinator.com/item?id=18706174", 
"comments": 175}, 
{"title": "I need to copy 2000+ DVDs in 3 days. What are my options?", 
"link": "http://news.ycombinator.com/item?id=18690587", 
"comments": 167}, 
{"title": "SpaceX Is Raising $500M at a $30.5B Valuation", 
"link": "http://news.ycombinator.com/item?id=18706506", 
"comments": 139}, 
.........
]

python學習筆記 Day 18 下載資料及 Web API

Day 18 下載資料及 Web API python常用模組小結 CSV資料檔案訪問分析使用CSV import csv filename = 'sitka_weather_07-2014.csv' with open(file

Python學習筆記 Day12 json儲存資料及階段總結

Day 12 json儲存資料及階段總結 json格式化 JSON(JavaScript Object Notation) 是一種輕量級的資料交換格式。它基於 ECMAScript (歐洲計算機協會制定的js規範)的一個子集，採用完全獨立於程式語言的文字

python學習筆記 Day 17 資料視覺化

Day 17 資料視覺化安裝matplotlib OS X安裝matplotlib $ pip install --user matplotlib Windows安裝matplotlib 課程中說，在Windows系統中安裝m

python學習筆記——Day 3

calc return pro args 速度 lambda day 開始 class 字典特性：無順序去重查詢速度快，比列表快多了比list占用內存多函數非固定參數：若你的函數在定義時不確定用戶想傳入多少個參數，就可以使用非固定參數 def stu_

Python學習筆記三——文件操作及處理json

r+ 3.4 windows phone wow64 con odin 某個文件 like 一、文件操作基礎知識： 1.open是打開已存在的文件或新建一個文件(在文件名後需加訪問模式) 2.close是把剛剛新建或打開的文件關閉 3.write可以向文件中導入數據

python學習筆記一：基本資料型別

1、python的一切都是物件，物件是包含屬性和方法的一個整體。 2、資料型別的組成：身份（記憶體地址，通過id方法可看它的唯一識別符號）；型別（通過type方法檢視）；值（資料項） 3、常用基本資料型別 int 整型 bool 布林

python學習筆記17：下載微信公眾號相關文章

目的：從零開始學自動化測試公眾號中下載“pytest"一系列文件 1、搜尋微訊號文章關鍵字搜尋 2、對搜尋結果前N頁進行解析，獲取文章標題和對應URL 主要使用的是requests和bs4中的Beautifulsoup Weixin.py import requests from

Python學習筆記 Day 16 專案 -外星人入侵 -4

Day 16 - 外星人入侵-4 建立Button類，用於實現按鈕 python語句可以這麼寫：（自我體會：python語句靈活，例如if、for等語句完全靠冒號‘:’和縮排來定義結構塊，而不是依靠‘{ }’或“( )”，靈活帶來的一個問題就是容易出錯）

Python學習筆記 Day 15 專案 -外星人入侵 -3

Day 15 專案 -外星人入侵 - 3 軟體開發，階段性劃分重構清理；重新複習了range的用法：在繪製外星人群組的時候，用到了下面的語句：for row_number in range(number_rows): range()函式，產生了一個

Python學習筆記 Day 14 專案 -外星人入侵 - 2

Day 14 專案 -外星人入侵 - 2 首先是歸納Day13學習到的有關pygame的知識，用流程圖方式，繪圖軟體：https://www.draw.io/ 初始化視窗，包括獲取控制代碼，獲取視窗矩形 screen = pyga

Python學習筆記 Day 13 專案 -外星人入侵 - 1，pygame安裝，OS X / Windows

Day 13 專案 - 外星人入侵 - 1，pygame安裝，OS X / Windows python基礎學習告一段落，開始進入實習階段。第一個實習內容，利用Pygame構建一個外星人入侵的專案。安裝Pygame：使用pip安裝python包：

Python學習筆記 Day10 類的定義及使用 part2 類的繼承 + 駝峰命名法則

Day 10 類的繼承子類是父類的特殊版本，子類自動獲得父類（超類）所有的屬性及方法，同時可以有自己特殊的屬性和方法，也可以重新定義（重構）父類的方法；類繼承的定義及初始化class SubClass (SuperClass): def __init__(self,

Python學習筆記 Day9 類的定義及使用 part 1

Day 9 類的定義及使用 part 1 類的定義 class Class_name(): 初始化 def init(self, param1, para2, …): 定義屬性，通常，在初始化函式中給類屬

Python學習筆記 Day4 列表 part 3及for迴圈

Day4 列表 part 3及for迴圈與C、C++、Pascal、Java等不同，Python變數隨用隨定義即可？只要有賦值操作即可？ magicians = ['alice', 'david', 'carolina'] for magician in magicians: p

Python學習筆記 Day 20 - Django基礎知識彙總

Day 20 - Django部分基礎知識彙總部分內容來自：Django框架全面講解，感謝原文作者。 1、常用命令虛擬環境的安裝啟用命令程式碼功能描述命令程式碼測試是否安裝

python學習筆記 Day 19 Web應用程式 - Django入門

Day 19 Django入門前言 Django是一個web應用框架，http://djangoproject.com/，在此框架內，可以開發互動式網站；個人認為這篇文章將Django講解的比較清楚： Django框架全面講解文中的配圖較模糊

python學習筆記1：變數+資料型別+字串

變數大駝峰：首字母均大寫，一般用於給類命名 MathTeacher 小駝峰：第一個單詞的首字母大寫，其餘小寫，一般給普通變數或函式命名 numOne posix: 單詞全部小寫，用下劃線連線，推薦此方法 num_one&nb

Python學習筆記 Day 21 - Django設定樣式並部署

Python學習筆記 Day 20 - Django設定樣式並部署設定樣式安裝django-bootstrap3 (env) learning_log$ pip install django_bootstrap3 修

python 學習筆記（2）資料型別1 (bool型, 數值型別,lists列表型別)

宣告：本文系本人學習python3總結，如有侵權等，請及時告知；一、型別預覽 1. Booleans［布林型］或為 True［真］或為 False［假］。 2. Numbers［數值型］可以是 Integers［整數］（1 和 2）、 Floats［浮點數］（1.1

Python學習筆記 Day7 對資料型別的總結、input輸入及函式定義

Day 7 對資料型別的總結、input輸入及函式定義複習前6天的內容 Python基本資料型別之一 python基本資料型別之二：列表複習聯絡問題：用remove，結合for或者while刪除列表內容: bicycles = ['trek', 'canno

python學習筆記 Day 18 下載資料 及 Web API

Day 18 下載資料 及 Web API

相關推薦

python學習筆記 Day 18 下載資料及 Web API

Day 18 下載資料及 Web API