Easy Web Scraping with XPath in Python (the No-Brainer Way)

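1. Install lxml in PyCharm.

2. Open the source of the page you want to scrape.

3. Press F12 to open the developer tools, then copy the XPath of the target element from the source.

4. Write the code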

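from lxml import etree
import requests

# Download the page HTML (the timeout avoids hanging on a slow connection)
wb_data = requests.get("https://www.baidu.com/", timeout=10).text
# Parse the HTML text into an element tree
html = etree.HTML(wb_data)
# XPath copied from the developer tools: the second <a> under the element with id="lh"
html_data = html.xpath('//*[@id="lh"]/a[2]')
# xpath() returns a list of matching elements; print the text of each one
for i in html_data:
    print(i.text)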
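
To try the XPath expression without hitting the network, here is a minimal sketch that runs the same query against an inline HTML string. The `SAMPLE_HTML` markup and the `select_links` helper are made up for illustration; the real Baidu page may be structured differently. If the live page returns an empty list, one possible cause is that the HTML fetched by `requests` differs from what the browser shows after JavaScript has run.

```python
from lxml import etree

# Hypothetical stand-in for a downloaded page; not the real Baidu markup.
SAMPLE_HTML = """
<html><body>
  <div id="lh">
    <a href="/about">About</a>
    <a href="/contact">Contact</a>
    <a href="/help">Help</a>
  </div>
</body></html>
"""


def select_links(html_text, xpath_expr):
    """Parse html_text and return (text, href) pairs for elements matching xpath_expr."""
    tree = etree.HTML(html_text)
    return [(el.text, el.get("href")) for el in tree.xpath(xpath_expr)]


# XPath positions start at 1, so a[2] is the second <a> child of the id="lh" element.
second = select_links(SAMPLE_HTML, '//*[@id="lh"]/a[2]')
# Dropping the [2] index selects every <a> child instead.
all_links = select_links(SAMPLE_HTML, '//*[@id="lh"]/a')

print(second)
print(all_links)
```

Removing the positional index is usually the first thing to adjust after copying an XPath from the developer tools, since the copied path points at a single element only.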