1. 程式人生 > >Python使用Xpath輕松爬蟲(腦殘式)

Python使用Xpath輕松爬蟲(腦殘式)

安裝 .html alt img 分享 技術 bubuko www tps

1.在PyCharm安裝lxml.

2.找到源碼

3.F12、copy源碼的xpath

技術分享圖片

4.代碼

from lxml import etree
import requests

wb_data = requests.get("https://www.baidu.com/").text
html = etree.HTML(wb_data)
html_data = html.xpath(‘//*[@id="lh"]/a[2]‘);
for i in html_data:
    print(i.text)

  

Python使用Xpath輕松爬蟲(腦殘式)