Counting the total views + comments on a CSDN blog
Published: 2023-11-07 12:11:16
Two weeks ago my IP got banned, so scraping carries risk and should be done with care. Fortunately, proxy IPs are available.
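As a rough illustration of the proxy fallback mentioned above, here is a minimal sketch of routing a request through a proxy with the `proxies` argument of `requests.get`, plus a pause between requests to keep the rate low. The helper names `build_proxies` and `fetch`, the `host:port` address format, and the one-second delay are assumptions made for this example, not part of the original script.

```python
import time


def build_proxies(address):
    """Return a proxies mapping in the shape requests expects.

    `address` is a hypothetical "host:port" string for an HTTP proxy.
    """
    proxy_url = "http://" + address
    return {"http": proxy_url, "https": proxy_url}


def fetch(url, headers, address=None, delay=1.0):
    """Wait `delay` seconds, then GET `url`, optionally through a proxy."""
    import requests  # imported here so build_proxies works without it installed

    time.sleep(delay)  # pause before each request to keep the rate low
    proxies = build_proxies(address) if address else None
    response = requests.get(url, headers=headers, proxies=proxies, timeout=10)
    response.raise_for_status()  # fail loudly on 403/429 instead of parsing an error page
    return response.text
```

Calling `fetch(url, headers, address="127.0.0.1:8080")` would send the request through a proxy at that (placeholder) address; leaving `address` unset sends it directly.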
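# coding=utf-8
import requests
from bs4 import BeautifulSoup

# The base URL of the blog's article-list pages was elided in the source.
# Set it to your own list URL; the page number is appended to it.
base_url = ""
# Number of list pages to scan; set this to match your blog.
page_count = 20
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36"}

# Running total of views + comments
total = 0
for page in range(1, page_count + 1):
    print("Page", page)
    url = base_url + str(page)
    html = requests.get(url, headers=headers)
    soup = BeautifulSoup(html.text, features="html.parser")
    # Each "read-num" span is expected to hold text like "label:123"
    for span in soup.find_all("span", "read-num"):
        num = span.string
        total += int(num.split(":")[1])
print(total)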
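The `split(":")[1]` step in the script assumes every `read-num` span contains text of the form `label:number` with an ASCII colon. If the page uses a full-width colon, or a span has nested tags so that `.string` is `None`, that step raises an exception. Below is a minimal sketch of a more tolerant parser; `parse_count` is a hypothetical helper introduced here, and the sample strings are only guesses at what the page text looks like.

```python
import re


def parse_count(text):
    """Return the first run of digits in `text` as an int, or 0 if there is none."""
    if not text:
        return 0
    match = re.search(r"\d+", text)
    return int(match.group()) if match else 0


# Sample inputs (assumed formats, not captured from a real page)
print(parse_count("阅读数:123"))   # ASCII colon
print(parse_count("评论数:4"))    # full-width colon
print(parse_count(None))          # span whose .string is None
```

In the script, the inner loop body could then use `parse_count(span.get_text())` in place of the `split` call.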
Source: https://blog.51cto.com/u_15879559/5871130