基本信息
源码名称:最简单爬虫示例(入门级)
源码大小:0.70KB
文件格式:.py
开发语言:Python
更新时间:2019-03-19
友情提示:(无需注册或充值,赞助后即可获取资源下载链接)
嘿,亲!知识可是无价之宝呢,但咱这精心整理的资料也耗费了不少心血呀。小小地破费一下,绝对物超所值哦!如有下载和支付问题,请联系我们QQ(微信同号):813200300
本次赞助数额为: 2 元×
微信扫码支付:2 元
×
请留下您的邮箱,我们将在2小时内将文件发到您的邮箱
源码介绍
from lxml import etree import requests def handle_request(url): heades = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/64.0.3282.140 Safari/537.36 Edge/17.17134', } response = requests.get(url=url,headers = heades).text return response def main(): url = 'https://voice.hupu.com/nba' content = handle_request(url) news = etree.HTML(content) news_content = news.xpath(r'//div[@class="news-list"]/ul/li/div/h4/a/text()') news_url = news.xpath(r'//div[@class="news-list"]/ul/li/div/h4/a/@href') for new in zip(news_content ,news_url): print(new) if __name__ == '__main__': main()