首页 > 编程 > Python > 正文

Python3.4编程实现简单抓取爬虫功能示例

2020-01-04 16:44:55
字体:
来源:转载
供稿:网友

本文实例讲述了Python3.4编程实现简单抓取爬虫功能。分享给大家供大家参考,具体如下:

import urllib.requestimport urllib.parseimport reimport urllib.request,urllib.parse,http.cookiejarimport timedef getHtml(url):  cj=http.cookiejar.CookieJar()  opener=urllib.request.build_opener(urllib.request.HTTPCookieProcessor(cj))  opener.addheaders=[('User-Agent','Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36'),('Cookie','4564564564564564565646540')]  urllib.request.install_opener(opener)  page = urllib.request.urlopen(url)  html = page.read()  return html#print ( html)#html = getHtml("http://weibo.com/")def getimg(html):  html = html.decode('utf-8')  reg='"screen_name":"(.*?)"'  imgre = re.compile(reg)  src=re.findall(imgre,html)  return src#print ("",getimg(html))uid=['2808675432','3888405676','2628551531','2808587400']for a in list(uid):  print (getimg(getHtml("http://weibo.com/"+a)))  time.sleep(1)

希望本文所述对大家Python程序设计有所帮助。

发表评论 共有条评论
用户名: 密码:
验证码: 匿名发表