
Scraping Taobao product information with a Python crawler

2020-01-04 15:48:07
Source: reposted
Contributed by: a site user

This article shares a complete Python example for scraping product prices and titles from Taobao search results, for reference. The code is as follows:

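import re

import requests as req


def getHTMLText(url):
    """Fetch a page and return its text, or an empty string if the request fails."""
    try:
        r = req.get(url, timeout=30)
        r.raise_for_status()
        r.encoding = r.apparent_encoding
        return r.text
    except req.RequestException:
        return ""


def parasePage(ilt, html):
    """Extract price/title pairs from the page source and append them to ilt."""
    # Prices and titles appear in the page source as "view_price":"..." and "raw_title":"..." fields.
    plt = re.findall(r'"view_price":"[\d.]*"', html)
    tlt = re.findall(r'"raw_title":".*?"', html)
    for p, t in zip(plt, tlt):
        # Split on the first colon only, because a title may itself contain ':'.
        price = p.split(':', 1)[1].strip('"')
        title = t.split(':', 1)[1].strip('"')
        ilt.append([price, title])


def printGoodsList(ilt):
    """Print the collected items as a tab-separated table."""
    tplt = "{:4}\t{:8}\t{:16}"
    print(tplt.format("No.", "Price", "Product name"))
    for count, item in enumerate(ilt, start=1):
        print(tplt.format(count, item[0], item[1]))


def main():
    goods = "python爬虫"  # search keyword ("Python crawler")
    depth = 3  # number of result pages to fetch
    start_url = 'https://s.taobao.com/search?q=' + goods
    infoList = []
    for i in range(depth):
        # The s parameter is the offset of the first item on the page, in steps of 44.
        url = start_url + '&s=' + str(44 * i)
        html = getHTMLText(url)
        parasePage(infoList, html)
    printGoodsList(infoList)


if __name__ == "__main__":
    main()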
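
The parsing and URL-building steps in the code above can be tried without any network access. The following is only a minimal sketch under stated assumptions: the names build_search_url, parse_items, ITEMS_PER_PAGE and SAMPLE_HTML are hypothetical and not part of the original article, and SAMPLE_HTML is a made-up fragment that merely imitates the "raw_title" and "view_price" fields the original regular expressions look for. It uses capture groups so that no eval or manual splitting is needed, and urlencode so that the Chinese keyword is percent-encoded in the URL.

```python
import re
from urllib.parse import urlencode

# Assumption: page size taken from the `44*i` offset used in the article's code.
ITEMS_PER_PAGE = 44

# Capture groups return the field values directly, without surrounding quotes.
PRICE_RE = re.compile(r'"view_price":"([\d.]*)"')
TITLE_RE = re.compile(r'"raw_title":"(.*?)"')


def build_search_url(keyword, page):
    """Build the search URL for a zero-based page index (hypothetical helper)."""
    query = urlencode({"q": keyword, "s": ITEMS_PER_PAGE * page})
    return "https://s.taobao.com/search?" + query


def parse_items(html):
    """Return a list of (price, title) tuples found in the page source."""
    prices = PRICE_RE.findall(html)
    titles = TITLE_RE.findall(html)
    # zip stops at the shorter list, so a missing field cannot raise IndexError.
    return list(zip(prices, titles))


# Hypothetical fragment for illustration only; not real Taobao output.
SAMPLE_HTML = (
    '{"raw_title":"Python crawler book","view_price":"59.00"},'
    '{"raw_title":"Web scraping guide","view_price":"35.50"}'
)

if __name__ == "__main__":
    print(build_search_url("python爬虫", 1))
    for price, title in parse_items(SAMPLE_HTML):
        print(price, title)
```

Because parse_items takes a plain string, it can be checked against a saved page source before any requests are sent.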

Result:

[Screenshot of the program output; image alt text: python, crawler, scraping, Taobao]

That is all for this article. We hope it helps with your studies, and we hope you will continue to support VEVB武林网.

