• 大小: 17KB
    文件类型: .rar
    金币: 1
    下载: 0 次
    发布日期: 2021-05-14
  • 语言: Python
  • 标签: python  

资源简介

python作为人工智能或者大数据的宠儿,我自然要学习,作为一个小白,第一个实现的工能就是爬虫,爬数据,收集数据,我以我爬微博的事情为例子,附上代码,大家一起学习

资源截图

代码片段和文件信息

#!/usr/bin/python
# -*- coding: UTF-8 -*-
import requests
import MySQL
import json
import urllib
import weibobase
import sys
import time
reload(sys)
sys.setdefaultencoding(‘utf-8‘)
cookie = {“Apache“: “4085444405648.557.1517558859962“
          “H5_INDEX“: “2“
          “H5_INDEX_title“: “%E7%A7%8B%E5%86%AC%E6%9A%96%E8%89%B2%E7%B3%BB“
          “M_WEIBOCN_PARAMS“: “lfid%3D1005052109066367%252Fhome%26luicode%3D20000174%26fid%3D102803%26uicode%3D10000011“
          “SCF“: “AlPdz7Wu9iu_xwiWfMtd1hBGr6mZqaKtCcidCgPrDl6ocdl8HcIvA5NZpk0cm36a0xrCpnFl0ZgfV-Bc5BUAktQ.“
          “SSOLoginState“: “1520562809“
          “SUB“: “_2A253pYIoDeRhGeRP61sR9ijPzTuIHXVVaS5grDV6PUJbktAKLRLQkW1NUFPZQRFUxRYf5itrGk6VqEtGIU3izGDT“
          “SUBP“: “0033WrSXqPxfM725Ws9jqgMF55529P9D9W5MyLbIiX5quKaqF190KSgT5JpX5K-hUgL.Fozpeh.7Soq0SoM2dJLoIEXLxKMLBKML12zLxK-L1hqLB-eLxKqL1-2L1KqLxKnL1h.LBozLxKMLBoeLB.zt“
          “SUHB“: “0Elrkzb0Smx-GW“
          “WEIBOCN_FROM“: “1110006030“
          “_T_WM“: “46f8072dc2db4752c9f5f1bb610d6934“
          “browser“: “d2VpYm9mYXhpYW4%3D“
          “h5_deviceID “: “da4db009e6ae38320111cc4fbc8d1998“
          }

cookie2 = {“ALF“: “1522043003“
          “M_WEIBOCN_PARAMS“: “luicode%3D10000011%26lfid%3D102803%26fid%3D102803%26uicode%3D10000011“
          “SCF“: “AlPdz7Wu9iu_xwiWfMtd1hBGr6mZqaKtCcidCgPrDl6oNht3rRthMvGzFst-DncCt1l6_LYi6h6jCGNO6OtXVDU.“
          “SUB“: “_2A253lIvWDeRhGeRP61sR9ijPzTuIHXVVdhWerDV6PUJbktANLVTakW1NUFPZQVmJdEJdcebLE3J8mIqAPe4rxEz4“
          “SUBP“: “0033WrSXqPxfM725Ws9jqgMF55529P9D9W5MyLbIiX5quKaqF190KSgT5JpX5K-hUgL.Fozpeh.7Soq0SoM2dJLoIEXLxKMLBKML12zLxK-L1hqLB-eLxKqL1-2L1KqLxKnL1h.LBozLxKMLBoeLB.zt“
          “SUHB“: “0pHAjcQEUb1cye“
          “WEIBOCN_FROM“: “1110006030“
          “_T_WM“: “46f8072dc2db4752c9f5f1bb610d6934“
          “browser“: “d2VpYm9mYXhpYW4%3D“
          “h5_deviceID “: “da4db009e6ae38320111cc4fbc8d1998“
          }

headers = {
    ‘Accept‘: ‘text/htmlapplication/xhtml+xmlapplication/xml;q=0.9image/webpimage/apng*/*;q=0.8‘
    ‘Accept-Encoding‘:“gzip deflate br“
    ‘Accept-Language‘: ‘zh-CNzh;q=0.9‘
    ‘Cache-Control‘: ‘max-age=0‘
    ‘Connection‘: ‘keep-alive‘
    ‘Host‘: ‘m.weibo.cn‘
    ‘Cookie‘:‘browser=d2VpYm9mYXhpYW4%3D; h5_deviceID=da4db009e6ae38320111cc4fbc8d1998; _T_WM=46f8072dc2db4752c9f5f1bb610d6934; ALF=1523154787; SCF=AlPdz7Wu9iu_xwiWfMtd1hBGr6mZqaKtCcidCgPrDl6ocdl8HcIvA5NZpk0cm36a0xrCpnFl0ZgfV-Bc5BUAktQ.; SUB=_2A253pYIoDeRhGeRP61sR9ijPzTuIHXVVaS5grDV6PUJbktAKLRLQkW1NUFPZQRFUxRYf5itrGk6VqEtGIU3izGDT; SUBP=0033WrSXqPxfM725Ws9jqgMF55529P9D9W5MyLbIiX5quKaqF190KSgT5JpX5K-hUgL.Fozpeh.7Soq0SoM2dJLoIEXLxKMLBKML12zLxK-L1hqLB-eLxKqL1-2L1KqLxKnL1h.LBozLxKMLBoeLB.zt; SUHB=0Elrkzb0Smx-GW; SSOLoginState=1520562809; H5_INDEX=2; H5_INDEX_title=%E7%A7%8B%E5%86%AC%E6%9A%96%E8%89%B2%E7%B3%BB; WEIBOCN_FROM=1110006030; M_WEIBOCN_PARAMS=luicode%3D10000011%26lfid%3D102803%26fid%3D102803%26uicode%3D10000011‘
    ‘RA-Sid

 属性            大小     日期    时间   名称
----------- ---------  ---------- -----  ----

     文件        434  2018-02-27 11:56  weiboforjson\.idea\inspectionProfiles\Project_Default.xml

     文件        106  2018-02-23 14:54  weiboforjson\.idea\markdown-navigator\profiles_settings.xml

     文件       4414  2018-02-23 14:55  weiboforjson\.idea\misc.xml

     文件        276  2018-02-23 14:54  weiboforjson\.idea\modules.xml

     文件        140  2018-02-23 14:54  weiboforjson\.idea\thriftCompiler.xml

     文件        459  2018-02-23 14:55  weiboforjson\.idea\weiboforjson.iml

     文件      45065  2018-04-08 10:53  weiboforjson\.idea\workspace.xml

     文件       6665  2018-03-09 11:21  weiboforjson\GetWeibo.py

     文件       5741  2018-03-09 11:21  weiboforjson\GetWeibo.pyc

     文件       5945  2018-04-08 10:53  weiboforjson\MySQL.py

     文件       5187  2018-02-27 17:00  weiboforjson\MySQL.pyc

     文件        248  2018-03-09 10:55  weiboforjson\weibo.py

     文件        751  2018-02-24 16:32  weiboforjson\weibobase.py

     文件        954  2018-02-24 16:32  weiboforjson\weibobase.pyc

     目录          0  2018-02-27 11:56  weiboforjson\.idea\inspectionProfiles

     目录          0  2018-02-23 14:54  weiboforjson\.idea\markdown-navigator

     目录          0  2018-04-08 10:53  weiboforjson\.idea

     目录          0  2018-04-08 10:53  weiboforjson

----------- ---------  ---------- -----  ----

                76385                    18


评论

共有 条评论