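Resource overview
A Python crawler that logs in to the 小木虫 (muchong.com) forum and scrapes posts from its dating board. Logging in takes two steps, and the second step asks you to answer a simple question; see the source code.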
Code snippet and file information
# coding: utf-8
import re

import requests
from bs4 import BeautifulSoup

if __name__ == "__main__":
    session = requests.session()
    # Set the User-Agent on the session so that every request sends it;
    # assigning to the headers of a response has no effect on later requests.
    session.headers.update({"User-Agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/54.0.2840.100 Safari/537.36"})
    g = session.get('http://muchong.com/bbs/logging.php?action=login')
    # Extract the t token from the login link on the page.
    se = re.search(r'action=login&t=(.*?)">登录', g.text)
    urls = r'http://muchong.com/bbs/logging.php?action=login&t=' + se.group(1)
    # Read the value of the submit button from the login form.
    se = re.search(r'name="loginsubmit" value="(.*?)" class', g.text)
    loginsubmit = se.group(1)
    # Form fields for the first login step; replace the placeholder credentials.
    login_infor = {
        'formhash': "46295093",
        'username': "xxxxxx",
        'password': "xxxxxx",
        'cookietime': "31536000",
        'refer': "",
        'loginsubmit': "会员登录",
    }
    p = session
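
The snippet hardcodes `formhash` and calls `se.group(1)` without checking that the pattern matched, so it would raise `AttributeError` as soon as the page markup changes. The sketch below is one possible way to collect the form fields defensively. It reuses the two regular expressions from the snippet; the `formhash` pattern, the `SAMPLE_HTML` string and the function name `extract_login_fields` are assumptions made for illustration and are not taken from the original source or verified against the live page.

```python
import re

# Hypothetical sample of the login page markup; the real page may differ.
SAMPLE_HTML = (
    '<a href="logging.php?action=login&t=1480000000">登录</a>'
    '<input type="hidden" name="formhash" value="46295093">'
    '<input type="submit" name="loginsubmit" value="会员登录" class="btn">'
)


def extract_login_fields(html):
    """Return the t token, formhash and loginsubmit value found in html.

    A field whose pattern does not match is returned as None instead of
    raising AttributeError.
    """
    patterns = {
        "t": r'action=login&t=(.*?)">登录',
        # Assumed markup for the hidden formhash input (not in the source).
        "formhash": r'name="formhash" value="(.*?)"',
        "loginsubmit": r'name="loginsubmit" value="(.*?)" class',
    }
    fields = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, html)
        fields[name] = match.group(1) if match else None
    return fields


fields = extract_login_fields(SAMPLE_HTML)
```

In the crawler, `g.text` would be passed in place of `SAMPLE_HTML`, and the script could stop with a clear message when any field comes back as `None`.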