urllib

2022-01-26 20:32:55 作者：互联网

1. API

1.1 发送POST请求：


import urllib.request
import urllib.parse

url = "http://www.httpbin.org/post"

# 请求数据
data = bytes(urllib.parse.urlencode({"name": "张飞"}), encoding="utf-8")
resp = urllib.request.urlopen(url, data=data)
print(resp.read().decode("utf-8"))

out:

{
  "args": {}, 
  "data": "", 
  "files": {}, 
  "form": {
    "name": "\u5f20\u98de"
  }, 
  "headers": {
    "Accept-Encoding": "identity", 
    "Content-Length": "23", 
    "Content-Type": "application/x-www-form-urlencoded", 
    "Host": "www.httpbin.org", 
    "User-Agent": "Python-urllib/3.10", 
    "X-Amzn-Trace-Id": "Root=1-61f13a4c-0a4e177c2091473a7c9bf70d"
  }, 
  "json": null, 
  "origin": "223.73.1.176", 
  "url": "http://www.httpbin.org/post"
}

1.2 发送GET 请求：

注意 "User-Agent": "Python-urllib/3.10"，在爬虫的时候上送头如果不修改伪装为浏览器的话‘
有的网站会识别出来返回HTTP 418

url = 'http://www.httpbin.org/get'

try:
    # timeout 可以指定等待时间，超过后会抛出异常
    resp = urllib.request.urlopen(url, timeout=0.01)
    print(resp.read().decode('utf-8'))
except urllib.error.URLError as e:
    print(e)

out:

{
  "args": {}, 
  "headers": {
    "Accept-Encoding": "identity", 
    "Host": "www.httpbin.org", 
    "User-Agent": "Python-urllib/3.10", 
    "X-Amzn-Trace-Id": "Root=1-61f13d11-6884fbc355aa3d4b5d7e85cc"
  }, 
  "origin": "223.73.1.176", 
  "url": "http://www.httpbin.org/get"
}

标签：www,httpbin,url,resp,urllib,org
来源： https://www.cnblogs.com/czzz/p/15848050.html