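I. Introduction

A few days ago, in the Python Silver discussion group, 【斌】 asked a question about Python web scraping. A screenshot of the question follows:

A screenshot of the error follows:

The data the fan needed looks like this: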

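II. Implementation

Youdao Translate has been covered here many times and is genuinely good practice material; the main task is finding the right request. Here 【dcpeng】 took the fan's code as a starting point and supplied a working version: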

import requests

headers = {
    "Accept": "application/json, text/javascript, */*; q=0.01",
    "Accept-Language": "zh-CN,zh;q=0.9,en;q=0.8,en-GB;q=0.7,en-US;q=0.6",
    "Connection": "keep-alive",
    "Content-Type": "application/x-www-form-urlencoded; charset=UTF-8",
    "Origin": "https://fanyi.youdao.com",
    "Referer": "https://fanyi.youdao.com/",
    "Sec-Fetch-Dest": "empty",
    "Sec-Fetch-Mode": "cors",
    "Sec-Fetch-Site": "same-origin",
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36 Edg/104.0.1293.70",
    "X-Requested-With": "XMLHttpRequest",
    "sec-ch-ua": "\"Chromium\";v=\"104\", \" Not A;Brand\";v=\"99\", \"Microsoft Edge\";v=\"104\"",
    "sec-ch-ua-mobile": "?0",
    "sec-ch-ua-platform": "\"Windows\""
}
cookies = {
    "OUTFOX_SEARCH_USER_ID": "[email protected]",
    "OUTFOX_SEARCH_USER_ID_NCOO": "242914410.9668874",
    "P_INFO": "pdcfighting",
    "_ga": "GA1.2.1404336446.1645147264",
    "ANTICSRF": "cleared",
    "NTES_OSESS": "cleared",
    "S_OINFO": "",
    "___rl__test__cookies": "1662539503369"
}
url = "https://fanyi.youdao.com/translate_o"
params = {
    "smartresult": "rule"
}
data = {
    "i": "dog",
    "from": "AUTO",
    "to": "AUTO",
    "smartresult": "dict",
    "client": "fanyideskweb",
    "salt": "16625395033719",
    "sign": "2a0056b7249263308d07a3fce52c065c",
    "lts": "1662539503371",
    "bv": "6f1d3ad76bcde34b6b6745e8ab9dc20a",
    "doctype": "json",
    "version": "2.1",
    "keyfrom": "fanyi.web",
    "action": "FY_BY_REALTlME"
}
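# Send the form as a POST request: `params` goes into the query string, `data` into the body.
response = requests.post(url, headers=headers, cookies=cookies, params=params, data=data)

print(response.json())  # the parsed JSON body
print(response)  # the response object itself, which shows the HTTP status code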

Running this returns the expected result, as shown in the screenshot below:
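To work with the returned JSON rather than only print it, a small helper along these lines may be useful. This is a minimal sketch, not part of the original answer: the function name `extract_translations` is hypothetical, and the response shape (an `errorCode` of 0 plus a nested `translateResult` list whose entries carry a `tgt` field) is an assumption about what this endpoint returns, so check it against your own output first.

```python
from typing import Any, Dict, List


def extract_translations(payload: Dict[str, Any]) -> List[str]:
    """Return the translated strings from a parsed response, or raise on failure.

    Assumption: a successful response looks like
    {"errorCode": 0, "translateResult": [[{"src": "...", "tgt": "..."}]]}.
    """
    code = payload.get("errorCode")
    if code != 0:
        # Any other value is treated as a rejected request in this sketch.
        raise ValueError(f"request rejected, errorCode={code!r}")
    return [
        segment["tgt"]
        for paragraph in payload.get("translateResult", [])
        for segment in paragraph
    ]


# Hand-written sample in the assumed shape (not captured from a live request).
sample = {"errorCode": 0, "translateResult": [[{"src": "dog", "tgt": "狗"}]]}
print(extract_translations(sample))
```

With a live request, the same function would be called as `extract_translations(response.json())`.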

It turned out that some of the request parameters had been left out, which is why no data came back!
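A quick way to catch this kind of mistake before sending the request is to compare the form against the list of fields the working request uses. The sketch below is illustrative only: `missing_fields` and `REQUIRED_FIELDS` are hypothetical names, and the field list is simply copied from the `data` dictionary in the working code above, not from any official documentation.

```python
from typing import Iterable, List, Mapping

# Field names copied from the `data` dict of the working request.
REQUIRED_FIELDS = (
    "i", "from", "to", "smartresult", "client", "salt", "sign",
    "lts", "bv", "doctype", "version", "keyfrom", "action",
)


def missing_fields(
    data: Mapping[str, str], required: Iterable[str] = REQUIRED_FIELDS
) -> List[str]:
    """Return the required field names that are absent from the form data."""
    return [name for name in required if name not in data]


# A deliberately incomplete form, similar in spirit to the failing request.
incomplete = {"i": "dog", "from": "AUTO", "to": "AUTO", "doctype": "json"}
print(missing_fields(incomplete))
```

An empty list means every field from the working request is present. It does not guarantee that values such as `salt`, `sign`, and `lts` are still accepted by the server.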

The fan later tracked down the root cause. They did not fully understand it, but the problem was solved, and that is what matters!

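III. Summary

Hi everyone, I'm 皮皮. This post reviewed a Python web-scraping problem with Youdao Translate. The fix was to send the complete set of request parameters, and the post gives the working code that helped the fan solve the problem.

Finally, thanks to 【斌】 for the question, to 【dcpeng】 and 【猫药师Kelly】 for their ideas and code walkthrough, and to 【Python狗】 and others for joining the discussion.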

Tags: cookies, fans, Python, web scraping, error, fanyi
From: https://www.cnblogs.com/dcpeng/p/16726341.html
