scray中的Request 不执行回调

时间：2022-11-21 12:02:20浏览次数：73

标签：scray attribute middleware Request offsite scrapy allowed domains 回调

在 scrapy 中，

scrapy.Request(url, headers=self.header, callback=self.parse_detail)

parse_detail 没有被调用，这可能就是被过滤掉了，查看 scrapy 的输出日志 offsite/filtered 会显示过滤的数目。这个问题如何解决呢，查看手册发现(https://doc.scrapy.org/en/latest/faq.html?highlight=offsite%2Ffiltered)这个问题，这些日志信息都是由 scrapy 中的一个 middleware 抛出的，如果没有自定义，那么这个 middleware 就是默认的 Offsite Spider Middleware，它的目的就是过滤掉那些不在 allowed_domainsOffsiteMiddleware 的部分(https://doc.scrapy.org/en/latest/topics/spider-middleware.html#scrapy.spidermiddlewares.offsite.OffsiteMiddleware)

两种方法能够使 requests 不被过滤:

1. 在 allowed_domains 中加入 url

2. 在 scrapy.Request() 函数中将参数 dont_filter=True

如下摘自手册

If the spider doesn’t define an allowed_domains attribute, or the attribute is empty, the offsite middleware will allow all requests.

If the request has the dont_filter attribute set, the offsite middleware will allow the request even if its domain is not listed in allowed domains.

在 scrapy 中，

scrapy.Request(url, headers=self.header, callback=self.parse_detail)

两种方法能够使 requests 不被过滤:

1. 在 allowed_domains 中加入 url

2. 在 scrapy.Request() 函数中将参数 dont_filter=True

如下摘自手册

If the spider doesn’t define an allowed_domains attribute, or the attribute is empty, the offsite middleware will allow all requests.

If the request has the dont_filter attribute set, the offsite middleware will allow the request even if its domain is not listed in allowed domains.

标签：scray,attribute,middleware,Request,offsite,scrapy,allowed,domains,回调
From： https://blog.51cto.com/u_15882671/5873324

回调函数
https://www.runoob.com/w3cnote/c-callback-function.html#include<stdio.h>intCallback_1();intCallback_2();intCallback_3();intHandle(int(*Callback)()......
php监听redis key失效触发回调事件
一、需求分析： 1、设置了生命时间的key，过期的时候能不能提示,能够监听过期的key？ 2、怎样用redis实现定时任务？二、应用场景：在我们程序中经常会有需要定时执行的程序，比如......
springmvc九yxf学RequestParam
源码可以看出RequestParam是用在参数上的，再看，这个注解的源码比较少。required，这是设置是否必须有这个参数。defaultValue，是可以省略的意思，就是这个参数......
TransmittableThreadLocal传递ServletRequestAttributes对象在主线程和线程池，避坑指南
关于HttpServletRequest对象在主线程和线程池传递过程的问题一，针对一般对象，解决主线程和线程池内线程对象解决方案是用阿里的插件TransmittableThreadLocal使用案例（1）将线程......
No 'Access-Control-Allow-Origin' header is present on the requested resource.
最近写前后端分离，遇到了跨域问题，很奇怪的是我已经注入了Cors跨域请求，但是每当被JWT的拦截器拦截下来返回未通过时，前端收到的总是无法加载响应数据，琢磨了好一会之后，发现......
uniapp小程序通过wx.requestSubscribeMessage实现订阅消息
wx.requestSubscribeMessage官网地址：https://developers.weixin.qq.com/miniprogram/dev/framework/open-ability/subscribe-message.html#%E8%AE%A2%E9%98%85%E6%B6%88%E......
三、@RequestMapping注解
......
SpringBoot使用ServletFileUpload上传文件时servletFileUpload.parseRequest(request)
1.问题描述1.1SpringBoot使用ServletFileUpload上传文件时List<FileItem>items=servletFileUpload.parseRequest(request)为空//获取ServletFileUploadServletF......
request(请求)和response(响应)
request:获取请求数据如网页上输入的信息response：设置响应数据如设置网页显示的信息如上图，在deGet方法中使用了request来获取页面中的name值，使用response来输出r......
.net core 获取本地ip及request请求端口
1.获取ip和端口stringstr=(Request.HttpContext.Connection.LocalIpAddress.MapToIPv4().ToString()+":"+Request.HttpContext.Connection.LocalPort); 输出s......

scray中的Request 不执行回调

相关文章

赞助商

阅读排行