Commit Graph

29 Commits

Author SHA1 Message Date
hantmac
7c2e77d44e fix GetGitRemoteBranchesPlain when git url has port
Signed-off-by: hantmac <hantmac@outlook.com>
2020-10-20 14:20:51 +08:00
marvzhang
9391334152 updated template spiders 2020-08-13 10:11:34 +08:00
marvzhang
f14e7124d4 added colly example 2020-08-13 09:23:34 +08:00
marvzhang
afc9a6ca48 updated baidu_colly spider 2020-08-03 17:38:01 +08:00
marvzhang
c2e90bc6d1 added baidu colly example spider 2020-08-03 16:20:33 +08:00
Seven2Nine
a7d3ca3bd3 Update spider.py
修复“//” 开头的url报错   ValueError('Missing scheme in request url: %s' % self._url)
2020-07-08 11:06:52 +08:00
marvzhang
a0c47e54e8 updated spiders 2020-02-11 10:06:03 +08:00
marvzhang
ef991e1056 added jd_mask_spider 2020-02-10 11:15:37 +08:00
marvzhang
d4cdfda5ff fixed sinastock_spider issue 2020-02-08 09:51:11 +08:00
marvzhang
4daeab594f 版本更新 2020-02-03 16:43:29 +08:00
marvzhang
2c410feed3 changed dir 2020-02-03 11:58:05 +08:00
marvzhang
c86a6dda9c 修复可配置爬虫无法解析 "//" 打头的URL 2020-01-30 17:36:38 +08:00
marvzhang
525ffd78b7 updated version 2020-01-15 20:40:02 +08:00
marvzhang
3ef794f7a2 将可配置爬虫stages调整为列表 2019-12-13 12:55:53 +08:00
marvzhang
be4a5f6667 优先调整xpath顺序 2019-12-04 13:57:27 +08:00
marvzhang
82c6b80063 更新Spiderfile模版 2019-12-03 14:25:54 +08:00
marvzhang
589190d674 加入Spiderfile模版 2019-12-03 13:37:41 +08:00
marvzhang
9c5c0bd270 更新可配置爬虫 2019-12-03 12:14:38 +08:00
marvzhang
af680d576a 更新可配置爬虫 2019-12-02 22:35:45 +08:00
marvzhang
fd27949a40 加入依赖模块 2019-12-02 17:46:57 +08:00
marvzhang
a1bfe00eee 准备可配置爬虫自定义设置变量 2019-11-29 13:42:50 +08:00
marvzhang
e2e61c621e 加入环境变量 2019-11-25 22:07:20 +08:00
marvzhang
825962f8ba 加入储存逻辑 2019-11-25 17:52:19 +08:00
marvzhang
2e5468e4c1 refactor code 2019-11-25 16:45:55 +08:00
marvzhang
b8592b89ce 更新可配置爬虫,修复一些问题 2019-11-24 19:45:21 +08:00
marvzhang
6a07afa279 更新可配置爬虫,修复一些问题 2019-11-24 18:51:32 +08:00
marvzhang
38d103da39 加入可配置爬虫 2019-11-24 17:57:12 +08:00
marvzhang
b51fa81b79 code cleanup 2019-11-23 18:16:24 +08:00
marvzhang
b5c3ca9c32 准备可配置爬虫 2019-11-23 18:13:47 +08:00