scrapydweb: Scrapyd server cluster monitoring and interaction, Scrapy log analysis and visualization - V2EX
    my8100 Oct 9, 2018 3566 views

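    Features

    • Scrapyd server cluster monitoring and interaction

      • Select specific server nodes by grouping and filtering
      • Execute in batch with a single click
    • Scrapy log analysis

      • Stats display
      • Crawl progress visualization
      • Categorization of key logs
    • Support for all Scrapyd APIs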

      • Deploy project, Run Spider, Stop job
      • List projects/versions/spiders/running_jobs
      • Delete version/project
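
The operations listed above correspond to Scrapyd's JSON endpoints. As a rough sketch (not scrapydweb's own code), the snippet below builds the request for each operation without sending it, and repeats it across several nodes to illustrate the "one click, batch execution" idea. The helper names `build_request` and `build_batch`, the node addresses, and the project and spider names are hypothetical; only the endpoint names and their parameters come from Scrapyd itself.

```python
from urllib.parse import urlencode, urljoin

# Scrapyd JSON endpoints behind the operations listed above:
# action name (hypothetical) -> (HTTP method, endpoint path).
SCRAPYD_ENDPOINTS = {
    "deploy": ("POST", "addversion.json"),
    "run": ("POST", "schedule.json"),
    "stop": ("POST", "cancel.json"),
    "list_projects": ("GET", "listprojects.json"),
    "list_versions": ("GET", "listversions.json"),
    "list_spiders": ("GET", "listspiders.json"),
    "list_jobs": ("GET", "listjobs.json"),
    "delete_version": ("POST", "delversion.json"),
    "delete_project": ("POST", "delproject.json"),
}


def build_request(server, action, **params):
    """Return (method, url, form_data) for one Scrapyd call, without sending it."""
    method, path = SCRAPYD_ENDPOINTS[action]
    url = urljoin("http://%s/" % server, path)
    if method == "GET":
        # GET endpoints take their parameters in the query string.
        if params:
            url += "?" + urlencode(params)
        return method, url, None
    # POST endpoints take their parameters as form data.
    return method, url, params


def build_batch(servers, action, **params):
    """Build the same request for every selected node."""
    return [build_request(server, action, **params) for server in servers]


if __name__ == "__main__":
    nodes = ["127.0.0.1:6800", "127.0.0.1:6801"]  # hypothetical node list
    for request in build_batch(nodes, "run", project="demo", spider="example"):
        print(request)
```

Each tuple can then be handed to whatever HTTP client is in use; keeping request construction separate from sending makes it easy to preview a batch before it runs.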

    GitHub

    scrapydweb

    Stars and issues are welcome.

    Installation

    Install via pip:

    $ pip install scrapydweb 

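    Getting started

    Run "scrapydweb -h" in a terminal to see the help text and available options. The first run generates a configuration file, "scrapydweb_settings.py", in the current working directory; edit it to customize options such as the list of Scrapyd servers.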

    $ scrapydweb 

    Then open http://127.0.0.1:5000 in a browser.
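
For the generated "scrapydweb_settings.py", the server list might look roughly like the fragment below. This is a sketch from memory of scrapydweb's default settings, not a copy of the generated file: the option name `SCRAPYD_SERVERS` and the tuple form for servers behind basic auth are assumptions that may differ between versions, and the addresses and credentials are placeholders.

```python
# scrapydweb_settings.py (sketch; option name and entry formats are assumed
# from scrapydweb's default settings and may differ between versions)

# Scrapyd servers to monitor and control from the web UI.
SCRAPYD_SERVERS = [
    # Plain "host:port" entry for a Scrapyd server without authentication.
    "127.0.0.1:6800",
    # (username, password, host, port, group) for a server protected by
    # basic auth; every value here is a placeholder.
    ("username", "password", "localhost", "6801", "group"),
]
```

Check the comments in the file that your installed version actually generates before relying on either form.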

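    Screenshots

    • Cluster monitoring and interaction overview

    • Job list of the current node dashboard

    • Log analysis

      • Stats display log_stats

      • Crawl progress visualization log_chart

      • Categorization of key logs log_extracted

    • Deploy a project deploy

    • Run a spider run

    • Manage projects manage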
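
To give a feel for the kind of processing behind the log analysis pages listed above, here is a minimal sketch that parses a Scrapy log into per-level counts, crawl progress points, and categorized key logs. It is not scrapydweb's implementation: the `analyze` function, the sample lines, and the choice of WARNING and above as "key" logs are assumptions for illustration. It relies only on Scrapy's default log line layout and the periodic "Crawled N pages ... scraped M items" message from the LogStats extension.

```python
import re
from collections import Counter

# Default Scrapy log line: "<date> <time> [<logger>] <LEVEL>: <message>"
LINE_RE = re.compile(
    r"^(?P<time>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<logger>[^\]]+)\] (?P<level>[A-Z]+): (?P<message>.*)$"
)
# Periodic progress message emitted by Scrapy's LogStats extension.
PROGRESS_RE = re.compile(
    r"Crawled (?P<pages>\d+) pages .*scraped (?P<items>\d+) items"
)
# Levels treated as "key" logs here (an assumption for this sketch).
KEY_LEVELS = ("WARNING", "ERROR", "CRITICAL")


def analyze(lines):
    """Return (level_counts, progress_points, key_logs) for Scrapy log lines."""
    levels = Counter()
    progress = []  # (timestamp, pages crawled, items scraped)
    key_logs = {}  # level -> list of messages
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines without a log header, e.g. traceback bodies
        level = match.group("level")
        levels[level] += 1
        if level in KEY_LEVELS:
            key_logs.setdefault(level, []).append(match.group("message"))
        found = PROGRESS_RE.search(match.group("message"))
        if found:
            progress.append(
                (match.group("time"), int(found.group("pages")), int(found.group("items")))
            )
    return levels, progress, key_logs


# Hypothetical sample lines in Scrapy's default format.
SAMPLE_LOG = [
    "2018-10-09 12:00:00 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)",
    "2018-10-09 12:00:30 [scrapy.core.scraper] ERROR: Spider error processing <GET http://example.com/a> (referer: None)",
    "2018-10-09 12:01:00 [scrapy.extensions.logstats] INFO: Crawled 42 pages (at 42 pages/min), scraped 40 items (at 40 items/min)",
]

if __name__ == "__main__":
    counts, points, keyed = analyze(SAMPLE_LOG)
    print(dict(counts))
    print(points)
    print(keyed)
```

The progress points are the natural input for a pages-and-items-over-time chart, while the per-level message lists correspond to the categorized view.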

    Supplement 1    Oct 25, 2018

    Updated screenshots

    • Overview overview

    • Dashboard dashboard

    • Log Analysis

      • Stats collection log_stats

      • Progress visualization log_charts

      • Logs categorization log_logs

    • Deploy a Project deploy

    • Run a Spider run

    • Manage Projects manage

    Supplement 2    Nov 21, 2018

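    V2EX posts cannot be updated freely after publishing, and the images tend to break, so please head over to see the latest version.

    Recommended reading: How to deploy and monitor distributed crawler projects simply and efficiently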

    9 replies    2018-11-13 17:52:17 +08:00
    #1  achaocha  Oct 9, 2018
    Star first, details later.
    #2  my8100 (OP)  Oct 12, 2018
    v0.9.4: Add basic auth for web UI
    #3  my8100 (OP)  Oct 12, 2018
    v0.9.5: Add Items page
    #4  SpiderXiantang  Oct 14, 2018
    star!
    #5  my8100 (OP)  Oct 16, 2018
    v0.9.6: Support accessing Scrapyd servers protected by basic auth
    #6  my8100 (OP)  Oct 17, 2018
    v0.9.7: Fix the bug that fails to read lastModifiedDate of egg file in Firefox and Safari
    #7  my8100 (OP)  Oct 20, 2018
    v0.9.8: Kill caching subprocess when main process is killed
    #8  my8100 (OP)  Oct 25, 2018
    v0.9.9: Add auto eggifying
    #9  my8100 (OP)  Nov 13, 2018
    v1.0.0rc1: Add Email Notice; Refactor codes