web-scraping - 在 Nixos 上使用 systemd 计时器运行脚本

标签 web-scraping timer systemd nixos

我有一个小的 shellscript scrape.sh抓取网站并将结果数据放入新目录:

website='test.com'
dir_name="datev_data/$(date -u +'%Y-%m-%dT%H:%M:%S')"
mkdir $dirname
wget --directory-prefix="$dir_name" "$website"
(我真的不在乎数据在哪里结束,只要它每次都获得一个新目录并且我可以访问数据。因此,我现在将它放在我的主目录 /home/kaligule 中。)
手动运行这个脚本工作正常,所以现在我想在我的 nixos-server 上每小时运行一次这个脚本。因此,我将以下内容放入我的配置中(灵感来自 this post):
    systemd.services.test_systemd_timers = {
      serviceConfig.Type = "oneshot";
      script = ''
        echo "Will start scraping now."
        {pkgs.bash}/bin/bash /home/kaligule/scrape.sh
        echo "Done scraping."
      '';
    };

    systemd.timers.test_systemd_timers = {
      wantedBy = [ "timers.target" ];
      partOf = [ "test_systemd_timers.service" ];
      timerConfig.OnCalendar = [ "*-*-* *:00:00" ];
    };
然后我测试一下:
sudo nixos-rebuild switch # everything is fine
sudo systemctl start test_systemd_timers # run it immediatelly for testing
我得到:
Job for test_systemd_timers.service failed because the control process exited with error code.
See "systemctl status test_systemd_timers.service" and "journalctl -xe" for details.
第一个建议的命令给了我这个:
● test_systemd_timers.service
     Loaded: loaded (/nix/store/f8348svxpnn6qx08adrv5s7ksc2zy1sk-unit-test_systemd_timers.service/test_systemd_timers.service; linked; vendor preset: enabled)
     Active: failed (Result: exit-code) since Fri 2021-04-02 14:50:02 CEST; 2min 36s ago
TriggeredBy: ● test_systemd_timers.timer
    Process: 5686 ExecStart=/nix/store/4smyxxxlhnnmw8l6l3nnfjyvmg0wxcwh-unit-script-test_systemd_timers-start/bin/test_systemd_timers-start (code=exited, status=127)
   Main PID: 5686 (code=exited, status=127)
         IP: 0B in, 0B out
        CPU: 11ms

Apr 02 14:50:02 regulus systemd[1]: Starting test_systemd_timers.service...
Apr 02 14:50:02 regulus test_systemd_timers-start[5686]: Will start scraping now.
Apr 02 14:50:02 regulus test_systemd_timers-start[5687]: /nix/store/4smyxxxlhnnmw8l6l3nnfjyvmg0wxcwh-unit-script-test_systemd_timers-start/bin/test_systemd_timers-start: line 3: {pkgs.bash}/bin/bash: No such file or directory
Apr 02 14:50:02 regulus systemd[1]: test_systemd_timers.service: Main process exited, code=exited, status=127/n/a
Apr 02 14:50:02 regulus systemd[1]: test_systemd_timers.service: Failed with result 'exit-code'.
Apr 02 14:50:02 regulus systemd[1]: Failed to start test_systemd_timers.service.
第二个建议的命令给了我:
Apr 02 14:54:42 regulus systemd[1]: Starting test_systemd_timers.service...
░░ Subject: A start job for unit test_systemd_timers.service has begun execution
░░ Defined-By: systemd
░░ Support: https://lists.freedesktop.org/mailman/listinfo/systemd-devel
░░ 
░░ A start job for unit test_systemd_timers.service has begun execution.
░░ 
░░ The job identifier is 34454.
Apr 02 14:54:42 regulus test_systemd_timers-start[5734]: Will start scraping now.
Apr 02 14:54:42 regulus test_systemd_timers-start[5735]: /nix/store/4smyxxxlhnnmw8l6l3nnfjyvmg0wxcwh-unit-script-test_systemd_timers-start/bin/test_systemd_timers-start: line 3: {pkgs.bash}/bin/bash: No such file or directory
Apr 02 14:54:42 regulus systemd[1]: test_systemd_timers.service: Main process exited, code=exited, status=127/n/a
░░ Subject: Unit process exited
░░ Defined-By: systemd
░░ Support: https://lists.freedesktop.org/mailman/listinfo/systemd-devel
░░ 
░░ An ExecStart= process belonging to unit test_systemd_timers.service has exited.
░░ 
░░ The process' exit code is 'exited' and its exit status is 127.
Apr 02 14:54:42 regulus systemd[1]: test_systemd_timers.service: Failed with result 'exit-code'.
░░ Subject: Unit failed
░░ Defined-By: systemd
░░ Support: https://lists.freedesktop.org/mailman/listinfo/systemd-devel
░░ 
░░ The unit test_systemd_timers.service has entered the 'failed' state with result 'exit-code'.
Apr 02 14:54:42 regulus systemd[1]: Failed to start test_systemd_timers.service.
░░ Subject: A start job for unit test_systemd_timers.service has failed
░░ Defined-By: systemd
░░ Support: https://lists.freedesktop.org/mailman/listinfo/systemd-devel
░░ 
░░ A start job for unit test_systemd_timers.service has finished with a failure.
░░ 
░░ The job identifier is 34454 and the job result is failed.
Apr 02 14:54:42 regulus sudo[5731]: pam_unix(sudo:session): session closed for user root
所以因为我得到了 Will start scraping now.在日志中,我认为作业已启动但无法运行脚本。我的问题是:
  • 我应该将脚本放在哪里(以及具有哪些权限)?
  • 我应该如何添加我的配置以便脚本按照我的描述运行?

  • (当然,我很感激对我的方法的每一个反馈。)

    最佳答案

    你的问题是,在你的脚本中,{pkgs.bash}应该是 ${pkgs.bash} ;没有 $ ,你不会得到变量插值。

    关于web-scraping - 在 Nixos 上使用 systemd 计时器运行脚本,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/66919744/

    相关文章:

    python - 如何抓取多个 div(并将它们放入 csv 中)?

    c++ - 使用 Boost 进行网页抓取,返回十六进制而不是 HTML

    swift - 我无法在 Swift 3 中使用计时器更改 label.text

    logging - 使用不带日志的 systemd

    linux - 作为用户的 Systemd 服务单元文件

    python - Web抓取数据到文件,结果以逗号分隔

    python - BeautifulSoup find_all 仅返回前 50 个标签

    javascript - 与定时器异步运行

    java - 关于java定时器

    python - Systemd + 非根 Gunicorn 服务 = 不存在的子进程