docs(runbook): monitoring update — TCP-25 rationale + blacklist monitors
ci / tc_portal (push) Has been skipped
ci / tc_operator (push) Has been skipped
ci / build_operator (push) Has been skipped
ci / changes (push) Successful in 3s
ci / tc_booking (push) Has been skipped
ci / tc_website (push) Has been skipped
ci / tc_platform_api (push) Has been skipped
ci / test_platform_api (push) Has been skipped
ci / build_portal (push) Has been skipped
ci / build_booking (push) Has been skipped
ci / build_platform_api (push) Has been skipped
ci / deploy (push) Has been skipped
ci / tc_portal (push) Has been skipped
ci / tc_operator (push) Has been skipped
ci / build_operator (push) Has been skipped
ci / changes (push) Successful in 3s
ci / tc_booking (push) Has been skipped
ci / tc_website (push) Has been skipped
ci / tc_platform_api (push) Has been skipped
ci / test_platform_api (push) Has been skipped
ci / build_portal (push) Has been skipped
ci / build_booking (push) Has been skipped
ci / build_platform_api (push) Has been skipped
ci / deploy (push) Has been skipped
This commit is contained in:
@@ -39,12 +39,15 @@ bottom to rebuild it. Per-layer detail lives in `host/README.md`,
|
|||||||
type): keep newest 5 versions per image + `latest`, remove older than 7
|
type): keep newest 5 versions per image + `latest`, remove older than 7
|
||||||
days. Applied by Gitea's daily cleanup cron.
|
days. Applied by Gitea's daily cleanup cron.
|
||||||
- **Monitoring** — HetrixTools (Ronni's account): 11 uptime monitors via API
|
- **Monitoring** — HetrixTools (Ronni's account): 11 uptime monitors via API
|
||||||
(HTTPS on the five apps + Gitea w/ SSL verify, ping, IMAPS/SMTPS TCP, SMTP
|
(HTTPS on the five apps + Gitea w/ SSL verify, ping, IMAPS/SMTPS/port-25
|
||||||
protocol on :25; 1-min checks from ams/fra/lon, alert after 2 fails) + the
|
TCP — port 25 is a TCP check ON PURPOSE: Stalwart's DNSBL screening
|
||||||
Linux server agent on node1 (root mode, per-minute cron in
|
rejects HetrixTools' probe IPs, so an SMTP-protocol check reads down while
|
||||||
/etc/hetrixtools/; watches stalwart/k3s/dockerd processes, mdadm RAID,
|
real MTAs are fine; 1-min checks from ams/fra/lon, alert after 2 fails),
|
||||||
NVMe SMART via smartmontools). Re-create monitors via their v2 API
|
blacklist monitors on dezky.eu + 46.4.78.187, and the Linux server agent
|
||||||
(uptime/add, Type 9 = server agent — hidden in the new UI); agent install:
|
on node1 (root mode, per-minute cron in /etc/hetrixtools/; watches
|
||||||
|
stalwart/k3s/dockerd processes, mdadm RAID, NVMe SMART via smartmontools).
|
||||||
|
Re-create monitors via their v2 API (uptime/add, Type 9 = server agent —
|
||||||
|
hidden in the new UI); agent install:
|
||||||
hetrixtools_install.sh <server_id from API response> 1 "stalwart,k3s,dockerd" 1 1.
|
hetrixtools_install.sh <server_id from API response> 1 "stalwart,k3s,dockerd" 1 1.
|
||||||
|
|
||||||
## Deploy flow (day-to-day)
|
## Deploy flow (day-to-day)
|
||||||
|
|||||||
Reference in New Issue
Block a user