User Tools

Site Tools


tamiwiki:internal:networks:tami_sre

This is an old revision of the document!


Tame Site Reliability Engineering

This page detials our efforts to keep the systems online reliable


Currently, we have a Raspberry pi in the space that runs a realtime Status webpage from UptimeRobot The Status page of our Services is here

In case the network is not functioning, you should reach out to someone from the contact page.

Troubleshooting steps

The first step is to identify the nature of the root cause and whether it is related to the network or the infrastructure. Use the [https://stats.uptimerobot.com/Jx4JQiBDEZ|UptimeRobot site]] to see what could be up or down.

  • If everything is down, is it definetly a Network issue but maybe also infra
  • If the IP address of Tami is reachable (Ping and Telnet), but the yunohost services are down, its likely just an infra issue

Network

Relevant Link: Physical Infra

Infra

Relevant Link: Physical Infra

If there is an issue with a single service

  • The first step is to see if you can log into yunohost admin panel
  • Then check the service at Tools > Services
  • Review the logs, restart the service if necessary and maybe share logs with yunopast into a relevant group in tamis communication channel

If there is an issue with a multiple services

  • Attempt the steps above for each services but if its all services, it might be something related to yunohost or the device it is running on
  • Try to ssh into yunohost. The password is your yunohost SSO password
    • ssh <yunohost username>, telavivmakers.space
tamiwiki/internal/networks/tami_sre.1677967209.txt.gz ยท Last modified: 2023/03/05 00:00 by 444b