• 1 Post
  • 969 Comments
Joined 3 years ago
Cake day: July 2nd, 2023

  • foggy@lemmy.world to Humor@lemmy.world · Rule-follower · 5 days ago

    I am a massive single point of failure for my org.

    If anything bad or unexpected happens to the website that I migrated on-prem for us, I am basically the only one who has any idea where to even begin.

    And that covers just one of the many Grafana dashboards I built, which are used by basically every team now. I also stood up the Grafana server.

    There was an RTO. My boss was literally like “this doesn’t apply to you.”


  • I might be misunderstanding this concept, but it seems like extra work, or a recipe for an insecure mess that could become difficult to maintain.

    I run the ELK stack and log basically everything, which has created a centralized point for observability. This lets me granularly investigate, and thereby control, the state of all of my network's services.

    It’s a little RAM-hungry, but I’ve got some headroom.