Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Darkly)
  • No Skin
Collapse
Brand Logo
  1. Home
  2. Uncategorized
  3. A few days ago, a client’s data center (well, actually a server room) "vanished" overnight.

A few days ago, a client’s data center (well, actually a server room) "vanished" overnight.

Scheduled Pinned Locked Moved Uncategorized
sysadminhorrorstoriesithorrorstoriesmonitoring
176 Posts 77 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • Stefano MarinelliS This user is from outside of this forum
    Stefano MarinelliS This user is from outside of this forum
    Stefano Marinelli
    wrote last edited by
    #1

    A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

    I then suspected a power failure, but the UPS should have sent an alert.

    The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

    To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

    The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

    That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

    The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

    The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

    Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

    Never rely only on internal monitoring. Never.

    #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

    advokattK mkjM Elena ``of Valhalla''V Wulfy—Speaker to the machinesN KevA 49 Replies Last reply
    1
    0
    • Stefano MarinelliS Stefano Marinelli

      A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

      I then suspected a power failure, but the UPS should have sent an alert.

      The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

      To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

      The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

      That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

      The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

      The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

      Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

      Never rely only on internal monitoring. Never.

      #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

      advokattK This user is from outside of this forum
      advokattK This user is from outside of this forum
      advokatt
      wrote last edited by
      #2

      @stefano nice story! and, yeah, internal monitoring is a must, but you also need an external one, operated by someone else than yourself.

      1 Reply Last reply
      0
      • James SewardJ This user is from outside of this forum
        James SewardJ This user is from outside of this forum
        James Seward
        wrote last edited by
        #3

        @rhoot @stefano I have my cronjob scripts touch a file as their final action and my monitoring stuff alarms if the file is too old

        Rihards OlupsR randomizedR 2 Replies Last reply
        0
        • James SewardJ This user is from outside of this forum
          James SewardJ This user is from outside of this forum
          James Seward
          wrote last edited by
          #4

          @rhoot @stefano the central monitor instance knows which remote ones should be checking in and alarms if any of them don't for too long, and finally the status page monitors its own age and adds a warning if it's out of date.

          Beyond that, nothing 😉

          1 Reply Last reply
          0
          • Stefano MarinelliS Stefano Marinelli

            A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

            I then suspected a power failure, but the UPS should have sent an alert.

            The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

            To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

            The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

            That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

            The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

            The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

            Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

            Never rely only on internal monitoring. Never.

            #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

            mkjM This user is from outside of this forum
            mkjM This user is from outside of this forum
            mkj
            wrote last edited by
            #5

            @stefano Sounds like a case of either good design or *very* good luck too that the UPS took the brunt of it.

            We can't protect against everything, but we *can* have an idea for what to do when the unimagined happens.

            1 Reply Last reply
            0
            • mkjM This user is from outside of this forum
              mkjM This user is from outside of this forum
              mkj
              wrote last edited by
              #6

              @stefano @ricardo So in either case, layers of redundancy saved the day.

              1 Reply Last reply
              0
              • mkjM This user is from outside of this forum
                mkjM This user is from outside of this forum
                mkj
                wrote last edited by
                #7

                @stefano Please do make it a blog post!

                @toxy

                1 Reply Last reply
                0
                • DeManiak 🇿🇦K This user is from outside of this forum
                  DeManiak 🇿🇦K This user is from outside of this forum
                  DeManiak 🇿🇦
                  wrote last edited by
                  #8

                  @marios @EnigmaRotor @stefano can recommend Uptime Kuma.

                  Just consider carefully the number of historic records you need to keep - older versions had issues (db corruption) when history got large.
                  Current version I believe addressed this,and now supports mariaDB (external and embedded).

                  1 Reply Last reply
                  0
                  • Ricardo Martín :bsdhead:R This user is from outside of this forum
                    Ricardo Martín :bsdhead:R This user is from outside of this forum
                    Ricardo Martín :bsdhead:
                    wrote last edited by
                    #9

                    #NoirThriller
                    @stefano @toxy

                    1 Reply Last reply
                    0
                    • Stefano MarinelliS This user is from outside of this forum
                      Stefano MarinelliS This user is from outside of this forum
                      Stefano Marinelli
                      wrote last edited by
                      #10

                      @marios @EnigmaRotor consider this: https://it-notes.dragas.net/2024/07/22/install-uptime-kuma-freebsd-jail/

                      Marios EfstathiouM 1 Reply Last reply
                      0
                      • Stefano MarinelliS Stefano Marinelli

                        @marios @EnigmaRotor consider this: https://it-notes.dragas.net/2024/07/22/install-uptime-kuma-freebsd-jail/

                        Marios EfstathiouM This user is from outside of this forum
                        Marios EfstathiouM This user is from outside of this forum
                        Marios Efstathiou
                        wrote last edited by
                        #11

                        @stefano

                        You were reading my mind

                        1 Reply Last reply
                        0
                        • Utarg of Utarg 🔬🇪🇺🇸🇪🇬🇧🇺🇦T This user is from outside of this forum
                          Utarg of Utarg 🔬🇪🇺🇸🇪🇬🇧🇺🇦T This user is from outside of this forum
                          Utarg of Utarg 🔬🇪🇺🇸🇪🇬🇧🇺🇦
                          wrote last edited by
                          #12

                          @stefano Featuring Hans Gruber?

                          Stefano MarinelliS 1 Reply Last reply
                          0
                          • Utarg of Utarg 🔬🇪🇺🇸🇪🇬🇧🇺🇦T Utarg of Utarg 🔬🇪🇺🇸🇪🇬🇧🇺🇦

                            @stefano Featuring Hans Gruber?

                            Stefano MarinelliS This user is from outside of this forum
                            Stefano MarinelliS This user is from outside of this forum
                            Stefano Marinelli
                            wrote last edited by
                            #13

                            @toxy featuring me 😆

                            1 Reply Last reply
                            0
                            • Stefano MarinelliS Stefano Marinelli

                              A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

                              I then suspected a power failure, but the UPS should have sent an alert.

                              The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

                              To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

                              The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

                              That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

                              The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

                              The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

                              Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

                              Never rely only on internal monitoring. Never.

                              #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

                              Elena ``of Valhalla''V This user is from outside of this forum
                              Elena ``of Valhalla''V This user is from outside of this forum
                              Elena ``of Valhalla''
                              wrote last edited by
                              #14
                              @stefano feeling of :xkcd:`705` intensifies 😄
                              Stefano MarinelliS 1 Reply Last reply
                              0
                              • Elena ``of Valhalla''V Elena ``of Valhalla''
                                @stefano feeling of :xkcd:`705` intensifies 😄
                                Stefano MarinelliS This user is from outside of this forum
                                Stefano MarinelliS This user is from outside of this forum
                                Stefano Marinelli
                                wrote last edited by
                                #15

                                @valhalla totally!

                                Luca Sironi (fantasma edition)L 1 Reply Last reply
                                0
                                • Stefano MarinelliS Stefano Marinelli

                                  @valhalla totally!

                                  Luca Sironi (fantasma edition)L This user is from outside of this forum
                                  Luca Sironi (fantasma edition)L This user is from outside of this forum
                                  Luca Sironi (fantasma edition)
                                  wrote last edited by
                                  #16

                                  @stefano @valhalla shit, we have to deal with a bsd guy 😈

                                  Stefano MarinelliS 1 Reply Last reply
                                  0
                                  • James SewardJ James Seward

                                    @rhoot @stefano I have my cronjob scripts touch a file as their final action and my monitoring stuff alarms if the file is too old

                                    Rihards OlupsR This user is from outside of this forum
                                    Rihards OlupsR This user is from outside of this forum
                                    Rihards Olups
                                    wrote last edited by
                                    #17

                                    @jamesoff @rhoot @stefano When I managed such things in the past, I had the backup script use zabbix_sender to send a value to Zabbix and then alert if that is missing, like you just said.

                                    But after one incident I also added monitoring of backup size and alerting if it changes by > 10% from the previous.

                                    If backup starts getting failed DB dumps, it's good to know early that "hey, backups just dropped in size by 90%" 🙂

                                    Also, if a backup suddenly grows a lot, something's weird.

                                    James SewardJ 1 Reply Last reply
                                    0
                                    • Rihards OlupsR Rihards Olups

                                      @jamesoff @rhoot @stefano When I managed such things in the past, I had the backup script use zabbix_sender to send a value to Zabbix and then alert if that is missing, like you just said.

                                      But after one incident I also added monitoring of backup size and alerting if it changes by > 10% from the previous.

                                      If backup starts getting failed DB dumps, it's good to know early that "hey, backups just dropped in size by 90%" 🙂

                                      Also, if a backup suddenly grows a lot, something's weird.

                                      James SewardJ This user is from outside of this forum
                                      James SewardJ This user is from outside of this forum
                                      James Seward
                                      wrote last edited by
                                      #18

                                      @richlv @rhoot @stefano I also do this 🙂

                                      (https://simplemonitor.readthedocs.io/en/latest/monitors/filestat.html)

                                      1 Reply Last reply
                                      0
                                      • Luca Sironi (fantasma edition)L Luca Sironi (fantasma edition)

                                        @stefano @valhalla shit, we have to deal with a bsd guy 😈

                                        Stefano MarinelliS This user is from outside of this forum
                                        Stefano MarinelliS This user is from outside of this forum
                                        Stefano Marinelli
                                        wrote last edited by
                                        #19

                                        @luca @valhalla those are terrible! 😆

                                        1 Reply Last reply
                                        0
                                        • Stefano MarinelliS Stefano Marinelli

                                          A few days ago, a client’s data center (well, actually a server room) "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

                                          I then suspected a power failure, but the UPS should have sent an alert.

                                          The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

                                          To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

                                          The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

                                          That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

                                          The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

                                          The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

                                          Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

                                          Never rely only on internal monitoring. Never.

                                          #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

                                          Wulfy—Speaker to the machinesN This user is from outside of this forum
                                          Wulfy—Speaker to the machinesN This user is from outside of this forum
                                          Wulfy—Speaker to the machines
                                          wrote last edited by
                                          #20

                                          @stefano

                                          You are the hero I aspire to be!

                                          Stefano MarinelliS 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups