TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Redundant Against What?

32 pointsby r4umabout 4 years ago

4 comments

tkinomabout 4 years ago
I have designed&#x2F;coded two redundant systems before.<p>First one is super complex redundant router done by 100+ engineers. All TCP states, BGP states, complete configurations were replicated to backup CPU. If the primary failed, power off, etc, the standby will take over, etc. When the failed unit comes backup, it become the new stanby. Demo works fine, but the company sold just 1 system for $200K. The company did sold for four hundred millions. It paid off my mortgage.<p>2nd is relatively simple done by me in 3 weeks including testing, etc. It is a system convert 48 channels digital MPEG streams to analog NTSC signals for analog cable plant such as Comcast. Xilinx power PC running linux. All configurations were sync. Auto switch over on power off, network disconnect, etc. The system can detect all failure conditions in 50 milliseconds. Very cool to demo - if you blink your eye and you will miss the system failure over event. Support in service SW upgrade - update backup to new SW, sync config over, force switch the primary and update the new backup to new SW with ZERO downtime.<p>The product was very successful - only one firmware bug found after two years in the field. Sold $100 millions+ to various cable companies. But the VC air drop a &quot;professional CEO&quot; and &quot;scale&quot; the company from 16 people to 300+ people. Did two more rounds and manage to burn the company to the ground and sold it for pennies.<p>Redundant system design are not that hard and can be simple. It is best to remember KISS - &quot;Keep It Simple Stupid&quot;.
smiley1437about 4 years ago
I was called in to troubleshoot system once, a small 2-node cluster with a SAN with no single point of failure in the fabric.<p>They had both their 1500va UPSs plugged into a single 15amp circuit.
评论 #26824471 未加载
two_handfulsabout 4 years ago
Yes, it is indeed a problem when all replicas run the same software.<p>One approach that&#x27;s been suggested is to use multiple independent implementations of the program. Or, if that&#x27;s too onerous, N-Version Programming (<a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;N-version_programming" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;N-version_programming</a>).
cyberlababout 4 years ago
I like the word <i>antifragile</i> instead of redundancy. I don&#x27;t operate a datacenter, but in my own personal computing space, I try to build antifragile systems that can bounce back from failure rather rapidly. It is astonishing for example, just how many times I have to roll back a Windows 10 install because it likes to randomly break for whatever reason. It could be anything: a faulty update that corrupts the whole OS. The hard-drive gets filled up really quickly when doing gaming video captures, or games that do really heavy writes to the SSD and wearing it out within a year of use.<p>Currently I use virtual machines to mitigate this, and you need a beefy setup to do this. If a virtual machines fails, I have a &#x27;template&#x27; virtual disk image where I can start afresh. One caveat to VMs is you can&#x27;t do gaming since you&#x27;re emulating an OS, so when I can I use a bare metal solution for gaming. You can do basic gaming for games that use low resources (like Minecraft). But forget about playing Crysis or other monsters in a VM!
评论 #26824398 未加载
评论 #26824409 未加载
评论 #26824313 未加载
评论 #26824376 未加载