Spirit of Service

By

I worked for and deeply admired the old Bell System. There was an obsession with quality and with service. Every employee had the feeling in their bones that their job was important, even essential.

The classic picture of a lineman up on a pole in a storm, ensuring that service was uninterrupted captures how people really felt. Switchboard operators would skip going home so that they could provide service during a storm.

The design guidelines were clearly stated: There could be no more than 1 minute of  cumulative downtime in a central office prefix during a four year period. The entire central office could have no more than three minutes of cumulative downtime over forty years. Dial tone had to be rendered 97% of the time within six seconds during the business hour of the year. During a National emergency, emergency services would be unaffected and have priority. In the event of a central office fire or flood, the central office was to be restored within 30days no matter how sever the damage.

The internet just suffered the largest outage in history for no justifiable reason. During perfectly fine weather the internet simply went down…for hours. Phone calls, emergency calls, video and commerce went down because of two cable breaks located thousands of miles apart.

This could never, ever have happened under the old Bell System. Calls would be re-routed. The network was physically diverse. 24 hour a day NOC (Network Operations Center) personnel would instruct switches to route around traffic, calls would be load balanced. The problem would be minimized.

The internet was specifically designed to self-heal. It was an explicit requirement by the Department of Defense (who oversaw the design of the internet) to make it invulnerable. No combination of cable or switch failures is supposed to allow such a massive failure as we just experienced.

But it happened. Something is seriously wrong. I don’t know what it is that is wrong, but this should scare the daylights out of everybody. Te fact that the Internet failed should be a wake-up call that something is badly amiss.

I don’t know what the issue is. Perhaps.

  • Lack of geographic diversity in fiber cables.
  • Lack of alternate rerouting in router tables?
  • Over consolidation of the industry?
  • Lack of caring about outages by management or regulators?
  • Mismanagement?

This outage should be a wake up call.

 

But I don’t think it will raise many alarms. Who really cares if the network goes down?

Comcast is going to make the same money whether it is up or down. So will cell phone companies? Those that suffer financial losses cannot sue because of service agreement prohibitions. Regulators don’t seem to care much about the public.

Who really would care? (Hint: The fact that you do care doesn’t count.)

Colin Berkshire