One risky point
Single point of failure. That's the right term for talking about the mess in San Francisco, where last week the city government finally regained control of its backbone network. Terry Childs, the net admin jailed for locking down administrative access, turned over the passwords during a secret visit from Mayor Gavin Newsom.
Childs' lawyer said Childs hadn't divulged the passwords sooner because he believed "none of the persons who requested the password information ... were qualified to have it," according to court filings.
We're starting to get solid information on the case, now that the impasse has broken (see story). But until Childs revealed the passwords, all we knew for sure was that Childs was in jail and that the network was still working but couldn't be managed.
Beyond that, it's been Rashomon in IT. Depending on who's telling the story, Childs is a brilliant network engineer who did nothing wrong. Or possibly a cyberterrorist who held the government hostage. Or maybe just an overstressed, burned-out guy who's the victim of a misunderstanding.
San Francisco's IT management? That's a bunch of tech-clueless bureaucrats. Or maybe it's a gang of goons who are out to get Childs no matter the cost. Or perhaps it's a group of conscientious public servants whose only concern was regaining control of a crucial network that might have been full of booby traps.
Childs' erstwhile co-workers? They're halfwits who couldn't manage that backbone with both hands and a map. Or innocent victims of a network guru with a God complex. Or enablers who helped create the mess by their silence.
From news reports to blog comments, the reactions have been stunning in their vehemence and variety. And there's not one yawning gulf here, but many: between techies and nontechnical managers, between gurus and regular IT grunts, between designers and administrators, between security wonks and operations guys, between practicing network experts and best-practices pundits.
It seems like suddenly we can agree on nothing. But maybe we can all recognize this:
Terry Childs was a single point of failure.
Never mind whether he's saint or sinner, villain or victim. Set that aside for now.
Focus on this: Childs was the only guy who understood that fiber backbone network. He designed it. He ran it. He maintained it. He controlled it. And nobody could replace him.
In other words, a single point of failure.
Forget whether that situation was because of cheapness, arrogance, incompetence or paranoia. The result was the same: If something happened to Childs -- a stroke, a car accident, a breakdown, a job-related "misunderstanding" -- that single point would fail.
And it did.
Look, this San Francisco fiasco has thrown a spotlight on every ugly division in the IT profession. We see it as a matter of control or expertise or responsibility or stupidity or freedom. We see it as us vs. them , and that reaches into our deepest fears and anger.
So remember this: A single point of failure is a reliability problem. That's something techies and managers, gurus and grunts can understand.
We all have at least one single point of failure lurking somewhere in our IT operations. Waiting until it generates a crisis that spirals into finger-pointing, frustration and fear is not the way to go.
There's really only one good way to deal with a single point of failure: Find it and cure it before it fails.
Frank Hayes is Computerworld 's senior news columnist. Contact him at frank_hayes@computerworld.com.
» posted by ITworld staff
Computerworld
Build your tech library with our book giveaways.
Windows PowerShell 2.0 Unleashed
By Tyson Kopczynski, Pete Handley, Marco Shaw; Published by Sams
Windows PowerShell Unleashed will not only give you deep mastery over PowerShell but also a greater understanding of the features being introduced in PowerShell 2.0–and show you how to use it to solve your challenges in your production environment. Enter now!

Ubuntu Server Administration
By Michael Jang; Published by McGraw-Hill Osborne Media
Realize a dynamic, stable, and secure Ubuntu Server environment with expert guidance, tips, and techniques from a Linux professional. Ubuntu Server Administration covers every facet of system management -- from users and file systems to performance tuning and troubleshooting. Enter now!








