Wed Dec 28 20:05:44 EST 2005

Christmas Present from Choice One


As far as Christmas celebrations go this year was typical for us. Two fun and exhausting days where the kids get overloaded with excitement and my wife and I get less sleep that we would have on a typical day during the work week. All in all it was a good time with family and the presents were mostly practical for the adults which is what I wanted anyway.

My extra fun was on Dec 23rd just before the holiday weekend. I was thinking it was going to be a nice easy day. I was scheduled to install a new PC for someone around 10AM but instead was redirected to one of our largest customers, a multi-branch local bank. There really isn't much to the story to be told really but keep in mind that even though it's short it was highly stress inducing.

The problem specifically was that every branch was cut off from the main branch. Meaning they could not access any of their files located on the server there. Nor could their database function without the links being up 100%. The odd thing though was that I could access the Internet, ping all the servers across the VPN for all the other branches, and even PCAnywhere to them. What I couldn't do was access a SMB share or see anything other than local LAN systems in the browse list. It would have made 100% perfect sense if I couldn't ping, PCAnywhere, etc but this seemed to target just SMB related things. We had a case open with Microsoft support recently for an issue that was close enough that I could get help without started a new case so I called them up and we checked out the server. It turned out that, for some odd reason, ports 135, 139, and 445 were being blocked and it appeared that Windows wasn't the culprit this time. More detective work showed me that this was the case at all branches that used Choice One as their managed VPN provider but thing were perfectly fine at branches that didn't use Choice One. The next step was clear, and that was to call Choice One up and see what the heck was going on.

Once I slogged through the menus and got an actual person (located in the USA no less!) he didn't see anything wrong, no open tickets, no troubles reported, and no notes related to recent security related changes. Still, I had already proven that I couldn't telnet to any of the SMB related ports, and Windows looked OK so I asked him to check the ports instead of trusting the account notes. Besides, if the OS was the problem we would likely have had local problems as well and been only limited to effected servers. He opened up the ports and in a flash things were working as before.

It turns out that over the past few months Choice One had been closing off the SMB related ports for their clients on externally accessible addresses to combat the spread of viruses. It seems someone took it upon him/herself to close off these ports on the banks VPN stopping it on the internal side and then didn't make any notes in the case related to it. And in doing so managed in a few moments to bring every branch to a halt.

Really, it was such a simple problem at the root of it all, but it took on the order of 5 hours to pin point he problem, get it fixed and then tested. All the while having the manager freaking out and thinking it was some how our fault. Anyway, at least it worked out. The bank was up by the end of the day and it wasn't caused by anything we or anyone at the bank did wrong. Even so, thanks goes to Choice One for poor documentation, lack of procedure, and inducing a mountain of stress just prior to a holiday due to some individuals incompetence.

Posted by Brian | Permalink | Categories: Computers and Technology, Miscellaneous, Work | |