Three days with Fluctuating Internet

We had three days with totally fluctuating internet. The fluctuation was almost like a sine wave. Nobody could really figure out what went wrong and where the problem was.

NOTE : This post is not just another ‘masalla’ post. I am writing down the actual experience I had.

DAY 1 : August 26

All this started on August 26th, sometime in the early morning hours, when browsing speeds and bandwidth usage touched their lowest levels in the last month. I keep monitoring the bandwidth usage (bandwidth graphs and download progress bars appeal to me for some unknown reason. I keep looking at progress bars when I download something. I just get lost in a kind of dreamworld while looking at them.), so I was surprised to see the low usage, because everyone was on campus and usage should have been touching peak levels. It returned to normal after a short period of time and browsing was normal. But this pattern kept repeating itself. I went to attend my class. I returned at 11:30AM and rushed to the server room to check out what was going on. By that time the server room was swamped with phone calls from different research centers.

Nobody was actually able to figure out what was going on. All that we knew was that there was heavy broadcast traffic from a segment on the network. We suspected it was the same problem we had faced the previous week. But isolating the problematic area is a heck of a job, and nobody was ready to check the network devices at the leaf level for the following reasons: (1) it would take almost a day to check the individual NICs in all the labs, and (2) there was no guarantee that the problem would be resolved.

We took the tough decision of shutting down the network in the entire problematic segment. This worked and the network was fine. No fluctuations. But it proved to be a wrong decision. We didn’t inform the people in the affected network (which unfortunately consisted of major research centers at IIIT, i.e. CVIT, CDE, CVEST, LTRC (temp) etc.) and we immediately had to face phone calls from the HODs. One thing I learnt from this situation is that Internet connectivity is equally important for everyone at IIIT, including faculty members, though we keep blaming students for being addicted to the internet. Internet here is not an addiction, it’s a need. We had to bring the network back up. And the rest of the network started fluctuating again. Everybody left for lunch.

As time passed, the frustration among the users grew and everybody was almost shouting. Everybody wanted to know why it was taking so long to solve this problem. After lunch one of the admins went to the problematic area and started debugging at the individual switch level. But he had a really tough time, as most of the switches at the leaf level are unmanaged (you can’t see any error reports unless you plug into the individual switch). And we have a lot of switches (by a lot I mean a real lot of switches). And the switches are cascaded in such a dangerous manner that isolating a problem becomes way more difficult. By evening that day we could isolate two research labs and three other segments which were generating heavy broadcast traffic. We shut them off and everybody left for the day. There was a kind of blackout in those segments. No internet, no LAN.

During the night, I kept monitoring the network. A lot of people pinged and complained about the DNS resolution problem. Web pages were loading at a high speed but the name resolution was taking a lot of time. I tried looking at the logs and the traffic. Everything was fine except that the nameserver was swamped by the mail servers for name resolution. I tried a few hacks but nothing worked.
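A rough sketch of how the "pages load fast but names resolve slowly" symptom could be measured, written in Python. This is only an illustration, not what was run that night: the function name `time_lookups` and the injectable `resolve` parameter are my own assumptions, and the default resolver is the standard library's `socket.gethostbyname`.

```python
import socket
import time
from typing import Callable, Dict, Iterable


def time_lookups(
    names: Iterable[str],
    resolve: Callable[[str], object] = socket.gethostbyname,
) -> Dict[str, float]:
    """Return the seconds taken to resolve each name.

    Failed lookups are recorded as -1.0 so they stand out from slow ones.
    `resolve` is injectable (an assumption of this sketch) so the timing
    logic can be exercised without touching a real nameserver.
    """
    timings: Dict[str, float] = {}
    for name in names:
        start = time.perf_counter()
        try:
            resolve(name)
            timings[name] = time.perf_counter() - start
        except OSError:  # socket.gaierror is a subclass of OSError
            timings[name] = -1.0
    return timings
```

Calling `time_lookups(["localhost", "example.org"])` every few minutes and logging the result would show whether resolution time climbs while page loads stay fast, which points at the nameserver rather than the link.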

DAY 2 : August 27

I didn’t have any class that day. Admin XYZ called me at around 10:30AM and asked me to come to the server room if possible. I was sleeping; I hardly ever wake up by that time. But I didn’t want to miss the opportunity. I got up quickly and rushed to the server room, wasting as little time as possible. I was in the server room at 11:00AM.

The admins suspected some problem with the proxy, as the fluctuation persisted even after cutting off the problematic areas. By the time I reached the server room, the admins had switched over to the standby proxy machine. And to start from zero, the entire network except the main building was shut down. We waited for almost half an hour. Everything worked absolutely fine. No fluctuations at all. So, the main building was fine.

At around 11:40AM, the network was restored in all the hostels. We waited for another half an hour. No fluctuation yet. But a hell of a lot of phone calls were making the situation more tense. Everybody, including senior members, was rushing to the server room. We had suspected some attacks from the hostels on the server in the labs. But we were wrong. The problem was in the library building. But where?

Till lunch time, there was no network anywhere except the main building and hostels. As time passed, the issue became more and more serious. It became difficult to answer phone calls from senior members, as the words “Heavy Broadcast” had now become irritating for them. They had been hearing this for the last two days. But nobody actually knew the exact answer. The origin of the broadcast was still not known.

Admin XYZ rushed to the library switch. Now XYZ was in live contact with admin PQR in the server room, restoring the network in the research centers one by one: restore the network in one research center and wait for half an hour; if there is no fluctuation, proceed, otherwise revert. Using this technique (this was the only solution), we restored the network in all the centers except two. Connections to these centers also cascade to other areas. Complete outage in the two research centers. Everybody left for the day, leaving the two research centers in the dark.
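The restore-wait-check-revert routine is simple enough to write down as a loop. Here is a minimal sketch in Python, purely as an illustration: the admins did this by hand at the switch, and the function `restore_one_by_one` and the callbacks `enable`, `disable` and `is_stable` are hypothetical stand-ins for "plug the uplink in", "pull it out" and "look at the bandwidth graph".

```python
import time
from typing import Callable, List, Sequence, Tuple


def restore_one_by_one(
    segments: Sequence[str],
    enable: Callable[[str], None],
    disable: Callable[[str], None],
    is_stable: Callable[[], bool],
    wait_seconds: float = 30 * 60,  # the half-hour wait described above
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[List[str], List[str]]:
    """Bring segments up one at a time and keep only the harmless ones.

    Returns (restored, suspects). A segment is reverted and listed as a
    suspect if the network is unstable after the waiting period.
    """
    restored: List[str] = []
    suspects: List[str] = []
    for segment in segments:
        enable(segment)
        sleep(wait_seconds)
        if is_stable():
            restored.append(segment)
        else:
            disable(segment)  # revert and move on to the next one
            suspects.append(segment)
    return restored, suspects
```

The cost is obvious from the loop: one full waiting period per segment, which is why a dozen research centers eat up a whole afternoon. The upside is that every segment left running has been observed to be stable together with everything restored before it.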

The network stabilized a bit. The fluctuation was not frequent (almost none). I monitored the network up to 2AM. I didn’t sleep because I had a class at 8:30AM.

DAY 3 : August 28

I had a class up to 10AM. I rushed directly to the server room after the class. We had already narrowed it down to a smaller region. Now the problem was smaller and there were fewer people after us. Admin ABC, with a student, was sent down to inspect individual switches. That’s the problem with unmanaged switches. You have to go and check each and every switch for any error messages. Anyway, we kept narrowing down the problematic area till lunch. I left for lunch and returned to my room, as I hadn’t slept the previous night. I don’t know what happened in the afternoon. I missed that 🙁 At 6:30PM, I called admin XYZ and asked about the status. He informed me that the problem had been isolated. Only two very small labs were left.

Three days and the problem was still there. People were really out of control. Anyway, the network worked perfectly in all areas except those two labs. The good thing was that these labs were at the leaf level and they were not cascading connections further.

DAY 4 : August 29

I had a lab from 10AM-11AM. But it went up to 11:45AM. By the time I reached the server room, the problem was already resolved. Everyone was connected and there were no more complaints. Rawat sir updated me on a few decisions which are beyond the scope of this post. The problem was the routing queries from one of the ISPs connected to those labs at the leaf level.

It really took almost four days to debug this problem. Debugging a network, especially one which is randomly cascaded, has more than one entry point, has no perimeter and has a lot of unmanaged switches, is a real challenge.

Anyway, it was again a learning experience for me. I used to blame people for not being able to solve network problems quickly. I just realized that it’s very easy to blame.

PS : Longest post on the occasion of bloggers’ day 🙂

 

Bangalore Trip

I got a chance to visit Bangalore last month. Actually Google announced a small meetup for all the GSoC students from India at its Bangalore office on July 28th. As I was returning to IIIT on July 25th, I thought of joining others in Bangalore on my way back. Canceled my reservation (Delhi -> Hyderabad) and re-booked a ticket for Bangalore and back to Hyderabad.

We (me, Sumit Kataria, Gurpartap Singh) boarded the train from Delhi on July 26th. I knew the other guyz only via IRC and had never met them before. They are GSoC students working for Drupal. The journey was fine, but with a bit of bad luck the Rajdhani was late by two hours. We reached the Google office late (by 1 hour). But we didn’t really miss anything, because there was a presentation about what Google is and why it is THE GOOGLE. As we hadn’t had any breakfast in the hurry, we had some snacks at Google. After that there was another presentation by one of the project managers (I forgot the name) about the working style and team structure at Google and how they manage to pull off huge projects with small teams. He talked about how projects evolve at Google and some blah blah which I don’t remember.

After that one-hour presentation, another break. Time for lunch 😀 As expected, the food at Google was just awesome. Then started the real session, where students were asked to give a brief overview of the projects they were working on. A few projects were really tough and a few others were really creative ideas. I too gave an overview of my project. Google Open Source T-shirts and GSoC stickers were distributed. Then the host took us around the technical wings of Google Bangalore. It was an amazing experience. I was blown away by the hardware I saw. The huge LCD monitors in dual display configuration were eye catching. We left the Google campus at around 5PM.

We headed over to MG Road, one of the most famous places in Bangalore. We had a really good time over there. After having dinner at McD we returned home. The next day we planned to visit the ISKCON temple. But it was a complete FAIL as the temple was closed. We got lost and couldn’t find a place to have lunch. After a lot of effort, we landed at the central bus terminal. Had dosa there and left for home. Traffic is just awesome in Bangalore. It took us around 2.5 hours to get back. The journey was frustrating and we were too tired to go anywhere. Yet we left for MG Road again. We enjoyed it a lot. Did some shopping too.

The next day I headed back to Hyderabad. All in all, Bangalore is way techie as compared to Hyderabad. Hyderabad is still in its nascent stages of development. The only negative factor I saw in Bangalore is the terrible traffic.

kulbir saini @ google bangalore

Me presenting my project @ Google Bangalore

Gurpartap @ Google Bangalore

Gurpartap presenting his project @ Google Bangalore

Macbook Air @ iStore

MacBookAir @ iStore

Strange buses found only in bangalore

Strange bus I saw for the very first time 😀

 

Freaking 50 hours

All this started after returning from dinner at around 8:00PM on August 19.

August 19 – 08:30PM – 10:30PM : Slept.

I woke up at 10:30PM and started browsing random stuff. Gave a finishing touch to GSoC project – IntelligentMirror and announced the release for testing.

August 19 – 10:30PM – 12:30AM : Browsing + Blogging + Browsing.

After that it’s time for some refreshments.

August 20 – 12:30AM – 02:00AM : Snacks + Toasts.

Then started the same old thing. Browsing. Why the heck do we have to browse all the stuff in this world which has no meaning 😐 Also, I had to prepare for my presentation about the SMTP protocol in the Topics of Information Security class. So, I downloaded all the RFCs (that’s what we do all the time: download all the academic stuff and feel good about it. Who has time to open and read them 😛 ). Read a bit of stuff from the RFCs.

August 20 – 02:00AM – 07:00AM : Browsing + Reading RFCs + Browsing.

If you are up till 7AM, breakfast is worth a try. Had breakfast, which turned out to be good.

August 20 – 07:30AM – 08:00AM : Breakfast.

After that, I thought I would go to bed and sleep for some time. But then the Internet is a real devil which will not let you sleep. Started browsing again and checking everyone’s status messages. Reading blogs and some more blah blah. For god’s sake, don’t put links like this in your status messages.

August 20 – 08:30AM – 12:30PM : Browsing + Fantastic Contraption + Browsing

If you are up till 12:30PM, it’s good to have lunch before you go to bed 😀

August 20 – 12:30PM – 01:30PM : Lunch.

After lunch, I was desperate to sleep. But then I had this class called “Music Appreciation” at 5PM. I was afraid of losing attendance and didn’t sleep. I read RFCs in the meantime. No browsing this time 😀

August 20 – 02:00PM – 04:30PM : Reading RFCs. No bc.

At around 4:45PM we rushed to coffee shop and then to class. After reaching the class, we realized that prof will not turn up and class stands canceled.

August 20 – 04:30PM – 05:20PM : Coffee shop + Class.

While returning from class, I just wanted to visit the server room to see what was going on (actually the network was fluctuating really badly and I just wanted to know what exactly was going on. I am a student sysadmin, so kinda concerned about these issues). I came to know that some serious problem had occurred due to really heavy broadcast traffic from a few research labs. The sysadmins were zeroing in on the problem. I suddenly forgot about sleeping and all. There may not be a better opportunity to learn how to configure a switch and how to debug a network problem. I was lost somewhere in the switches and servers, and it was 7:00 by the time the admins sorted out the problem.

August 20 – 05:30PM – 07:00PM : Server room (real good experience. Learnt a bit about how to configure switches 😀 )

By the time I left the server room it was almost 7:30PM. So I just rushed to Yuktahaar to have food. Had some good food. Yuktahaar is probably the only mess on campus where you can actually eat something.

August 20 – 07:30PM – 08:00PM : Dinner.

As I had to give the presentation on SMTP the next morning, I thought of reading some more stuff quickly. Read up to 9:00PM. At this time, I was *REAL* desperate to go to bed. But at about 9:19PM, Himank called up and told me about the bloggers’ meet. I was like WTF. Well, I wanted to attend the meeting and rushed to the main building. The meeting went up to 10:30PM. It was real fun and it was a good experience to meet all the bloggers from my batch. We decided on a few things to promote blogging in IIIT. Find the details in the link in next line.

August 20 – 08:30PM – 10:30PM : Reading RFCs + Bloggers Meet.

After returning from the meeting, I thought of finishing the RFCs and preparing the presentation. But the RFCs were a lot longer than I had assumed, so the reading went up to 1:30AM. Prepared the presentation.

August 20 – 10:30PM – 01:30AM : Reading RFCs + Preparing presentation.

I was about to sleep when suddenly Rishabh appeared, and now there is something wrong with the proxy 🙁 We discussed the proxy and other misconfigurations for almost 1 hour.

August 21 – 01:30AM – 02:30AM : Discussion about proxy.

I went to bed at 2:30AM. Felt a bit relaxed. But couldn’t sleep. Why?? I was feeling hungry 🙁 Got up at 2:45AM and rushed to Deepak’s room to have some snacks.

August 21 – 02:30AM – 03:00AM : Snacks + Toasts.

I was in my room at 3:00AM trying to analyze the situation. “If I go to bed now, I’ll not be able to wake up for the 8:30AM class and the whole idea of preparing the presentation will be wasted. Should I sleep or not??” Went to bed but got up again in 15 minutes, thinking that it’s impossible to wake up for the class (lack of confidence??). Browsed stuff for some time and then had a look at the servers. And this time mailman was not working on the students server. Mails to the students mailing list were not being delivered. Tried to debug that for almost 2 hours, but all in vain. Monitored servers for some time and did some more browsing. Please forgive me for sending all the test mails 🙂

August 21 – 03:00AM – 06:30AM : Mailman debugging + Server monitoring + Browsing.

By 6:30AM, I was like “I’ll die if I don’t sleep. But if I sleep, I’ll miss the class. WTF :(( ”. I finally decided to sleep. But I scheduled alarms at full volume on my computer. Scheduled high beat songs (shell scripting rocks 😀 ). And decided to sleep on the chair itself, as it’s difficult to get out of bed.
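The original alarm was a shell script that is not shown here. Purely as an illustration of the same idea, a Python sketch could look like the following; the helper names `seconds_until` and `ring_at` are made up for this example, and the player command passed in is an assumption (any command-line audio player would do).

```python
import subprocess
import time
from datetime import datetime, timedelta
from typing import List


def seconds_until(now: datetime, hour: int, minute: int) -> float:
    """Seconds from `now` to the next occurrence of hour:minute."""
    target = now.replace(hour=hour, minute=minute, second=0, microsecond=0)
    if target <= now:
        # The time has already passed today, so aim for tomorrow.
        target += timedelta(days=1)
    return (target - now).total_seconds()


def ring_at(hour: int, minute: int, player_command: List[str]) -> None:
    """Sleep until hour:minute, then run the given player command.

    `player_command` is whatever plays a loud song on your machine;
    this sketch does not assume any particular player.
    """
    time.sleep(seconds_until(datetime.now(), hour, minute))
    subprocess.run(player_command, check=False)
```

Going to sleep at 6:30AM with `ring_at(7, 45, [...])` leaves 75 minutes, which is what `seconds_until` works out before the script sleeps.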

August 21 – 06:30AM – 07:45AM : Sleeping (on chair 🙁 )

Woke up at 7:45AM and rushed to mess at around 8:00AM.

August 21 – 08:00AM – 08:20AM : Breakfast. OBH mess serves the worst breakfast you can have. They serve chowmein for breakfast. I AM NOT KIDDING.

Now starts the actual hectic session. Continuous classes up to 1:00PM. whoaaaa!!!!!

August 21 – 08:30AM – 10:00AM : Topics of Information Security Class.

August 21 – 10:00AM – 11:30AM : Systems and Network Security Class.

August 21 – 11:30AM – 01:00PM : Numerical Analysis. This class may be suicidal if you haven’t slept for almost 36 hours. Caution next time.

Time for lunch.

August 21 – 01:00PM – 01:30PM : Lunch @ Yuktahaar.

I would have gone to my room to sleep now. But mailman was still not working and students were missing their mails 🙁 Went back to the server room. Did some ad-hoc management to bypass mailman for temporary mail delivery. Fixed a few other things.

August 21 – 01:30PM – 03:30PM : Server room.

Now we had an Infrastructure team meeting at 3:30PM. It was good. We discussed a lot of issues and how to replace the old infrastructure in a systematic manner.

August 21 – 03:30PM – 04:30PM : Infrastructure Team Meeting.

As it’s almost 5PM again, time for the three-hour-long “Music Appreciation” class.

August 21 – 04:30PM – 07:00PM : Coffee shop + Music Appreciation class.

Enough is enough. I *MUST* sleep now. Returned to my room quickly, skipping dinner, and went to bed thinking that I’d sleep for at least 14-16 hours (normally I sleep for 12-14 hours at a stretch). But who knew that it was not possible. I woke up at 12:30.

August 21 – 07:30PM – 12:30AM : Sleeping (in bed 😛 )

Woke up at 12:30AM. And Internet is here again. Browsing. Browsing. Browsing. Browsing. Browsing. And blogging. BTW, a newer version of youtube cache is available now.

August 22 – 12:30AM – 01:30AM : Browsing + Blogging.

Was feeling real hungry. Rushed to Deepak’s room for snacks and toasts.

August 22 – 01:30AM – 02:30AM : Snacks + Toasts.

Browsing again. And I have been writing this god damn post for half an hour. It’s almost 3:30AM now and I am thinking of going to bed again. Hopefully I’ll sleep well this time 😀

August 22 – 02:30AM – 03:30AM : Blogging.

The previous two days were really hectic. No? Well, I can’t really take more than that. I think my biological clocks are out of sync. Need a break.

PS1 : Change your status messages frequently. Don’t bore me.

PS2 : Need to blog more frequently. Have a lot of stuff to blog about. This sem is real happening.

PS3 : Have two courses on security this sem. Wish me luck 😀

 

vacations and few learnings

Just a quick wrap up of happenings in last month before I leave for home.

1. Do what you want and never care what others have to say about it. example -> http://fedora.co.in/

2. Don’t ever think that you are perfect in your field. Nobody is perfect.

3. No machine (computer) in this world is secure. Only the switched off machines are 100% secure.

4. Talk less, do more.

5. Don’t underestimate anyone. You can learn from everyone if you are willing to.

6. Be a rebel.

7. Go home when you get the f**kin vacations.

8. Try not to get flamed and refrain from flaming others as well.

9. Don’t reply to PMs for assistance when you are not the concerned person.

10. Switch on the vacation responder when you go for holidays 🙂

I think that’s enough. Leaving for home. Will be away from computers/internet for almost 20 days. See you if I manage to survive 🙂

 

What’s keeping me busy lately

The previous week was a bit busier than I expected. I wonder why I have to attend a lot of meetings. There were 3-4 of them last week.

Constant complaints about wireless are killing and sometimes frustrating. As Rishabh pointed out, “Become a sysAdmin at IIIT” is a never-do thing. I don’t agree completely. But sometimes you get frustrated enough by the complaints and the incompetence of your colleagues to say that. Keeping that apart, being a sysAdmin is real fun. You get to play with the most critical servers at the place. You learn things that you will never learn in a course or project. (I wonder what we actually learn in a course 😛 One thing I can immediately point out is attendance management.)

Apart from the above, I have been working on IntelligentMirror, my GSoC project, and its sister project, Youtube Caching using squid. I have achieved 100% youtube caching without altering the refresh patterns in squid. That means your squid will not malfunction and will cache youtube videos successfully and in a browsable fashion 🙂 Going to release version 0.2 very soon. Working on caching Google and metacafe videos as well.

And last but not the least, sleep time has increased from 6-8 hrs to 12-16 hrs/day 😛

PS1 : This is the funniest post I have ever seen. A must see for MS by research people 😛

PS2 : Also maruti has posted some nice crap 😛

 

Will SKP and ORB get their MS?

It seems to me that right now the entire IIIT is worried about just two guyz. And the two guyz are SKP and ORB. Why??? Because they are going to get their MS soon. :O Yes!!!! This is an event which will never happen again in the history of mankind. And I am one of those guyz who are waiting for the heavenly event to happen. But I am not the only one who is waiting for this. Among bloggers, this includes Pati himself, Rama, Shark etc., and there are thousands of non-bloggers. In reaction to shark’s blog, after struggling for hours I made these 😛 Click on the image to read it clearly in higher resolution.

SKP ORB MS Comic

SKP ORB MS Comic

No offense to anyone … this is just for fun 🙂

PS : @SKP, ORB don’t kill me sir jee….

 

Sudden affinity for conferences

I never liked attending conferences, irrespective of their theme and goal, for the past three years. But after attending two or three conferences in recent months and coming across new people with matching interests, I am suddenly looking forward to attending any open source/Linux/entrepreneurship conference in Hyderabad or nearby cities. BTW, are you planning to attend Eclipse Demo Camp 2008 – Ganymede Edition in Hyderabad on 25th June?

 

I want reservation

Well, enough has been said about reservation by millions of bloggers and other people who express their views on the Internet, whether through blogs, forums or any other medium. And most of these people (the ones who fall in the non-reserved categories and the ones who think it’s not a good idea) have opposed it. FYI, I fall in OBC (other backward classes), but I never used this because I had enough talent to defeat the problems myself.

We (the IIITians), time and again boast of no reservation at this holy place. And I am also proud of this as this brings THE talent to our institute which makes it THE place that it is. You might be wondering why the heck I want reservation??

Well, recent happenings have forced me to think in a tangential direction. The increasing number of non-IIITians at IIIT is one of them. Right now at IIIT, the ratio of non-IIITians to IIITians is more than 2. Here, I count as IIITians all the students who came to IIIT under the UG, PG (excluding MSIT, check this. It doesn’t list MSIT in IIIT’s PG curriculum) and PhD programs. My friends’ requests for rooms in OBH have been denied because there are no rooms left. Rooms allotted to my friends (they were about to shift in a few days) have been given to outsiders without even informing them. One fine day they came with their luggage to OBH and found out that their rooms had been given to some other people (non-IIITians). All BC spots (Coffee Shop, NBH main gate, playgrounds etc.) are now over-populated. These are just a few things to mention. The people living on campus right now know them much better. All resources (bandwidth is the one I am worrying about the most) are now shared.

All this said, I want some reservation in IIIT for IIITians. Maybe I am being a bit paranoid, but I am serious about whatever I said.

 

I am fedora’ed

Ever since I joined IIIT, I was getting closer and closer to Fedora. For almost the first two and a half years, I was a user and explorer (not the internet one 😛 ). I learnt to use and hack many things as I kept climbing up. A few months ago, I stepped up one more stair and started participating in development channels (mainly #yum).

Though I was not really developing anything, I was reading the source code and constantly trying to break things. Whenever I happened to break something successfully, I discussed it over the mailing list or IRC and got it fixed. It was like a prerequisite to get started with the actual development.

Then my project was accepted for GSOC and I knew that this will really be a huge breakthrough to get started with actual development for open source. I was damn excited as I saw the time coming when I will give back to the open source community.

These days, I am feeling Fedora in everything I do. Being a developer, you get surrounded by Fedora so much that you see Fedora everywhere. I use Fedora all the time. Whatever I code gets pushed to Fedora Hosted. Whatever I write, again, gets reflected at Planet Fedora. I am now addicted to reading blogs from Planet Fedora. They give you a nice place to host open source stuff, as in Fedora People 🙂 You get an email address [kulbirsaini AT fedoraproject DOT org], which you can show off in your friend circle 😛 And that’s all my routine these days.

🙂

 

Recent Happenings

  1. Entire wifi was down for almost two hours today. Emergency situation. I called whoever I could but just wasted time and money. The wifi admin would have felt totally surprised after getting 25-30 missed calls from my number 😛 He pinged back after an hour and things were back in tune. Thank god!!
  2. Celebrated Yum Bug Day today. We fixed and closed a lot of bugs. Was a very good experience.
  3. GSoC project is going fine.
  4. IntelligentMirror is now hosted on fedorahosted.org 🙂 Check out the source code here.
  5. Hooked to Gossip Girl 🙂
  6. Playing cricket at 6AM after a night out is now becoming a habit. Will post about it sometime 🙂
  7. Duration and frequency of BC sessions are on the rise. OBH really rocks 🙂
  8. Intake of caffeine is on the rise as well. Thanks to the OBH coffee machine 🙂
  9. Chatting a lot these days. Don’t ask why 😛
  10. Blogged a lot this month 🙂

🙂