I've installed a trial instance of Bugzilla on quark and quarkprime for general use by the department.
I'm using my chat server/client as a trial product in Bugzilla, and all appears to be working.
I've installed JDK 1.5.0 from Sun on the ACLs. JDK 1.4.2 is still installed but I changed the symlink /usr/local/jdk to point to /usr/local/jdk-1.5.0, and pointed /usr/local/bin/java{,c} to the binaries in /usr/local/jdk-1.5.0/bin. I tried upgrading quark's JDK, but the installer consistently crashed even after marking it as a Linux binary.
After testing it out on quarkprime, I upgraded quark's mailman instance to 2.1.5. The primary motivation is the blanket discard option in the web interface, which allows the discarding of any messages marked as defer.
I installed SDM to manage downloading Sun software for millie.
I've installed the sysstat package on all the ACLs to gain use of the iostat command.
Present: CharlieP, Dan, Skylar
* New backup scheme with /,/usr on one tape once a week. /clients and /var on separate tape.
* Give Hassan codd for DB project
* Hardware list
- Gigabit switch
- IP terminals
- Optical scroll mice
I've gotten MRTG running on image, and am monitoring both the Cisco routers and both the Extreme switches. Go to mrtg.cs.earlham.edu to see the graphs.
I solved a problem with lpd on quark. It was dying with the error message "flock: Resource temporarily unavilable" in its debug output, and sucking all available CPU time. I ended up rebuilding /var/spool/lpd, which solved the problem.
vi was core dumping occassionally with the error message "Resource unavailable". On a guess, I built and reinstalled vi from FreeBSD source. Hopefully that'll solve our problem.
I upgraded the kernel on all the ACLs to 2.4.27 to fix a problem I was experiencing with the ide-cd module. All is good now.
At a user's request, I changed the permissions on su from 4750 to 4755.
After checking out Nagios to see how office1 was doing, I noticed that none of the cairo nodes were reachable, so I went down and attempted to fix it with Toby, with some help from Charlie
Nagios was reporting that Cluster-SW0 was down; none of the switches in the Cairo rack were labelled, so I power cycled all 3. That didn't do anything. I know very little about the clusters beyond their names, so I got Charlie. We tried hopperprime and thought maybe a dead NIC in there, then we speculated that the Cluster-SW0 was bad, but we came to the conclusion that c0 just needed to be power cycled. Seemed to do the trick, for now.
On a tip from Skylar, I checked out office1's connection in the D224 closet, and determined that it's a flaky problem somewhere in there. Maybe a flaky connection on the Summit 48 (port 10), on the patch wall, or on the cable itself. I haven't really tested it out enough. But I wiggled the cable around and office1's connection is back up.
I configured spamd to use BDB files rather than Postgres, and that improves performance considerably. There's something about their Postgres tables that just makes them hideously slow.