From: Flemming Videbaek (videbaek@sgs1.hirg.bnl.gov)
Date: Fri Aug 22 2003 - 11:55:36 EDT
enclosed is the minutes from last meeting. These are essentially as written down during the meeting- no futher editting /fv 8/21/03 Large UPS at RCF means several things can stay up ~8 hours after a power failure i.e. Web, e-mail , kdc, NIS, ssh,'' ITD keeps up Backbone+Perimeter. RCF will To keep e-mail/web server up as long UPS can be maintained. Outline of plan will written up - what to shut own order an bringing them back up. Disk will be brought up first. HPSS will go down early, but also come up (rather independent). Tests to be done (temperature rise triggers a shutdown.). Module to deliver signal on UPS alarm failed too. There is a time-period when the shutdown can be cancelled. Procedure will be completed an distributed for comments. HPSS 2 tape drives did not come back on. Some disk problems too. Schedule to bring to same level will be done Monday. Time-outs do occur at times, seems to be firmware problem. Should be transparent. DISK 3 disks lost on one controller. Did the automatic rebuild. Lost a total of 2 controllers. 32,36 controllers total. Could have been faulty power system (due to heat ) data07 problem. Inode problem. Veritas not very forthcoming, could not add to it (why did they not notify RCF on this issue when the 1Tb limit was addressed. Plan to restore today. CTS ticket server will brought up to RH9.0 (done from atlas, will be done for RCF next week) Linux 2 power supplies, 2 disk lost. OS upgrade will be done for Phobos next week. A few packages still missing (Intel compilers e.g.) Some minor issues on Ganglia, Condor. Recall the upgrade will wipe out local disks. LSF Recovered well from blackout. Clients to be installed worked under RH8. Upgrade RCF2 to Solaris 9. Almost every single package (latex,..etc) (following week) Kerebos. Imap on rcf2 will eventually need K5. The mail server uses nfs. AFS - upgrade scheduled for early mid September. Major changeover cell name change+going to to K5. Kpassword change from outside may not work after upgrade. 3 servers need to be upgraded before K5 stuff will work. (www, rfc mail, ..K5+NIS) September 8 will be the day. Apache upgrade of www4 server (brahms+phobs) short ~ 5 sec interrupt. Do not expect problems. No tech meeting next Monday. Cyber security audit during (DOE Chicago 2 to 21 off-site from 21 to end on-site) passwords, ITD will collect password files to check for cracking. ------------------------------------------------------ Flemming Videbaek Physics Department Brookhaven National Laboratory tlf: 631-344-4106 fax 631-344-1334 e-mail: videbaek@bnl.gov
This archive was generated by hypermail 2.1.5 : Fri Aug 22 2003 - 11:55:42 EDT