From: Flemming Videbaek (videbaek@sgs1.hirg.bnl.gov)
Date: Thu Aug 14 2003 - 09:59:17 EDT
RCF 8/14/2003 HPSS Star tape had gotten stuck. Attempt repacking to solve. Single authentication method is in place. Light usage. DISK Monitoring in details to understand crashes; load (many transaction problem) Happened to Phobos Wednesday that nfs was quite low, low through put. Time-outs , lack of response. A system wide problem i.e. can happen to any of the rmine systems. Phenix had one bad server (file-corruption, many disks had been replace in the last couple of weeks. System had been rebuild. LINUX Changes to Condor scripts as per e-mails. To match outside default settings. Separate RH8 for submission rcsruser3. @sys name still missing on rplay10. Will be done today. Local disks on new sys will be RAID setup. LSF New license file arrive 2200, enough to cover additional machines; would then be too few for next upgrades. Cannot (unlikely) negotiate a larger number soon. SOLARIS Web server upgrade + CTS server will happen at same time. AFS Schedules for change of cell name. Edward says it is not a big deal on RCF end. Problem is the cell name is used as seed for password. Stop afs password and go to Kerebos passwords (one day) remote AFS. What if they (remote sites) do not have a kerebos5 client? Can/will setup a fake afs-server (was done for ATLAS). Update local celldb for machine that has disk mounted. { rhic.bnl.gov will be new name..} ------------------------------------------------------ Flemming Videbaek Physics Department Brookhaven National Laboratory tlf: 631-344-4106 fax 631-344-1334 e-mail: videbaek@bnl.gov
This archive was generated by hypermail 2.1.5 : Thu Aug 14 2003 - 09:59:51 EDT