Well, this sounds like it is not a local problem, i.e. it may be related to network access and your jobs trying to access certain files. Is it that the jobs get no CPU time? The foreign condor jobs are certainly running at a high nice level and ought not to cause problems. If you want, we can schedule a test while you do interactive work. I can kill foreign condor jobs on specific machines, though it is not so nice to do, and see as a test whether this is the cause.

Flemming

--------------------------------------------
Flemming Videbaek
Physics Department
Bldg 510-D
Brookhaven National Laboratory
Upton, NY11973
phone: 631-344-4106
cell: 631-681-1596
fax: 631-344-1334
e-mail: videbaek @ bnl gov

----- Original Message -----
From: "Bekele, Selemon" <bekeleku_at_ku.edu>
To: "Flemming Videbaek" <videbaek_at_bnl.gov>
Sent: Thursday, February 01, 2007 3:47 PM
Subject: RE: [Brahms-dev-l] RCAS status

Hi,

this has actually been a problem for the last couple of days. It was virtually impossible to run any interactive jobs on many of the brahms nodes. I have been using rcas0032, rcas0034.

selemon,

-----Original Message-----
From: brahms-dev-l-bounces_at_lists.bnl.gov on behalf of Flemming Videbaek
Sent: Thu 2/1/2007 12:53 PM
To: hongyan_at_ift.uib.no; devlist
Subject: Re: [Brahms-dev-l] RCAS status

Hi

It has been the policy for quite a while to have general queues running on our clusters. Unless the jobs are special, this in general has a very small impact, since they are niced; our condor jobs have priority. So give me specific information on which machines seem to be slow, so one can look at the performance. One easy way is to run top:
- check for I/O wait. If this is large (>20%) there is a potential problem.
- press M (Shift+m) to order jobs by memory usage. There may be very large jobs doing little.
If any of these conditions are fulfilled there may be a problem.
If not it is perception, not a real impact (95% confidence level)

/fv

----- Original Message -----
From: "Hongyan Yang" <hongyan_at_ift.uib.no>
To: "devlist" <brahms-dev-l_at_lists.bnl.gov>
Sent: Thursday, February 01, 2007 1:31 PM
Subject: [Brahms-dev-l] RCAS status

> Dear all,
>
> I am wondering if any of you have noticed that almost all of our RCAS
> machines are busy with some condor jobs (not from BRAHMS), which
> affects the speed of a terminal to complete any local session. This has
> been the case for quite a while. Is there any restriction on people from
> other collaborations using our cluster?
>
> Hope this can be solved (or at least a compromise found), to make sure
> we at least have enough machines to do our jobs.
>
> Best regards,
> Hongyan
>
> --
> Hongyan YANG
> Department of Physics and Technology    Phone: +47 55 58 27 25 (Office)
> University of Bergen
> Allegt. 55, 5007 Bergen                 Fax : +47 55 58 94 40
> Norway                                  Email: hongyan[at]ift.uib.no
>
> _______________________________________________
> Brahms-dev-l mailing list
> Brahms-dev-l_at_lists.bnl.gov
> http://lists.bnl.gov/mailman/listinfo/brahms-dev-l

Received on Thu Feb 01 2007 - 18:44:16 EST
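The I/O wait check suggested in the thread (more than 20% is a potential problem) could also be scripted rather than read off top by eye. What follows is only a minimal sketch, assuming a Linux node that exposes /proc/stat with an iowait column; the function names and the way the threshold is applied are illustrative assumptions, not part of any BRAHMS or RCAS tooling.

```python
import time

# Rule-of-thumb threshold quoted in the thread (percent of CPU time in I/O wait).
IOWAIT_THRESHOLD = 20.0


def parse_cpu_line(line):
    """Return the counters from the aggregate 'cpu' line of /proc/stat."""
    fields = line.split()
    if not fields or fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    # Order: user nice system idle iowait irq softirq steal.
    # Guest time is already counted in user/nice, so later fields are skipped.
    return [int(value) for value in fields[1:9]]


def iowait_percent(before, after):
    """Percentage of CPU time spent in I/O wait between two 'cpu' samples."""
    deltas = [b - a for a, b in zip(parse_cpu_line(before), parse_cpu_line(after))]
    if len(deltas) < 5:
        return 0.0  # no iowait column available
    total = sum(deltas)
    if total <= 0:
        return 0.0  # no time elapsed between the samples
    return 100.0 * deltas[4] / total


def read_cpu_line(path="/proc/stat"):
    """Read the first line of /proc/stat, which aggregates all CPUs."""
    with open(path) as stat_file:
        return stat_file.readline()


def sample_iowait(interval=1.0):
    """Sample /proc/stat twice, `interval` seconds apart, and return % iowait."""
    before = read_cpu_line()
    time.sleep(interval)
    return iowait_percent(before, read_cpu_line())
```

On a node that feels slow, calling sample_iowait() and comparing the result with IOWAIT_THRESHOLD gives the same number top reports as I/O wait, averaged over the sampling interval. It does not replace the memory-usage check, which still needs a look at the process list.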
This archive was generated by hypermail 2.2.0 : Thu Feb 01 2007 - 18:44:37 EST