Re: [Brahms-dev-l] RCAS status

From: Flemming Videbaek <videbaek_at_bnl.gov>
Date: Thu, 1 Feb 2007 18:43:21 -0500
Well,

This sounds it is not a local problem. I.e. it may be related to netwrok access, and your jobs
rying to access certain files. Is it that the jobs gets no cpu time? The foreign  condor jobs are certainly running a a high nice 
level and ought not to cause problems.
If you want we can schedule a test when you do interactive work. I can kill foreign condor-jobs
on specficic machines, though it is not so nice to do, and see for a test if this is the cause or what?

Flemming
--------------------------------------------
Flemming Videbaek
Physics Department
Bldg 510-D
Brookhaven National Laboratory
Upton, NY11973

phone: 631-344-4106
cell:       631-681-1596
fax:        631-344-1334
e-mail: videbaek @ bnl gov
----- Original Message ----- 
From: "Bekele, Selemon" <bekeleku_at_ku.edu>
To: "Flemming Videbaek" <videbaek_at_bnl.gov>
Sent: Thursday, February 01, 2007 3:47 PM
Subject: RE: [Brahms-dev-l] RCAS status



Hi,

   this has actually been a problem for the last couple of
days. It was virtually impossible to run any interactive jobs
on may of brahms nodes. I have been using rcas0032, rcas0034.

selemon,

-----Original Message-----
From: brahms-dev-l-bounces_at_lists.bnl.gov on behalf of Flemming Videbaek
Sent: Thu 2/1/2007 12:53 PM
To: hongyan_at_ift.uib.no; devlist
Subject: Re: [Brahms-dev-l] RCAS status

Hi

This has been the policy of having general queues running on our clusters for quite a while.
Unless the jobs are special this in general has a very small impact, since they are niced; our condor jobs have priority.
Thus give me specific information on what machines seems to be slow, so one can look at the
performance. One easy way is to do a top
- check for io/wait. If this is large (>20% there is a potential problem)
 - CTRL M ->order jobs by memory usage. There may be very large jobs doint little.

if any of these conditions are fullfield there may be a problem. If not it is perception, not a real impact (95% confidence level)

/fv


--------------------------------------------
Flemming Videbaek
Physics Department
Bldg 510-D
Brookhaven National Laboratory
Upton, NY11973

phone: 631-344-4106
cell:       631-681-1596
fax:        631-344-1334
e-mail: videbaek @ bnl gov
----- Original Message ----- 
From: "Hongyan Yang" <hongyan_at_ift.uib.no>
To: "devlist" <brahms-dev-l_at_lists.bnl.gov>
Sent: Thursday, February 01, 2007 1:31 PM
Subject: [Brahms-dev-l] RCAS status


> Dear all,
>
>   I am wondering if any of you've noticed that almost all of our RCAS
> machines are busy with some condor jobs (not from BRAHMS) - which
> affected the speed of a terminal to complete any local session. This has
> been for quite a while - is there any restriction for people from other
> collaboration to use our cluster?
>
> Hope this can be solved (or at least compromised), to make sure at least
> we have enough machines to do our jobs.
>
> Best regards,
>  Hongyan
>
> -- 
> Hongyan YANG
> Department of Physics and Technology  Phone:  +47 55 58 27 25 (Office)
> University of Bergen
> Allegt. 55, 5007 Bergen       Fax  :  +47 55 58 94 40
> Norway           Email:  hongyan[at]ift.uib.no
>
>
> _______________________________________________
> Brahms-dev-l mailing list
> Brahms-dev-l_at_lists.bnl.gov
> http://lists.bnl.gov/mailman/listinfo/brahms-dev-l
>
_______________________________________________
Brahms-dev-l mailing list
Brahms-dev-l_at_lists.bnl.gov
http://lists.bnl.gov/mailman/listinfo/brahms-dev-l


_______________________________________________
Brahms-dev-l mailing list
Brahms-dev-l_at_lists.bnl.gov
http://lists.bnl.gov/mailman/listinfo/brahms-dev-l
Received on Thu Feb 01 2007 - 18:44:16 EST

This archive was generated by hypermail 2.2.0 : Thu Feb 01 2007 - 18:44:37 EST