  • Datacenter CDCE Room Cooling Failure

    Duration:
    6/16/2021 12:30 pm — 6/16/2021 3:00 pm

    Group Responsible:
    IT Fabric

    Affected Area:
    ATLAS T1 Compute Nodes, spool0XYZ Systems

    Expected Impact:

    Maintenance Type:
    Unplanned/Outage

    Description:
    There was a major cooling failure in the CDCE room in SDCC's datacenter earlier today (6/15), starting around 12:30 PM EST, due to an issue with the chilled water system in the building. Temperatures rose quickly, and around 1:00 PM the monitoring software automatically shut down compute nodes in that room to avoid equipment damage. This affected all ATLAS T1 compute nodes and a large portion of the shared pool (all spool0XYZ systems). Parts of our RHEV system were also affected. The building chilled water circulation was repaired by approximately 3:00 PM. The farm equipment was then powered back on and opened to jobs after the room temperature stabilized at 3:30 PM.

    At this time we believe all affected services have been restored. If you continue to experience issues, please submit a ticket to RT.

  • Major Upgrade on Federated IdP Proxy

    Duration:
    6/8/2021 10:00 am — 6/8/2021 11:00 am

    Group Responsible:
    Services & Tools

    Affected Area:
    Accessing external resources using federated IdP authentication

    Expected Impact:

    Maintenance Type:
    Transparent Upgrade/Maintenance

    Description:
    The proxy for federated IdP will be upgraded at 10 am on 6/8/21. The upgrade is expected to be transparent, but users accessing external resources using federated IdP during this scheduled window may need to re-authenticate.

  • SDCC User Support Survey

    Thu Jul 8 15:06:55 EDT 2021

    This item has been posted to rhic-rcf-l@lists.bnl.gov, sdcc_users-l@lists.bnl.gov, usatlas-users-l@lists.bnl.gov

    Dear SDCC user,

    Would you please take a moment to visit and complete a short survey regarding our support system? We are interested in your thoughts on all aspects of your support experience with our facility, including the support interface, reporting queues, and any opinions you may have on our tools and processes.

    Please visit the survey at your convenience:
    https://tinyurl.com/sdcc-users

    While the survey is only a few questions and won't take long to complete, your response will be invaluable toward helping us evaluate and improve our support interfaces and user interaction.

    Thank you,
    John De Stefano, on behalf of SDCC

  • NX servers Version upgrade - Reminder

    Thu Jun 24 08:40:13 EDT 2021

    This item has been posted to rhic-rcf-l@lists.bnl.gov, sdcc_users-l@lists.bnl.gov, usatlas-users-l@lists.bnl.gov, bnl-shared-tier3-l@lists.bnl.gov

    Summary:
    The current NX servers running version 6 will be upgraded to version 7.

    Duration:
    June 24th (Thursday) 9:30 am - 10:30 am

    Group Responsible:
    GCE

    Affected Area:
    NX service

    Expected User Impact:
    The NX sessions will be terminated. Please save your work.

    Maintenance Type:
    Service Interruption

    Submitted By:
    Saroj Kandasamy, saroj@bnl.gov

    Description:
    The NX servers will be upgraded to resolve issues in the current version.

  • Due to hardware related issues, some files will be unavailable in /gpfs01

    Duration:
    4/19/2021 9:00 am — 4/21/2021 9:00 am

    Group Responsible:
    IT Services

    Affected Area:
    Central storage

    Expected Impact:
    Users can continue to read from and write to the filesystem, but reads of some files will fail with I/O errors.

    Maintenance Type:
    Unplanned/Outage

    Description:
    Due to hardware-related issues, some files in /gpfs01/ will be unavailable, and attempts to read them will return an I/O error. Users can otherwise continue to read from and write to the filesystem.

  • NoMachine/NX Service Update

    Duration:
    5/3/2021 8:00 pm

    Group Responsible:
    IT Services

    Affected Area:
    NX service

    Expected Impact:
    The NX sessions on nx01/nx02/nx06/nx07/atlasnx01/atlasnx02 servers will be terminated. Please transition to the new NX service.

    Maintenance Type:
    Planned Maintenance/Downtime

    Description:
    The current NX service (on nx01/nx02/nx06/nx07.rcf.bnl.gov and atlasnx01/atlasnx02.usatlas.bnl.gov) will be deprecated on May 4th, 2021. Please follow the instructions for the new NX service: https://www.sdcc.bnl.gov/resources/services/nomachine-nx

  • Public MatterMost channel for new website

    Duration:

    Group Responsible:

    Affected Area:

    Expected Impact:

    Maintenance Type:
    Information

    Description:
    The SDCC recently unveiled its new website, http://www.sdcc.bnl.gov, which serves as the entry point to facility services and support. In addition, we have created a public MatterMost channel for feedback and recommendations on the new website. Users are welcome to join this channel (see link below) and participate.

    https://chat.sdcc.bnl.gov/bnl/channels/sdcc-website-feedback

    Please note that this channel is not meant to be used for support issues (broken links, missing documentation, requests for changes, etc.). Support requests must be made through the RT ticket system (go to http://www.sdcc.bnl.gov and select 'Get Help').

  • HPC clusters update

    Duration:

    Group Responsible:

    Affected Area:

    Expected Impact:

    Maintenance Type:
    Information

    Description:
    Hi all,

    The clusters have been back to normal operations since 12:30 pm.

    The systems affected were the IC cluster, the Skylake cluster, and the Volta cluster.

    Regards,
    Costin

  • Globus endpoint SDCC will be down for system updates

    Duration:
    3/24/2021 9:00 am — 3/24/2021 10:00 am

    Group Responsible:
    IT Services

    Affected Area:
    Globus endpoint “SDCC”

    Expected Impact:
    All transfers related to Globus endpoint “SDCC” will be interrupted.

    Maintenance Type:
    Planned Maintenance/Downtime

    Description:
    Globus endpoint “SDCC” will be down for system updates on March 24th, 2021 between 1 pm and 2 pm.