HPCC and Perceval maintenance scheduled (+ improvements to the storage, scheduler, and connection to RU network)
Ryan Novosielski
Friday, March 17, 2017 at 4:33 pm
HPCC Users,
A maintenance period has been scheduled for the HPCC and Perceval clusters from 10:00a on Wednesday, 3/22 to 6:00p on Friday, 3/24. This maintenance is necessary to correct a problem we have been experiencing with our storage system. We will also install some scheduler upgrades, add a redundant connection to the Rutgers network for improved performance and reliability, and improve the layout of our InfiniBand fabric. If the maintenance is concluded sooner, the clusters will be returned to service before the end of the maintenance period and an announcement will be sent to the user community. A maintenance reservation has been put into place that will delay any job that would not complete by 10:00a on Wednesday, 3/22; such jobs will run after the maintenance has completed. Please note that there will be no access to either HPCC or Perceval while maintenance is in progress. Please plan accordingly.
Thank you for your patience as we work to improve our systems. Please respond to this message if you have any questions or need additional information about this maintenance period.
-The OARC Sysadmin Team