Publication from : Darren Moore (darren.moore@stfc.ac.uk)
Dear all,
The Stratum-1 service at RAL-LCG2, which serves both the EGI and WLCG communities, is currently in a long downtime (28th June to 14th August) while it is rebuilt on new hardware.
When the downtime was declared, we removed RAL-LCG2 from the global config, so any site relying on that config will simply stop trying to access RAL-LCG2 and will use Nikhef for the EGI repositories or CERN for the WLCG repositories.
For sites that were explicitly configured to use RAL, the Stratum-1 alias was configured to return a 503 error when queried (on ports 80 and 8000), which should make the failover rapid. We hoped that this would make the absence of the RAL-LCG2 Stratum-1 transparent to sites and users.
It has been brought to our attention that there are multiple GGUS tickets (e.g. https://ggus.eu/index.php?mode=ticket_info&ticket_id=167385) about failing jobs at sites, apparently caused by attempts to connect to the RAL-LCG2 Stratum-1. If a GGUS ticket is raised against your site for a similar issue, feel free to add lcg-support@gridpp.rl.ac.uk to the cc list and we will provide support if it is related to the RAL-LCG2 Stratum-1.
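As a rough aid for sites wondering whether they are explicitly configured to use RAL, the following is a minimal sketch rather than a supported tool. It assumes the site's CernVM-FS client lists its Stratum-1 servers in a semicolon-separated `CVMFS_SERVER_URL`-style value, and that the RAL alias sits under the `rl.ac.uk` domain (inferred from the support address above). The function name and the host names in the example are hypothetical placeholders.

```python
from urllib.parse import urlparse


def find_servers_in_domain(server_url_value, domain):
    """Return the entries of a CVMFS_SERVER_URL-style value whose host is in `domain`."""
    matches = []
    # The value is assumed to be a semicolon-separated list of Stratum-1 URLs,
    # optionally wrapped in double quotes as it would be in a config file.
    for entry in server_url_value.strip().strip('"').split(";"):
        entry = entry.strip()
        if not entry:
            continue
        host = urlparse(entry).hostname or ""
        # Match the domain itself or any host beneath it.
        if host == domain or host.endswith("." + domain):
            matches.append(entry)
    return matches


# Hypothetical value: both host names are placeholders, not real aliases.
example_value = (
    "http://stratum1.example.rl.ac.uk:8000/cvmfs/@fqrn@;"
    "http://stratum1.example.org:8000/cvmfs/@fqrn@"
)
print(find_servers_in_domain(example_value, "rl.ac.uk"))
```

Any entry the function returns is a server URL that still points into the given domain. A site would substitute the value from its own client configuration for `example_value`.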
Darren
RAL Tier-1 Operations Manager