|
|
Line 109: |
Line 109: |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| <!-- ***********************Start T1 text*********************** -----> | | <!-- ***********************Start T1 text*********************** -----> |
− | '''Tue 21st January''' Report for the Experiments Liaison Report (21/01/2019) is [https://www.gridpp.ac.uk/wiki/Tier1_Operations_Report_2019-01-21 here]. | + | '''Tue 21st January''' Report for the Experiments Liaison Report (28/01/2019) is [https://www.gridpp.ac.uk/wiki/Tier1_Operations_Report_2019-01-28 here]. |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| <!-- **********************End T1 text************************** -----> | | <!-- **********************End T1 text************************** -----> |
− | * Embarrassingly it's still business as usual at Tier-1 with very little to report. | + | * CPU Efficiencies are looking bad for ATLAS and CMS. This appears to be a global problem for CMS (i.e. all sites have very poor efficiency). The Liaisons have been tasked to investigate. |
− | * Last week's ARC-CE issues are steadily being resolved. This includes the creation of a new ARC-CE that is hoped to be in production by the end of this week. | + | |
− | * We did experience a network issue over the weekend that impacted cvmfs. As such, we took a hit on batch farm CPU efficiencies. The issue was resolved but as the efficiencies are calculated on the completion of jobs it will take a couple of days for the efficiencies to be back up to normal. | + | * Some CMS GridFTP errors due to an “Address already in use” problem. This was due to new hardware being put into production missing the fix that had been applied to the old machines. This was quickly resolved (intermittent errors for ~24 hours). |
| + | * A disk server in Castor for LHCb ran into problems over the weekend and had to be removed from production while the disk array is being rebuilt. Some LHCb files are temporarily unavailable (although they are in Echo so if the LHCb fail-over mechanism is working, there should be no failed jobs!). |
| + | * CMS submitted a GGUS over the weekend due to intermittent SAM failures connecting to Castor. Under investigation. UPDATE 29/1/2019: This was resolved PM 28/1/2019 |
| |} | | |} |
| <!-- ****************Start Storage & DM****************** -----> | | <!-- ****************Start Storage & DM****************** -----> |