RAL Tier1 weekly operations Grid 20100830
From GridPP Wiki
Revision as of 09:22, 27 August 2010 by Matt Hodges
Operational Issues
Description | Start | End | Affected VO(s) | Severity | Status |
---|---|---|---|---|---|
Job status monitoring from CREAM CE | 2-Feb-2010 | | CMS | medium | [10-Feb-2010] WMS patch available soon; new CREAM CE version available soon. [07-Apr-2010] CMS tests have shown that the WMS patches resolve the problem; still waiting for the patch to be installed on the production WMSs in Italy. [13-Jul-2010] CNAF WMSs have been updated; testing using backfill is in progress. [19-Jul-2010] So far everything looks good. |
Downtimes
Description | Hosts | Type | Start | End | Affected VO(s) |
---|---|---|---|---|---|
Blocking Issues
Description | Requested Date | Required By Date | Priority | Status |
---|---|---|---|---|
HW needed to test Dataguard technology for LFC/FTS | 19 May 2010 | 15 June 2010 | Medium | [24-May-2010] HW available; needs to be deployed by Fabric and then handed over to Dataservices |
Developments/Plans
Highlights for Tier-1 Ops Meeting
Highlights for Tier-1 VO Liaison Meeting
Detailed Individual Reports
Alastair
- Working on ATLAS software server, testing CVMFS [ongoing]
- Working on testing FTS timeout limits.
- Working on HammerCloud test of CASTOR 2.1.9
Andrew
- CMS VOBOXs
- Setting up new CMS VOBOX (+ Squid, PhEDEx)
- Installed Squid on lcgvo-02-21 and ran basic tests [Done]
- CASTOR 2.1.9 stress testing with & without lazy-download [Ongoing]
- Completed disk accounting consistency webpages [Done]
- CMS data ops
- Installed ProdAgent 0_12_18
- VO support survey [Ongoing]
Catalin
- gLite updates WMS03 non-LHC [done]
- ATLAS frontier monitoring [ongoing]
- test SL5 LFC quattor profiles [ongoing]
- work on improving ganglia monitoring for Grid Services
Derek
- CREAM CE quattor profile [ongoing]
- Investigating CREAM CE instability [ongoing]
- At GridPP meeting Mon-Thu, A/L Friday and following week
Matt
- Change Controls for FTS FE updates.
- Build quattorised gLite3.1 PX test node [Done]
- Audit WLCG pledges vs. deployed disk [Done]
- Look at asciidoc build system for Grid Services docs [Done]
- Build quattorised gLite3.2 FTS test node [Done]
Richard
- Submitted change control request for replacing RAL site-level BDIIs with Quattorised machines
- Working on the "team status page" being developed as an action from team awayday [ongoing]
- Reviewing Grid Services process documentation [ongoing]
- CASTOR items:
  - Chase up last few "non-runners" in 2.1.9 tests
VO Reports
ALICE
ATLAS
CMS
LHCb
OnCall/AoD Cover
- Primary OnCall:
- Grid OnCall:
- AoD: Catalin (Wed)