|
Jobs Successfully submitted to Tier-0. Problem with Infrastructure lead to multiple submissions of same job. Should be solved today
|
Transfer and Data Management Activities SRM and SRB Export Buffers available.
Discussion about when to update RLS
|
|
Analysis Activities Waiting for Events at processing centers. Updates should be posted to the list |
Hardware Identified Hardware Requirements from Facilities Groups CERN Tier-0 (Equipment status as of Feb 5)
FNAL Tier-1 (Equipment status as of Feb 2)
PIC Tier-1 (Equipment status as of February 3)
GridKa Tier-1 (Equipment status as of February 3 )
UK Tier-1 (Equipment status as of January 23)
IN2P3 Tier-1 (Equipment status as of February 5)
CNAF Tier-1 (Equipment status as of February 6)
|
|
|
print(Date("1 F d, Y")); ?>
|
DC04 Elements
The CMS Data Challenge consists of 3 primary area
- Event Reconstruction at the TIer-0 Center at 25Hz for the period of a month
- Transfer of raw and reconstructed data to distributed Tier-1 centers
- Access of events at distributed center for analysis-type applications
The status of the 3 main challenge elements is described below.
DC04 Daily Logs
Log April 16
We will try to rerun the jobs that failed last weekend. This will be something like 100 jobs running through the weekend. Tony will inform people when data should start to show up.
We will give back to IT about 100 CPUs today as we do not anticipate requiring them for DC04 anymore. Tony believes we can get about 10M more events through the system by the end of April using 2-300 CPU's. This data is at CERN and can hopefully be built into runnable datasets in the next days.
(This would mean we eventually get about 15M events fully through the system in the DC04 period, about 1/3 of what we originally planned, but not too bad an achievement)
We will also return some of our disk buffers. For starters we can return one from the gdb. We ask the SRM and SRB EB teams to propose a route for us to liberate 1/4 disk servers from each. Clearly this will require some thought to achieve this. The ball is in the EB managers court to tell us when this can be done.
CNAF reported a Castor stager problem that they are recovering from but may impact the start up of transfers to CNAF
Nicola reported that some 200 analysis jobs ran last night. We look forward to a report on them. When the CNAF Castor problems are solved we would like to see a timestamp analysis showing the realtime analysis performance. Tony points out that a key feature to get the analysis turnround optimized will be to optimize the export buffer file selection algorithms; to move away from random selections to ones that ensure that datasets (or filesets required by an analysis job) get completed more quickly.
Lassi reported that a new COBRA/ORCA will be available (Stephan notes it is available now) There are no ORCA changes but some COBRA improvements to allow location independent virgin metadata catalogs.
