CMS Data Challenge 2004 Started March 1st    

Current Action Items | Recent Decisions
OO Software Home | PRS Pages
JetMet DC04 Page
PCP Production Status
GridIce DC04 Monitoring | GridIce Main Page
Monitoring DC04 | General MonaLisa Page
Database Browser | Pablo's T1 Monitor | Database Schema
DC04 at CERN | LCG DC04
subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link | subglobal8 link

 

DC04 StatusCMS LOGO

 

DC04 Elements

The CMS Data Challenge consists of 3 primary area

  1. Event Reconstruction at the TIer-0 Center at 25Hz for the period of a month
  2. Transfer of raw and reconstructed data to distributed Tier-1 centers
  3. Access of events at distributed center for analysis-type applications

The status of the 3 main challenge elements is described below.

DC04 Daily Logs

Log April 16

We will try to rerun the jobs that failed last weekend. This will be something like 100 jobs running through the weekend. Tony will inform people when data should start to show up.

We will give back to IT about 100 CPUs today as we do not anticipate requiring them for DC04 anymore. Tony believes we can get about 10M more events through the system by the end of April using 2-300 CPU's. This data is at CERN and can hopefully be built into runnable datasets in the next days.
(This would mean we eventually get about 15M events fully through the system in the DC04 period, about 1/3 of what we originally planned, but not too bad an achievement)

We will also return some of our disk buffers. For starters we can return one from the gdb. We ask the SRM and SRB EB teams to propose a route for us to liberate 1/4 disk servers from each. Clearly this will require some thought to achieve this. The ball is in the EB managers court to tell us when this can be done.

CNAF reported a Castor stager problem that they are recovering from but may impact the start up of transfers to CNAF

Nicola reported that some 200 analysis jobs ran last night. We look forward to a report on them. When the CNAF Castor problems are solved we would like to see a timestamp analysis showing the realtime analysis performance. Tony points out that a key feature to get the analysis turnround optimized will be to optimize the export buffer file selection algorithms; to move away from random selections to ones that ensure that datasets (or filesets required by an analysis job) get completed more quickly.

Lassi reported that a new COBRA/ORCA will be available (Stephan notes it is available now) There are no ORCA changes but some COBRA improvements to allow location independent virgin metadata catalogs.

 

Reconstruction Activities

Jobs Successfully submitted to Tier-0.

Problem with Infrastructure lead to multiple submissions of same job.

Should be solved today

 

Transfer and Data Management Activities

SRM and SRB Export Buffers available.

  • End-to-end tests proceeding
  • RM for SRM dCache EB should be available this week
  • Validation sample should be put in TMDB

Discussion about when to update RLS

  • Tentative decision to use an agent in the GDB to update RLS and TMDB

 

   

Analysis Activities Waiting for Events at processing centers. Updates should be posted to the list

Facility Activities

Hardware Identified

Hardware Requirements from Facilities Groups

CERN Tier-0 (Equipment status as of Feb 5)

  • cmsdc04 Login aavailable (Contact Werner for password)
  • Login and daemons systems available before end of week on lxgate04.cern.ch
  • 5 Dedicated P4 Xeon nodes with 2.4GHz 1GB memory 100BaseT
  • 2 1.3TB Disk Servers one for General Distribution Buffer (GDN) and one for SRB Export Buffer (EB)
  • RLS ready
  • Transfer Management database on ORACLE OK

FNAL Tier-1 (Equipment status as of Feb 2)

  • 622Mpbs to CERN with 250Mbps sustained
  • 2 4TB RAID devices for Import Buffers (IB). Configured with dCache migrates to Enstore
  • 1 Dual CPU P4 Xeon for SRM server
  • 1 dual CPU P3 system for Tier1 Agent
  • 10 dual CPU Xeon systems available analysis
  • 1 dual CPU Xeon system for analysis job submission
  • Tape allocated 50TB

PIC Tier-1 (Equipment status as of February 3)

  • 400Mbps Network to CERN, but 80Mb sustained
  • 2-4 TB of Input Buffer
  • 80 2.8GHz Xeon CPUs with Gigabit networking and 1GB RAM
  • 10TB of disk (20TB possible)

GridKa Tier-1 (Equipment status as of February 3 )

  • 1Gbit to CERN network shared
  • 50 CPUs available for challenge
  • 5TB input buffer

UK Tier-1 (Equipment status as of January 23)

  • 1Gb Link to CERN shared
  • 8TB of Input Buffer
  • 50TB Tape Space
  • 60 CPU Farm for analysis

IN2P3 Tier-1 (Equipment status as of February 5)

  • 2TB input buffer (in a 4TB shared system)
  • 70TB Tape available

CNAF Tier-1 (Equipment status as of February 6)

  • 1GB Link to CERN (500Mbps seen sustained)
  • 8TB of disk now (Potentially 20TB additional for DC04)
  • 70 dual Xeon systems for analysis