Offline Checklist

The goal of the offline checklist is to monitor the data quality promptly as the production processing wends it way from the online tape to full reconstruction. Data problems are to be reported ASAP to the processing overseer, (currently Erez Etzion) or one of the production processing crew (Richard Dubois, Joe Perl, Gary Bower, and Karen Heidenreich) as appropriate.

That follows may seem to be a checklist of numbers to compare to standards and make a simple binary decision on data quality. Rather you should use the detailed questions to get yourself oriented and warmed up to use your physics skills to think about the run output and look for problems. If data quality was just a matter of some binary decisions based on some numbers we would just have the computer do it and we wouldn't need you!

You are responsible for any stages of run processing that finish during your shift period. The easiest way to determine this is to use the links from the checklist bellow. In the date range on the "Links" page enter your shift days and it will list all processing that has finished during that range. Thus, if your shift runs from the 3rd through the 5th you should have a final look on the 6th since the range is midnight to midnight California time. If some of the runs are 'catch-up' processing on older missed runs, concentrate on the runs from the day/previous day that are most time critical to report on.


Readme file

See the OFFSITE README file for the latest temporary kludges to the monitoring. It will be updated regularly as problems come and go.

Processing Stages

Once the end-run button is pressed in the online, the acquisition (ACQ) data tape is released for further processing in the SLACVX cluster.

The SLDPM server is notified that the tape is ready for processing. It copies the tape to disk, followed by running a PASS1 filter, followed by a full reconstruction and PASS2 filter.


Relevant Pages from the STATUS DISPLAY System

Page 12: Offline
gives details on the last run processed.
Page 14: Offline Zs
includes plot of the number of SLD Reconstructed Zs.
page 15: Processing
gives list offline jobs currently queued or running.

Tape copy (ACQCOPY)

This stage performs two functions. It copies the input tape to disk for effective access in subsequent steps. The second function is to monitor the trigger records for inconsistencies. The run monitor is notified if any are found.

ACQCOPY Duties

The only duty here is to determine whether trigger errors ( example ) have been flagged, and notify the run coordinator if there have.

For each run there is a file, called ASrun# SDATCHK ( example ). Look at this file to see if any trigger errors have been flagged. These are the ones to notify the run coordinator of.

Also examine ASrun#.STATS ( example ) to see if any subsystem data is missing.

Do it (Check tape copy/trigger)


Filter (Pass 1) duties

Do it (Check Filter, pass1 reconstruction)


Recon (pass 2) duties

Do it (Check pass 2 Reconstruction )


ZCHKDAY

This stage runs ZXFIND on all the PASS2 output events for a given day. It looks at the primary vertex position, gamma conversions and K-shorts, and track muliplicities and VXD linking efficiency. Default parameters are used in ZXFIND.

Output diagnostics from this stage include statistics files ( example ) :

And Histograms
Links to ZCHKDAY Output
creates a table of links from which you can browse statistics and histogram files from the previous day's ZCHKDAY job.


Unanalyzed Runs
Unanalyzed Runs
shows information on runs never seen by the offline, runs currently being analyzed and runs with analysis in suspended state. The SLC estimate for the Z count is given.


Offline Shift Report

An example.
Quantities that need to be verified are: The last three items, plus an explicit overall evaluation, are asked for in the report. Other problems should be reported in the comments section.
 
Generate an Offline Shift Report
prompts you for some basic information then generates the report form.
Old Offline Shift Reports


Erez Etzion
Last Updated: