Welcome to FDR I Jamboree ANL local Wiki
Introduction
This is a wiki page intended for the use of the ANL participants of the BNL/ANL FDR Jamboree (17-20 March). In addition to providing information relevant to learning how to run on FDR I data, you can post your own questions, comments and working examples here.
Note that much of the following information is specific to the ANL ASC.
What is FDR?
- There are some Grid problems on atlas16: grid-proxy-init (and pathena) work, but other Grid commands do not. There may be other problems, so it is recommended that you use atlas17 and atlas18 for now.
Local data sets
There are some FDR data sets available locally at the ANL ASC:
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003050.StreamEgamma.merge.AOD.o1_r6_t1._0001.1
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003050.StreamEgamma.merge.AOD.o1_r6_t1._0002.1
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003051.StreamEgamma.merge.AOD.o1_r6_t1._0001.1
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003051.StreamEgamma.merge.AOD.o1_r6_t1._0002.1
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0001.2
- /data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0002.1
The Pool catalog corresponding to these files is in the same directory:
- /data/nas2/users/ryoshida/fdr08_run1/PoolFileCatalog.xml
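To check which files a Pool catalog actually registers, you can read the XML directly. The following is a minimal sketch, assuming the usual catalog layout of File elements that carry an ID and contain pfn entries; the sample text, its GUID, and the function name catalog_entries are made up for illustration, and the real PoolFileCatalog.xml may contain additional attributes.

```python
import xml.etree.ElementTree as ET

# Illustrative catalog text. The element layout is an assumption about the
# Pool XML catalog format; the GUID below is invented for this example.
SAMPLE_CATALOG = """<POOLFILECATALOG>
  <File ID="00000000-0000-0000-0000-000000000001">
    <physical>
      <pfn filetype="ROOT_All" name="/data/nas2/users/ryoshida/fdr08_run1/fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0001.2"/>
    </physical>
    <logical>
      <lfn name="fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0001.2"/>
    </logical>
  </File>
</POOLFILECATALOG>
"""


def catalog_entries(xml_text):
    """Return a list of (guid, physical file name) pairs from catalog text."""
    root = ET.fromstring(xml_text)
    entries = []
    for file_node in root.iter("File"):
        guid = file_node.get("ID")
        for pfn in file_node.iter("pfn"):
            entries.append((guid, pfn.get("name")))
    return entries


if __name__ == "__main__":
    for guid, pfn in catalog_entries(SAMPLE_CATALOG):
        print("%s %s" % (guid, pfn))
```

To inspect the real catalog, read the file contents and pass them to catalog_entries in place of SAMPLE_CATALOG.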
David Malon has created TAG files for FDR. They are in /users/malon/FDR1Tags/:
- /users/malon/FDR1Tags/MinBias_r12.root
- /users/malon/FDR1Tags/StreamEgamma_r12.root
- /users/malon/FDR1Tags/StreamMuon_r12.root
- /users/malon/FDR1Tags/StreamJet_r12.root
Getting Started
These are the instructions for a "cold" start in analyzing FDR I AODs. If you are not starting from scratch, you should be able to start somewhere in the middle of these instructions.
- Set up your account according to the instructions here. Version 13.0.40 is recommended for FDR I analysis.
- Install the Athena version 13.0.40 software in your area (and try running HelloWorld) according to the instructions here. The tag-collector tag for the UserAnalysis package in 13.0.40 is 00-10-12.
- Check that you can run the example AnalysisSkeleton on a Monte Carlo AOD using v13. The general instructions are the same as at CERN and can be found here. You can use the data file /users/ryoshida/testarea/data/AOD.015240._00001.pool.root.1 as your AOD input. This is a v13 MC sample of Z->ee.
- If you'd like, you can try modifying the default AnalysisSkeleton code to make new plots (or whatever you want). An example can be found in /users/ryoshida/testarea/13.0.40/PhysicsAnalysis/AnalysisCommon/UserAnalysis/src/AnalysisSkeleton.cxx.mod1 and ../UserAnalysis/AnalysisSkeleton.h.mod1. If you want to try these, copy them into the equivalent directories in your area as AnalysisSkeleton.cxx and AnalysisSkeleton.h, respectively, and:
>cd ../cmt
>source setup.sh
>gmake
>cd ../run
>athena AnalysisSkeleton_topOptions.py
This should produce an extra histogram (a mass plot showing a Z peak) in your aan.root file.
- Now you need to remove references to MC truth quantities inside the AnalysisSkeleton job in order to read the FDR I data. You can find an example in /users/ryoshida/testarea/13.0.40/PhysicsAnalysis/AnalysisCommon/UserAnalysis/src/AnalysisSkeleton.cxx.mod2. You will also need to modify AnalysisSkeleton_topOptions.py so that it does not refer to MC quantities and so that it reads the FDR I data (listed above).
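Since job options are Python, the list of FDR I input files can be built from the file names instead of being typed path by path. The following is only a sketch: the helper names select_run and fdr_aod_files are hypothetical, and the name pattern is inferred from the local files listed above (run number zero-padded to seven digits). It avoids newer Python syntax because the Python bundled with 13.0.40 is old.

```python
import fnmatch
import os


def select_run(file_names, run_number):
    """Return the sorted FDR I AOD file names that belong to one run."""
    # Pattern inferred from names such as
    # fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0001.2
    pattern = "fdr08_run1.%07d.*.AOD.*" % run_number
    return sorted(fnmatch.filter(file_names, pattern))


def fdr_aod_files(directory, run_number):
    """Return full paths of the AOD files for one run found in a directory."""
    names = select_run(os.listdir(directory), run_number)
    return [os.path.join(directory, name) for name in names]


# Inside AnalysisSkeleton_topOptions.py (where ServiceMgr is defined by
# athena) the result could be used like this:
# ServiceMgr.EventSelector.InputCollections = fdr_aod_files(
#     "/data/nas2/users/ryoshida/fdr08_run1", 3070)
```

The assignment is left commented out because ServiceMgr only exists when the file is run by athena.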
Running on FDR I data on the grid using pathena
The following should work if you were able to get the "Getting Started" part above to work.
- cd ~/testarea/13.0.40/
- cmt co PhysicsAnalysis/DistributedAnalysis/PandaTools
- cd PhysicsAnalysis/DistributedAnalysis/PandaTools/cmt
- cmt config
- source setup.sh
- gmake
Now:
- cd ~/testarea/13.0.40/PhysicsAnalysis/AnalysisCommon/UserAnalysis/run
- source ../cmt/setup.sh
- export PATHENA_GRID_SETUP_SH=/grid/vdt/osg/setup.sh
- pathena --inDS data_you_want_to_analyze.AOD.nnnn --outDS your_output_ntuple_name AnalysisSkeleton_topOptions.py
The following will send the job to UChicago (encouraged).
- pathena --inDS data_you_want_to_analyze.AOD.nnnn --outDS your_output_ntuple_name --site=ANALY_MWT2 AnalysisSkeleton_topOptions.py
If AnalysisSkeleton_topOptions.py runs interactively, pathena will now run the job on the Grid. You will eventually get an email about the completion (or failure) of your job. Instructions on how to monitor the job, retrieve the output, etc. can be found here.
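If you submit several data sets, it can help to assemble the pathena command line in a small script. The sketch below only builds the argument list from the options shown above (--inDS, --outDS, --site) and prints it; the function name pathena_command is hypothetical and the data set names are the same placeholders as above.

```python
def pathena_command(in_ds, out_ds, job_options, site=None):
    """Build the pathena argument list for one submission."""
    cmd = ["pathena", "--inDS", in_ds, "--outDS", out_ds]
    if site is not None:
        # For example site="ANALY_MWT2" to send the job to UChicago.
        cmd.append("--site=" + site)
    cmd.append(job_options)
    return cmd


if __name__ == "__main__":
    cmd = pathena_command(
        "data_you_want_to_analyze.AOD.nnnn",
        "your_output_ntuple_name",
        "AnalysisSkeleton_topOptions.py",
        site="ANALY_MWT2",
    )
    # Print the command for inspection; run it from the shell in which
    # the cmt and Grid setup scripts have already been sourced.
    print(" ".join(cmd))
```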
Running over FDR data locally using TAGs
TAGs are used for event selection. The main reference for TAGs is here. There is further discussion of their use here and here.
In the following example, the same job as in the "introduction" is run via the TAG files, with a selection on the TAG attributes.
In AnalysisSkeleton_topOptions.py:
- Remove the line ServiceMgr.EventSelector.InputCollections=["your_data_file.AOD.pool.root"]
- Add the following five lines, which, in order:
  - supply a tag file as input in place of an AOD file;
  - let the EventSelector know that the input is a tag file;
  - tell the EventSelector which reference to use (tags can also navigate to ESD and RAW);
  - specify a filter predicate;
  - add a "ReadCatalog" in which the AOD files pointed to by the tags have been registered.
ServiceMgr.EventSelector.InputCollections = [ "/users/malon/FDR1Tags/StreamEgamma_r12.root" ]
ServiceMgr.EventSelector.CollectionType = "ExplicitROOT"
ServiceMgr.EventSelector.RefName = "StreamAOD"
ServiceMgr.EventSelector.Query = "RunNumber==003070 && NLooseElectron>1"
ServiceMgr.PoolSvc.ReadCatalog += [ "xmlcatalog_file:/data/nas2/users/ryoshida/fdr08_run1/PoolFileCatalog.xml" ]
- And run locally as before.
An example can be found in /users/ryoshida/testarea/13.0.40/PhysicsAnalysis/AnalysisCommon/UserAnalysis/run/AnalysisSkeleton_topOptions_tags.py as well as in /users/malon/testarea/13.0.40/PhysicsAnalysis/AnalysisCommon/UserAnalysis/run/AnalysisSkeleton_topOptions.py.
Note that the TAGs, for technical reasons, are available for the "r12" AODs and not for the "r6" ones. Therefore, of the data in /data/nas2/users/ryoshida/fdr08_run1/, only run 3070 can be selected using the TAG files.
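The restriction to run 3070 can be read off the file names. The sketch below assumes the names follow the pattern project.run.stream.merge.AOD.tags._file.attempt, which is inferred from the local files listed on this page rather than taken from a naming specification; the function name runs_with_tag is hypothetical.

```python
import re

# File names of the local FDR I AODs listed on this page.
LOCAL_AODS = [
    "fdr08_run1.0003050.StreamEgamma.merge.AOD.o1_r6_t1._0001.1",
    "fdr08_run1.0003050.StreamEgamma.merge.AOD.o1_r6_t1._0002.1",
    "fdr08_run1.0003051.StreamEgamma.merge.AOD.o1_r6_t1._0001.1",
    "fdr08_run1.0003051.StreamEgamma.merge.AOD.o1_r6_t1._0002.1",
    "fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0001.2",
    "fdr08_run1.0003070.StreamEgamma.merge.AOD.o1_r12_t1._0002.1",
]

# Assumed name layout, inferred from the list above.
NAME_RE = re.compile(
    r"^fdr08_run1\.(?P<run>\d+)\.(?P<stream>\w+)\.merge\.AOD"
    r"\.(?P<tags>\w+)\._\d+\.\d+$"
)


def runs_with_tag(file_names, tag="r12"):
    """Return the sorted run numbers whose files carry the given tag."""
    runs = set()
    for name in file_names:
        match = NAME_RE.match(name)
        if match and tag in match.group("tags").split("_"):
            runs.add(int(match.group("run")))
    return sorted(runs)


if __name__ == "__main__":
    print(runs_with_tag(LOCAL_AODS, "r12"))  # [3070]
    print(runs_with_tag(LOCAL_AODS, "r6"))   # [3050, 3051]
```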
Working examples
Jamboree attendees, please add your working examples here:
Running Akira's DPD Tutorial at ANL:
Check out TopPhysDPDMaker:
cmt co -r TopPhysDPDMaker-00-00-10 PhysicsAnalysis/TopPhys/TopPhysDPDMaker
Compile it:
cd PhysicsAnalysis/TopPhys/TopPhysDPDMaker/cmt; cmt make
Make sure you change the input file in ElectroweakD2PD_topOptions.py to one of the files in:
/data/nas2/users/ryoshida/fdr08_run1/
(e.g. fdr08_run1.0003050.StreamEgamma.merge.AOD.o1_r6_t1._0001.1)
Then copy the Python files from InstallArea to the run directory:
cp ~/testarea/13.0.40/InstallArea/jobOptions/TopPhysDPDMaker/*py run/
Then change lines 55 & 58 of ElectroweakD2PD_topOptions.py so that they include these Python files from the run directory. Finally, set up some environment variables:
source ~anelson/testarea/set.sh
It should then run:
athena ElectroweakD2PD_topOptions.py
Comments and Questions
Please post your comments and questions here: