Skip to content

Latest commit

 

History

History
59 lines (49 loc) · 2.9 KB

run-2-data-flow.md

File metadata and controls

59 lines (49 loc) · 2.9 KB
layout title subtitle minutes
page
First Steps in LHCb
Changes to the data flow in Run 2
10

Learning Objectives {.objectives}

  • Understand how the LHCb data flow differs between Run 1 and Run 2

Run 1 of the LHC ran from 2010 to the end of 2012. Run 2 began in the middle of 2015 and is scheduled to continue until 2019. During Run 1 the LHC provided proton-proton collisions at a centre-of-mass energy of 7 and 8 TeV, and in Run 2 the energy increases to 13 TeV. With an increase in energy comes an increase in production cross-sections, and so a much higher rate of interesting events.

So now we have many more events that we would like to store, but as ever we're limited by computing resources. To help overcome this, LHCb introduced the Turbo stream in 2015, whereby the selection of candidates made in the second stage of the high level trigger, HLT2, is saved to disk and used directly by analysts, with no further offline reconstruction by Brunel.

Omitting the offline reconstruction is usually a Bad Idea because it's usually of a much better quality than the reconstruction performed in the HLT (the ‘online reconstruction’) as there's more time to run it. But, thanks an enormous effort improving the reconstruction software both online and offline in between Run 1 and Run 2, the two reconstructions now perform identically. This means if HLT2 performs all the reconstruction you need in your analysis, there's no need to wait for the offline reconstruction to run! This saves a lot of time (it's Turbo, after all), and hence money).

This saves time and money, but you still have many more events to save. To overcome this, events saved to the Turbo stream contain only the candidates that were reconstructed in the trigger. That is to say, any tracks or detector responses that don't form part of the decay that the trigger lines uses to evaluate its selection is thrown away. This is quite pragmatic, because a lot of analyses don't use this information anyway.

The Turbo stream runs in parallel with the regular data flow, and so everything now looks like this:

"The flow of real and simulated data during Run 2 of the LHC"

The change here is the addition of the Turbo stream in the storage output of the trigger. This stream cannot be re-reconstructed because the information needed to do that is thrown away to save disk space.

From our point of view, though, this doesn't change things much because we can run DaVinci over the Turbo stream in exactly the same manner as for the stripping output. We will just need to look in a different place to find the selection definitions, this time for trigger lines rather than stripping lines.

It's best to ask the trigger coordinator in your working group if your analysis has a trigger line that outputs to the Turbo stream, and if so where to find the selection definitions.