Add various utility functions for LCIO objects #150

dudarboh · 2022-08-31T15:05:56Z

BEGINRELEASENOTES

Added a utility function to calculate Track momentum based on its track parameters and magnetic field
Added methods to the LCRelationNavigator that extract the highest weight with an option to indicate weight encoding type ("track"/"cluster").
Added a utility function to get an index of a provided object inside a given LCCollection
Added a utility function to return a leading track from ReconstructedParticle in case it has multiple tracks attached.
ENDRELEASENOTES

TL;DR:
These are the utility functions i often use in my code and I feel like they might be useful for others.
Here is more detailed overview on them:

a. Track momentum: This is very often implemented by hand in many places.

In some cases it is done through the HelixClass::Initialize_Canonical, which, in principle has the same implementation, however it forces you to build a Helix object, even if you just need a momentum and you need to copy 5 track parameters, which most of the time makes 5 lines instead of one. Also Helix class needs a capital cleanup, due to some issues, much makes it not a good place to extract momentum in a long term run. In my opinion it fits the best together with LCIO track.

b. LCRelationNavigator getMaxWeightObject:

This potentially helpful to avoid bugs, where people assume the objects are sorted from highest to lowest weight in the first place, e.g.: lcfiplus/LCFIPlus#64.

Also, having an option to sort assuming track/cluster weight encoding (which is used in "RecoMCTruthLink" relations) might raise awareness of people, why weights are not normalized and prevent unwanted behaviour, if one wasn't aware of documentation here.

c. Getting index of an element inside a collection.

It is helpful if one needs to access an object from a similar collection which doesn't have relation links:
E.g.
MCCollection MarlinTrkTracks FixedTracksCollection OldTracksCollection
MCParticle <-> Track (0x7f0b08) # 13 <-> # 13 <-> # 13

If there is no LCRelations written for new collections, then we need a track index (13) to compare objects one-by-one

d. Getting a leading track from the PFO.
It might be a more consistent thing to do for PFOs with more than a single track, than just taking the first element in the array.

tmadlener

Looks good to me overall. I have a few comments / questions inline.

src/cpp/include/UTIL/ReconstructedParticleTools.h

src/cpp/include/UTIL/TrackTools.h

tmadlener · 2022-09-01T09:58:21Z

src/cpp/src/UTIL/LCRelationNavigator.cc

+      std::function<float(float)> getWeight;
+      if (weightType == "track") getWeight = [](float encoding){ return (int(encoding)%10000)/1000.; };
+      else if (weightType == "cluster") getWeight = [](float encoding){ return (int(encoding)/10000)/1000.; };
+      else {
+        getWeight = [](float encoding){ return encoding; };
+      }


This could potentially be solved without having to use an std::function with an immediately invoked function expression (IIFE), something like

const auto getWeight = [&weightType](){ if (weightType == "track") { return [](float encoding) { return (int(encoding)%10000)/1000.; }; } // ... }();

This doesn't work.

I think each lambda inside if returns its unique type, so in the end it returns an error, not knowing which type to assign to the final lambda:

error: inconsistent types ‘UTIL::LCRelationNavigator::getRelatedToMaxWeightObject(EVENT::LCObject*, const string&) const::<lambda()>::<lambda(float)>’ and ‘UTIL::LCRelationNavigator::getRelatedToMaxWeightObject(EVENT::LCObject*, const string&) const::<lambda()>::<lambda(float)>’ deduced for lambda return type return [](float encoding){ return (int(encoding)/10000)/1000.; };

Ah right, forgot about the different types of the lambdas... In that case I think we go with the std::function (and it's small overhead), but in that case could we make the default usage more prominent, i.e.:

Suggested change

std::function<float(float)> getWeight;

if (weightType == "track") getWeight = [](float encoding){ return (int(encoding)%10000)/1000.; };

else if (weightType == "cluster") getWeight = [](float encoding){ return (int(encoding)/10000)/1000.; };

else {

getWeight = [](float encoding){ return encoding; };

}

std::function<float(float)> getWeight = [](float encoding) { return encoding };

if (weightType == "track") getWeight = [](float encoding){ return (int(encoding)%10000)/1000.; };

else if (weightType == "cluster") getWeight = [](float encoding){ return (int(encoding)/10000)/1000.; };

As a (more or less) purely academic aside (and for your information), The IIFE could be made to work with the following little trick:

const auto Lambda = [](int a, int b) { return a < b; }; // Lambda will be some compiler named type const auto FuncPtr = +[](int a, int b) { return a < b; } // The lambda will be implicitly converted to // a function pointer and FuncPtr will be a // bool (* const)(int, int)

By converting the lambda into a function pointer via the unary operator+ we would be able to use the IIFE again. However, given the context we are doing this in, I think it is better to go with the more legible std::function approach.

I think then I leave the std::function out and simply do e.g.:

Suggested change

std::function<float(float)> getWeight;

if (weightType == "track") getWeight = [](float encoding){ return (int(encoding)%10000)/1000.; };

else if (weightType == "cluster") getWeight = [](float encoding){ return (int(encoding)/10000)/1000.; };

else {

getWeight = [](float encoding){ return encoding; };

}

float maxWeight = *std::max_element(weights.begin(), weights.end(), [](float a, float b){return a < b ;};

if (weightType == "track") maxWeight = *std::max_element(weights.begin(), weights.end(), [](float a, float b){return (int(a)%10000)/1000. < (int(b)%10000)/1000.;});

else if (weightType == "cluster") maxWeight = *std::max_element(weights.begin(), weights.end(), [](float a, float b){return (int(a)/10000)/1000. < (int(b)/10000)/1000. ;});

return maxWeight;

what do you think?

src/cpp/src/UTIL/LCRelationNavigator.cc

tmadlener · 2022-09-01T10:01:29Z

src/cpp/include/UTIL/LCRelationNavigator.h

+    const EVENT::FloatVec & getRelatedFromWeights(EVENT::LCObject * to) const ;
+
+    /** Object with a highest weight that the given from-object is related to.
+     *  LCObject is of type getToType().
+     */
+    const EVENT::LCObject* getRelatedToMaxWeightObject(EVENT::LCObject* from, std::string weightType) const ;
+
+    /** From-object related to the given object with a highest weight (the inverse relationship).
+     *  LCObject is of type getFromType().
+     */
+    const EVENT::LCObject* getRelatedFromMaxWeightObject(EVENT::LCObject* to, std::string weightType) const ;
+
+    /** The highest weight of the relations returned by a call to getRelatedToObjects(from).
+     * @see getRelatedToObjects
+     */
+    float getRelatedToMaxWeight(EVENT::LCObject* from, std::string weightType) const ;
+
+    /** The highest weight of the relations returned by a call to getRelatedFromObjects(to). 
+     * @see getRelatedFromObjects
+     */
+    float getRelatedFromMaxWeight(EVENT::LCObject* to, std::string weightType) const ;


Are the weights and the objects commonly used together? I could imagine there is a use case for having only the object, but is there also a use case for only the weight?

I would say weights and object must always be used together.

You always want to extract an object with a certain weight.
You might want the weight value as well to require certain cut, e.g.: accept only MCParticles which contributed >90% of the energy to this particular PFO.

I would much prefer here the use of std::pair, to return both object and weight in a single method, but I decided to be consistent with already existing code style, which has separate function to return weights and objects.

In that case I would argue that having either a std::pair return value or even a

struct RelObject { Event::LCObject* object; float weight; };

(which should be usable like)

const auto [obj, w] = getRelatedFromMaxWeigth(to, "");

should be the main implementation and if you want to keep the compatibility with the existing code base, you add simple wrappers around this to return only part of the result.

I mean,
maxWeight or maxWeightObject can be used separately, so I don't think it makes sense to change them to return a std::pair. If you know that this object already has a max weight (because thats what method returns), you don't need a weight value already.
In the last message, when i said object must always be used together exactly for the purpose to find the maxWeight object or leastWeight object, etc. If you take a simply object, ignoring its weight, it doesn't really make sense.
But this would require complete overhaul how do we store them. I wont go that deep here

src/cpp/include/UTIL/TrackTools.h

dudarboh · 2022-09-02T12:49:12Z

@tmadlener

I have moved all implementations to the .cc files, so compiler sees them. And I have added CLHEP dependency to the LCIO.

I squashed some of your comments fixes into existing commits, so it doesn't look like a mess, please take a look.

src/cpp/include/UTIL/LCCollectionTools.h

src/cpp/src/UTIL/LCRelationNavigator.cc

gaede

Hi @dudarboh, we do not want to introduce a mandatory dependence on CLHEP.
As far as I have seen, you only need this for three vectors - this should be easy to solve with either a simple ThreeVector class in UTIL or using arrays...

dudarboh · 2022-09-08T12:14:47Z

Hi @gaede,

Dont we have CLHEP dependencies already here?

LCIO/src/cpp/include/UTIL/LCFourVector.h

Lines 6 to 11 in a7b74eb

    
           #include "CLHEP/Vector/LorentzVector.h" 
        
           // CLHEP 1.9 and higher introduce a namespace: 
        
           namespace CLHEP{}   
        
           using namespace CLHEP ;

not sure if it works, because currently we don't have CLHEP dependency inside CMakeLists.txt, however it compiles, because the header file isn't used anywhere.

We also have a test for this LCFourVector, and CMakeLists.txt for test has a needed CLHEP dependency:

LCIO/tests/CMakeLists.txt

Lines 296 to 302 in a7b74eb

    
           FIND_PACKAGE( CLHEP QUIET ) 
        
           IF( CLHEP_FOUND ) 
        
               INCLUDE_DIRECTORIES( ${CLHEP_INCLUDE_DIRS} )  
        
               ADD_LCIO_TEST( test_fourvector)  
        
           ELSE() 
        
               MESSAGE( STATUS "cannot build test_fourvector: clhep not found" ) 
        
           ENDIF( CLHEP_FOUND )

If we strongly don't want to introduce CLHEP dependency, then should we also remove LCFourVector with its test for the consistency?

EDIT: I missed the word "mandatory", was that the intended design that one could compile LCIO w/o CLHEP dependency, but then just couldn't use LCFourVector?

gaede · 2022-09-08T12:51:51Z

EDIT: I missed the word "mandatory", was that the intended design that one could compile LCIO w/o CLHEP dependency, but then just couldn't use LCFourVector?

Yes, exactly, we want LCIO to compile w/o CLHEP. The LCFourVector is a historical leftover and it can be used in downstream code that actually has CLHEP but does not introduce a mandatory CLHEP dependency...

src/cpp/include/UTIL/ReconstructedParticleTools.h

src/cpp/src/UTIL/ReconstructedParticleTools.cc

dudarboh · 2022-10-17T11:52:10Z

@tmadlener, how do I fix this for macOS?

tmadlener · 2022-10-17T13:18:11Z

src/cpp/src/UTIL/TrackTools.cc

+    std::array<double, 3> getTrackMomentum(const EVENT::TrackState* ts, double bz){
+        std::array<double, 3> mom;
+        double omega = ts->getOmega();
+        double phi = ts->getPhi();
+        double tanL = ts->getTanLambda();
+        double c_light = 299.792458; // mm/ns
+        double pt = (1e-6 * c_light * bz) / std::abs(omega);
+        mom[0] = pt*std::cos(phi);
+        mom[1] = pt*std::sin(phi);    
+        mom[2] = pt*tanL;
+        return mom;
+    }


This could be a templated function, right? On a first glance this seems to be the identical code:

something like

template <typename TrackLikeT> std::array<double, 3> getTrackMomentum(const TrackLikeT* track, double bz) { // ... implementation }

Additionally, I think you could save creating an array explicitly, by doing

return {pt * std::cos(phi), pt * std::sin(phi), pt * tanL};

The implementation will then have to move to the header

How do I test then, whether it ~~compiles~~ works?

By adding a test ;)

https://github.com/iLCSoft/LCIO/tree/master/src/cpp/src/TESTS

has a test_tracks.cc and also a test_trackstate.cc. They seem to already create a few tracks and track states, so you should be able to check whether your new utilities actually behave as they should. Essentially the MYTEST calls in the second half are what you are looking for, they work like

MYTEST(/*actual*/, /*expected*/, /*name of thing that is compared*/);

as long as there is an implementation of operator== for the thing that you try to compare this should work. You can in principle also add the getLeadingTrack to one of the other tests, if you find a reconstructed particle with multiple tracks attached.

I have some troubles. I feel like it also needs << operator overload... Which std::array doesn't have.
I can compare absolute values though..

And I can't used std::hypot(px, py, pz) (since C++17)?
LCIO is still not C++17?

I feel like it also needs << operator overload... Which std::array doesn't have.

Ah you are right, I missed that, sorry. In this case, that would probably need to go to the TESTS/tutils.h header (if you want std::array outputs (with a semi-fancy version below).

template<typename T, size_t N> std::ostream& operator<<(std::ostream& os, const std::array<T, N>& arr) { if constexpr (N == 0) { return os << "[]"; } os << "[" << arr[0]; for (size_t i = 1; i < N; ++i) { os << ", " << arr[i]; } return os << "]"; }

And I can't used std::hypot(px, py, pz) (since C++17)?

std::hypot should in principle be available, as all CI workflows actually use CMAKE_CXX_STANDARD=17. Not entirely sure why you are running into problems here tbh. I will have to check.

Additionally, it looks like the values that are filled into the tracks/track states do not really make sense from a physics point of view. They produce NaNs when trying to calculate the momenta from them:

https://github.com/iLCSoft/LCIO/actions/runs/3267033539/jobs/5371642760#step:4:17906

tmadlener · 2022-10-17T14:36:08Z

@tmadlener, how do I fix this for macOS?

You don't ;) I still have to remove the macOS workflow here, since github has removed runners with an appropriate versions. See, e.g. AIDASoft/run-lcg-view#3

src/cpp/src/TESTS/test_tracks.cc

…ker way to extract an object with the highest weight

…llection

…Particle

treatment for null curvature and add test for the getTrackMomentum

tmadlener reviewed Sep 1, 2022

View reviewed changes

dudarboh force-pushed the add_utilities branch from fdbd6b7 to 8ad8876 Compare September 2, 2022 12:45

tmadlener reviewed Sep 2, 2022

View reviewed changes

src/cpp/include/UTIL/LCCollectionTools.h Outdated Show resolved Hide resolved

src/cpp/src/UTIL/LCRelationNavigator.cc Outdated Show resolved Hide resolved

gaede reviewed Sep 8, 2022

View reviewed changes

dudarboh force-pushed the add_utilities branch from 8ad8876 to 1e46059 Compare September 12, 2022 13:21

tmadlener reviewed Sep 15, 2022

View reviewed changes

src/cpp/include/UTIL/ReconstructedParticleTools.h Outdated Show resolved Hide resolved

tmadlener reviewed Sep 19, 2022

View reviewed changes

src/cpp/include/UTIL/ReconstructedParticleTools.h Outdated Show resolved Hide resolved

tmadlener reviewed Sep 23, 2022

View reviewed changes

src/cpp/src/UTIL/ReconstructedParticleTools.cc Outdated Show resolved Hide resolved

tmadlener reviewed Oct 17, 2022

View reviewed changes

dudarboh force-pushed the add_utilities branch from 4cb3d67 to 6571b3f Compare October 18, 2022 16:08

tmadlener reviewed Oct 19, 2022

View reviewed changes

src/cpp/src/TESTS/test_tracks.cc Outdated Show resolved Hide resolved

tmadlener force-pushed the add_utilities branch from 27e852f to db6687f Compare October 19, 2022 09:56

dudarboh and others added 11 commits October 19, 2022 13:48

Add utility function to calculate tracks momentum

0345aa3

Added four utility methods to the LCRelationNavigator to allow a quic…

70b2700

…ker way to extract an object with the highest weight

Remove virtual

6c9e734

Add a utility function to return objects index inside a provided LCCo…

c8f7379

…llection

Add a utility function to return a leading track of the Reconstructed…

1f661af

…Particle

Clarify docstring

27dfad8

remove magnetic field dependance from getLeadingTrack

08710cf

Make getTrackMomentum a template to also cover TrackState use. Add

96dfc7d

treatment for null curvature and add test for the getTrackMomentum

Remove unnecessary duplicate tests and make comparison more robust

31ac1ca

Make definition visible before it is used

2c4297a

Move determination of max element into separate function

55dd651

tmadlener force-pushed the add_utilities branch from da00370 to 55dd651 Compare October 19, 2022 12:01

tmadlener approved these changes Oct 19, 2022

View reviewed changes

tmadlener merged commit 3048ede into iLCSoft:master Oct 19, 2022

dudarboh deleted the add_utilities branch October 19, 2022 13:21

tmadlener mentioned this pull request Oct 24, 2022

hot fix seg. fault when there is not enough hits to calculate the track length iLCSoft/MarlinReco#106

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add various utility functions for LCIO objects #150

Add various utility functions for LCIO objects #150

dudarboh commented Aug 31, 2022

tmadlener left a comment

tmadlener Sep 1, 2022

dudarboh Sep 1, 2022

tmadlener Sep 2, 2022

dudarboh Sep 2, 2022 •

edited

Loading

tmadlener Sep 1, 2022

dudarboh Sep 2, 2022

tmadlener Sep 2, 2022 •

edited

Loading

dudarboh Oct 17, 2022

dudarboh commented Sep 2, 2022

gaede left a comment

dudarboh commented Sep 8, 2022 •

edited

Loading

gaede commented Sep 8, 2022

dudarboh commented Oct 17, 2022

tmadlener Oct 17, 2022

tmadlener Oct 17, 2022

dudarboh Oct 17, 2022 •

edited

Loading

tmadlener Oct 17, 2022

dudarboh Oct 17, 2022

dudarboh Oct 17, 2022

tmadlener Oct 17, 2022

tmadlener commented Oct 17, 2022

Add various utility functions for LCIO objects #150

Add various utility functions for LCIO objects #150

Conversation

dudarboh commented Aug 31, 2022

tmadlener left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dudarboh Sep 2, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tmadlener Sep 2, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dudarboh commented Sep 2, 2022

gaede left a comment

Choose a reason for hiding this comment

dudarboh commented Sep 8, 2022 • edited Loading

gaede commented Sep 8, 2022

dudarboh commented Oct 17, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dudarboh Oct 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tmadlener commented Oct 17, 2022

dudarboh Sep 2, 2022 •

edited

Loading

tmadlener Sep 2, 2022 •

edited

Loading

dudarboh commented Sep 8, 2022 •

edited

Loading

dudarboh Oct 17, 2022 •

edited

Loading