Polymorphic allocators in DPL by mkrzewic · Pull Request #1514 · AliceO2Group/AliceO2

mkrzewic · 2018-11-28T16:41:06Z

marked as WIP since this is a proposal on how interfaces in DPL to handle STL containers with polymorphic allocators could look like. A proposed use case is in DPLmerger.cxx - it also finally got rid of the last instance of the use of the header:Stack freefn public API. Good riddance.

The use of pmr containers with appropriate allocators is meant to achieve zero copy data exchange in all cases supported by fairmq (including shmem and ofi) in a transparent standards compliant way.

ktf · 2018-11-29T11:30:33Z

Framework/Utils/include/Utils/Utils.h

I am worried about all this allocator business creeping to user code. How about we have an O2 specific smart pointer which hides all this? e.g.:

o2::ptr<std::vector<char>>

which internally holds a std::vector<char,o2::memory_resource::polymorphic_allocator<o2::byte>>

sounds a bit complicated, introduces yet another abstraction layer for something which is standard and introduces pointer semantics for vector (which already is a smart pointer of sorts). I think an alias would be enough, sth like std::pmr::vector.
I think it is not more complicated to remember to pass an additional argument to a constructor than to go through the logic of registering the data in a framework. We should give the users some credit ;)

sawenzel · 2018-11-29T14:11:44Z

@mkrzewic : Could you add some more explanation of what is the target use here? Is it to support shared memory transport in DPL? Or being able to adopt another vector or the like? Thanks.

mkrzewic · 2018-11-29T14:18:20Z

@sawenzel I updated the description. The point is to transparently be able to take advantage of shared memory/ofi transports without copying data while just using standard containers (currently only std::vector and std::string would make sense afaik, in the future - who knows :)). So if you use an o2::pmr::vector<int>(outputChannelAllocator) - it gives you all the benefits of the container (e.g. dynamic size, move semantics) in the most efficient way supported by the transport.

mkrzewic · 2018-11-29T14:22:06Z

also: when developing new containers we would only have to stick to the STL container requirements making the development standards-compliant and at the same time be "message-efficient" with no extra effort.

ktf · 2018-11-30T07:35:27Z

I think we need to discuss this next week. IMHO, esposing allocators is as bad as an idea as it can possibly get. Data own by the framework should be created via the framework.

…

On 29 Nov 2018, 15:23 +0100, Mikolaj Krzewicki ***@***.***>, wrote: also: when developing new containers we would only have to stick to the STL container requirements making the development standards-compliant and at the same time be "message-efficient" with no extra effort. — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

mkrzewic · 2018-11-30T09:07:20Z

this PR does not expose allocators (any more than they already were), see
o2::pmr::vector<T> makeVector(const Output& spec, Args&&... args)

as for the ownership - modern C++ practice is to keep ownership with the data since move semantics allow us to do this efficiently - this way we don't have to have magical entities to manage resource lifetimes - so in that respect I cannot disagree more with your statement.
Actually - the resource is managed by the framework that created it - FairMQ - since somewhere buried inside the (invisible) allocator there lives a FairMQMessagePtr.

ktf · 2018-12-04T13:24:19Z

You were exposing the allocator in the user code, as in "user has to know the allocator type", as far as I remember. This is indeed not the case anymore and using something like the o2::pmr::vector is indeed the direction I would prefer.

The advantage of having an o2::ptr<C<T>> or o2::ref<C<T>> (or whatever, the point is to have a wrapper<C<T>> kind of class) rather than an o2::pmr::container<T> is that the code is actually general enough you could use it for anything which is an AllocatorAwareContainer (with extra checks once we get to C++20 Concepts).

The reason for having the Framework / DPL to create the object, rather than doing it in vacuum is that there is no chance to mistake and create the wrong type of vector.

To me:

auto r = ctx.outputs().makeVector<Clusters>(Output("TPC/TRACKS/0"), 10);

is better than:

auto v = o2::vector<Clusters>{10};
auto r = ctx.outputs().adoptVector(Output("TPC/TRACKS/0"), v);

You expose less of the details and you keep everything in one place. If you really do not like the ctx.outputs(). we could hide that behind a static std::makeVector<> but when we discussed this the feedback was that people liked to see the ctx.outputs() as it makes clear you are creating something outgoing.

Apart from that, which we can keep for a later discussion, I would drop the pmr namespace from o2::pmr::vector since we do not have and we will not have any o2::vector to differentiate from (i.e. an o2::vector is the vector to use if you want to exchange it using o2).

mkrzewic · 2018-12-04T13:54:34Z

You were exposing the allocator in the user code, as in "user has to know the allocator type", as far as I remember. This is indeed not the case anymore and using something like the o2::pmr::vector is indeed the direction I would prefer.

it is the same, just different wording, user does have to know where the data is supposed to be going. Whether you call it allocator or output channel, I don't care, things can always be aliased with some dumbed down name.

The advantage of having an o2::ptr<C<T>> or o2::ref<C<T>> (or whatever, the point is to have a wrapper<C<T>> kind of class) rather than an o2::pmr::container<T> is that the code is actually general enough you could use it for anything which is an AllocatorAwareContainer (with extra checks once we get to C++20 Concepts).

I think this is superfluous, an allocator aware container already IS a wrapper around data, and has everything that is needed for efficient data transfers. No need to hide everything from users.

The reason for having the Framework / DPL to create the object, rather than doing it in vacuum is that there is no chance to mistake and create the wrong type of vector.

To me:
auto r = ctx.outputs().makeVector<Clusters>(Output("TPC/TRACKS/0"), 10);
is better than:
auto v = o2::vector<Clusters>{10};
auto r = ctx.outputs().adoptVector(Output("TPC/TRACKS/0"), v);

i'm not suggesting to do it in vacuum, just suggesting a more natural syntax (see previous post):
auto v = o2::vector<Clusters>{ctx.outputs().getTransportAllocator(), Args...} //make a vector suitable for output

this is just plain C++, no need to invent some additional syntax and logic for it.
In the pmr implementation the underlying message is managed by the container itself - so if you are happy with your data you return/move/send/whatever at the end, otherwise you drop it. No need to keep some state (the message) in the framework. So yes, you have to call something at the end (you have adoptVector, but I would just call it Send() or something). More explicit and more intuitive since it reflects the ownership. Also exception safety is ensured by simple scoping and clear(standard) ownership semantics.

You expose less of the details and you keep everything in one place. If you really do not like the ctx.outputs(). we could hide that behind a static std::makeVector<> but when we discussed this the feedback was that people liked to see the ctx.outputs() as it makes clear you are creating something outgoing.

i have no problem with ctx.outputs() being explicitly visible - other than the confusing naming (plural suggests you get some collection of outputs, but you get a DataAllocator (i think), all that is hidden by everything is auto, so hard to actually find the thing you are getting)

Apart from that, which we can keep for a later discussion, I would drop the pmr namespace from o2::pmr::vector since we do not have and we will not have any o2::vector to differentiate from (i.e. an o2::vector is the vector to use if you want to exchange it using o2).

I have no strong preference here, but I think there is no need to hide the fact that the vector is a pmr vector. Plus it would make it easy to switch to std::pmr::vector when that gets implemented in the standard libraries.

mkrzewic · 2018-12-04T14:13:24Z

now I come to think of it: maybe the make* methods that return something managed (like the ones that call adopt(...) internally) should return by pointer, not by value (to avoid dtors being called on exception) and not by reference (confusing since value semantics suggest ownership).
The makeVector(...) I'm suggesting returns by value and it is then clear - the thing you get is self contained, you own it until you move it away. To avoid confusion maybe a different name should be chosen to avoid ambiguity with the other make* methods.

ktf · 2018-12-04T14:26:16Z

No, there is a huge difference semantically by having an allocator, an output channel or anything related to transport layer and specifying an output datatype, which does not know anything about how data gets transported to the rest of the consumers.

The example you have:

auto v = o2::vector<Clusters>{ctx.outputs().getTransportAllocator(), Args...}

will not be enough for the case in which you have multiple outputs or multiple transports. If at lower level as an implementation detail we want to use pmr and similar, fine, in the DPL the design tradeoff is:

Users specify inputs in terms of O2 Data Model descriptors.
Users create output in terms of the O2 Data Model descriptors.
No knowledge is needed about the underneath transport and how and how many devices are connected using which technology.

These requirements are there since the beginning and I think they are turning out to be a good idea. I am happy to improve the API and remove confusions, but I think those principles should remain intact.

Regarding the naming of o2::pmr::vector vs o2::vector I do prefer o2::vector so if there is no strong objection, I think it would better to use the latter. I refuse to believe people will want to know about polymorphic memory resources. ;-)

mkrzewic · 2018-12-04T15:30:09Z

The example you have:
auto v = o2::vector<Clusters>{ctx.outputs().getTransportAllocator(), Args...}
will not be enough for the case in which you have multiple outputs or multiple transports.

of course it is enough:

auto v = o2::vector<Clusters>{ctx.outputs().getTransportAllocator(Output("TPC/TRACKS/0")), Args...}

something like that will take care of selecting the proper one. We should probably also avoid specifying the output by string since it is supposed to be predeclared already. Also note there is no need to specify the size since it will either resize itself or user calls reserve() - it is a std::vector after all.

If at lower level as an implementation detail we want to use pmr and similar, fine, in the DPL the design tradeoff is:

Users specify inputs in terms of O2 Data Model descriptors.

Users create output in terms of the O2 Data Model descriptors.

No knowledge is needed about the underneath transport and how and how many devices are connected using which technology.

These requirements are there since the beginning and I think they are turning out to be a good idea. I am happy to improve the API and remove confusions, but I think those principles should remain intact.

sure, nothing wrong with it, I don't see how my proposal breaks it. I'll reshuffle the code a bit to make it a bit more clear.

Regarding the naming of o2::pmr::vector vs o2::vector I do prefer o2::vector so if there is no strong objection, I think it would better to use the latter. I refuse to believe people will want to know about polymorphic memory resources. ;-)

should be possible, but i see a funny interference with std::vector after changing that - in some cases o2::vector gets precedence in implementation files which use "vector", even after a "using std::vector" statement at the beginning...

ktf · 2018-12-04T16:10:28Z

Regarding your example, I still prefer:

auto v = ctx.outputs().makeVector<Cluster>{Output("TPC/TRACKS/0")), Args...}
// or:
auto v = ctx.outputs().make<o2::vector<Cluster>>{Output("TPC/TRACKS/0")), Args...}
// or:
o2::ref<std::vector<Cluster>> v = ctx.outputs().make<std::vector<Cluster>>{Output("TPC/TRACKS/0")), Args...}

over:

auto v = o2::vector<Cluster>{ctx.outputs().getTransportAllocator(Output("TPC/TRACKS/0")), Args...}

as it is more symmetric with all the cases which we will have which do not support polymorphic allocators (e.g. histograms, boost serialised objects as implemented right now) or which have different mechanisms (e.g. arrow and it's pool_allocator business). How would you treat those cases?

mkrzewic · 2018-12-04T16:54:37Z

just to answer you quickly: i'm not aiming at full symmetry, I don't believe it is quite possible and I would not like to be limited to only one possibility. As for the other cases: I don't know ATM, but i'll give it some thought:)
This PR however only contains the "symmetric" case: a DataAllocator::make method. The "manual construction" method is simply a fact of life and it should be possible to use it if desired. The "make" method is just syntactic sugar that wraps explicit construction.

dennisklein · 2018-12-04T17:07:27Z

IMHO, esposing allocators is as bad as an idea as it can possibly get. Data own by the framework should be created via the framework.

What do you mean exactly by "created"?

Assuming you mean "allocated", then Mikolajs contribution is exactly enforcing the property you stated above. If the framework does not expose its allocator, then the stated property will deny moving data to the framework, or am I missing anything? That is at least the feature I recognize Mikolaj wants to add here?

ktf · 2018-12-04T20:07:07Z

@dennisklein In an entity-component-system system, the system should be used to create entities and components. They should not come to life autonomously. In this case, a method of DPL should be used to construct an object, allocated with whatever policy (which is really an implementation detail and should not be exposed) and return it to the user. The user should not care / know how the object comes into life and the object itself should not expose how it comes into life or who it belongs to. Ideally I would not even expose the fact that the container is a std::vector (which requires guarantees on the contiguousness of the memory region) but I would expose an iterator pair / range, albeit that's a different story.

@mkrzewic given it's syntactic sugar, why not having symmetric sugar and have a specialisation which matches:

  o2::vector<T> make<o2::vector<T>>(const Output& spec, Args&&... args)

isn't what you propose a simply a more verbose and implementation details exposing version of the same thing? I am not against breaking symmetry (we do it actually for pointers and everything else already), but why deliberately breaking the symmetry?

mkrzewic · 2018-12-04T21:07:42Z

@ktf makeVector looks how it looks like (i.e. slightly differently) because your make<> templates consistently return by reference - which is in my opinion dangerous since the compiler will not warn you if you catch by value - you get a local copy and will not be working with the entity that was registered internally...
Exposing iterators/ranges is a fantastic device, but only works for consumers of read-only data. When producing we don't know how big a range is going to be so we need access to the actual container and we need to actually control lifetime ourselves - my preference is to do it by using scoped handles like unique_ptr, std::vector etc... I think modern C++ gives us devices which render the older patterns less useful in their pure form - isn't an entity-component system largely a lifetime management system that was needed because it used to be hard to deal with object lifetime management?

dennisklein · 2018-12-04T21:52:41Z

isn't an entity-component system largely a lifetime management system that was needed because it used to be hard to deal with object lifetime management?

Nono, that's called ROOT. Or in other words: For every type T have another type TManager which is a singleton owning container of all the objects of T. :-P

Or in modern ROOT: TManager is a thread local singleton.

ktf · 2018-12-04T21:57:29Z

yes, I agree with your concern, this is why I was proposing to have an o2::observer_ptr<T> or whatever you want to call it to avoid such a case. Maybe I should retake on the exercise at some point.

dennisklein · 2018-12-04T21:57:32Z

Well, nevermind..

mkrzewic · 2018-12-05T15:34:54Z

I squashed everything together for an easier read and added observer_ptr (could not find anything like that in boost), still not sure what to do with the makeVector() method...

Get rid of Stack::getFreefn() - use stl containers with pmr allocator Fix codechecker concerns Clang format Rename o2::memory_resource -> o2::pmr, introduce o2::pmr::vector Advertise explicit return type from makeVector instead of auto Minor update of documentation o2::vector

mkrzewic · 2018-12-20T12:40:29Z

it seems this is stuck. Since a large part of this PR is actually not DPL related and contains a long needed fix of the data header (like the removal of Stack::freefn - which has to go away) I could move the DPL related part to another PR, but then the author of DPLmerger (@gabrielefronze ) will have to fix or remove that code.

mkrzewic · 2019-01-10T23:07:35Z

removing [WIP], there is not much that can be changed here on short notice. This PR fixes a number of long standing issues related to the data model and the remaining concerns can be resolved in the future if needed.

ktf · 2019-01-11T08:16:35Z

Framework/Core/include/Framework/DataAllocator.h


 // Do not change this for a full inclusion of FairMQDevice.
-class FairMQDevice;
+#include <FairMQDevice.h>


Is this really necessary? There was a big cost in including the FairMQDevice.h due to the boost state machine.

FairMQ headers do not contain MSM anymore, all MSM is in source files.

Is it true? I see

#include <FairMQStateMachine.h>

which itself includes MSM. When was this changed?

In May 2018 MSM includes were moved to FairMQStateMachine.cxx.

ktf · 2019-01-11T08:17:25Z

Framework/Core/include/Framework/observer_ptr.h

+// In applying this license CERN does not waive the privileges and immunities
+// granted to it by virtue of its status as an Intergovernmental Organization
+// or submit itself to any jurisdiction.
+


Could you split this is a separate PR, adding a test exercising it? I think this can go in without further discussion.

ktf · 2019-01-11T08:18:43Z

Utilities/aliceHLTwrapper/src/MessageFormat.cxx

  return mBlockDescriptors.size() - count;
 }

-int MessageFormat::addMessages(const vector<BufferDesc_t>& list)


These changes (adding a std::) can also go in as soon as possible. Could you factor them out in a separate PR?

mkrzewic · 2019-01-11T11:47:38Z

ok, i'll reshuffle. closing in favour of #1581 and following.

ktf reviewed Nov 29, 2018

View reviewed changes

mkrzewic force-pushed the pmr_in_dpl branch from 3dfddf5 to 0fe2c66 Compare November 30, 2018 14:51

mkrzewic force-pushed the pmr_in_dpl branch 2 times, most recently from 5d6d4a8 to 6ef1e5e Compare December 5, 2018 15:34

mkrzewic force-pushed the pmr_in_dpl branch from 6ef1e5e to 7b0166e Compare December 5, 2018 16:11

mkrzewic added 3 commits December 20, 2018 13:31

Add observer_ptr ( a la C++20 TSv2)

4c248e9

Add make_observer for good measure

0555ce0

mkrzewic force-pushed the pmr_in_dpl branch from 204f981 to 0555ce0 Compare December 20, 2018 12:32

mkrzewic changed the title ~~[WIP] polymorphic allocators in DPL~~ Polymorphic allocators in DPL Jan 10, 2019

ktf reviewed Jan 11, 2019

View reviewed changes

mkrzewic closed this Jan 11, 2019

mkrzewic mentioned this pull request Jan 11, 2019

Stack fixes #1581

Merged

mikesas pushed a commit to mikesas/AliceO2 that referenced this pull request Dec 13, 2022

Squashing commits (AliceO2Group#1514)

812481a

Conversation

mkrzewic commented Nov 28, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sawenzel commented Nov 29, 2018

Uh oh!

mkrzewic commented Nov 29, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mkrzewic commented Nov 29, 2018

Uh oh!

ktf commented Nov 30, 2018 via email

Uh oh!

mkrzewic commented Nov 30, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ktf commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 4, 2018

Uh oh!

ktf commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 4, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ktf commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 4, 2018

Uh oh!

dennisklein commented Dec 4, 2018

Uh oh!

ktf commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 4, 2018

Uh oh!

dennisklein commented Dec 4, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ktf commented Dec 4, 2018

Uh oh!

dennisklein commented Dec 4, 2018

Uh oh!

mkrzewic commented Dec 5, 2018

Uh oh!

mkrzewic commented Dec 20, 2018

Uh oh!

mkrzewic commented Jan 10, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mkrzewic commented Jan 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

mkrzewic commented Nov 28, 2018 •

edited

Loading

mkrzewic commented Nov 29, 2018 •

edited

Loading

mkrzewic commented Nov 30, 2018 •

edited

Loading

mkrzewic commented Dec 4, 2018 •

edited

Loading

dennisklein commented Dec 4, 2018 •

edited

Loading