-
Notifications
You must be signed in to change notification settings - Fork 87
Description
Hey Pierre,
We have an application that's very computationally intensive. But our customers can use a wide range of CPU's. Ranging from low-end i3's to some of the highest end i7's. We mostly see potential gains for the high-end i7's, especially the newer models that ship with Efficiency and Performance cores. Furthermore we also have some use cases where they run on Xeon processors but that's another story.
We also use a lot of TParallel.For's in our application and we see a lot of benefit on personal devices tweaking the block arena count. (few million contentions in a 100 second benchmark)
It's hard for us to ship with an one size fits all. Do you have any plans / ideas for determining these runtime?
Also for an example on my i7-12700h using:
CFastMM_SmallBlockArenaCount = 14;
CFastMM_MediumBlockArenaCount = 8;
CFastMM_LargeBlockArenaCount = 6;
Is about 8% faster.