7.608 M/sec iTLB-loads w/ 50.80% iTLB-load-misses for normal mapping, vs 1.547 M/sec iTLB-loads w/ 48.82% iTLB-load-misses with text mapped as a huge page.
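The miss percentages barely differ, so the gain is easier to see as absolute misses per second. A quick sketch of that arithmetic, assuming the percentage reported by perf is relative to iTLB-loads and that both runs are comparable in duration (the function name is just for illustration):

```python
def itlb_misses_m_per_sec(loads_m_per_sec, miss_pct):
    """Absolute iTLB misses in M/sec, from the load rate and the miss percentage."""
    return loads_m_per_sec * miss_pct / 100.0

# Figures quoted in the tweet above.
normal = itlb_misses_m_per_sec(7.608, 50.80)  # normal mapping
huge = itlb_misses_m_per_sec(1.547, 48.82)    # text mapped as a huge page

print(f"normal mapping: {normal:.3f} M misses/sec")
print(f"huge page text: {huge:.3f} M misses/sec")
print(f"reduction:      {normal / huge:.2f}x")
```

Under those assumptions this works out to roughly 3.86 vs 0.76 M misses/sec, i.e. about a 5x drop, even though the miss ratio itself is nearly unchanged.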
The magic necessary (linker flags to align the text sufficiently, link to libhugetlbfs, editing elf headers with hugeedit --text, setting up accessible hugetlbfs) seems too big for most, but I wonder if there's a way that could be simplified.
Oh, and besides being complicated: It currently also breaks perf profiles...
Nice.
#FreeBSD does this already (though admittedly I'd like to have more control over whether it does it, which is more of an issue for DSM segments than the text segment, e.g. for parallel hash joins; on Linux the answer there is also "never"). https://twitter.com/MengTangmu/status/1135003656032587776 …
It does? For it to work well one needs to make sure that the text segment and data/bss segments are far enough apart to fit onto separate 2MB huge pages. Otherwise you'd give up sharing the text segment, which'd again have its own cost.
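To make the "far enough apart" condition concrete, here is a minimal sketch that checks whether two address ranges touch the same 2MB huge page. The addresses are made up for illustration, not taken from any real binary, and the helper names are hypothetical:

```python
HUGE_PAGE = 2 * 1024 * 1024  # 2MB huge page size

def huge_pages_spanned(start, end, page=HUGE_PAGE):
    """Indices of the huge pages touched by the half-open range [start, end)."""
    return range(start // page, (end - 1) // page + 1)

def share_huge_page(text, data, page=HUGE_PAGE):
    """True if the text and data ranges touch at least one common huge page."""
    t = huge_pages_spanned(text[0], text[1], page)
    d = huge_pages_spanned(data[0], data[1], page)
    return t.start < d.stop and d.start < t.stop

# Hypothetical layout: 1.5MB of text starting at 4MB.
text = (0x400000, 0x580000)
data_packed = (0x580000, 0x600000)   # data placed directly after text
data_aligned = (0x600000, 0x680000)  # data pushed to the next 2MB boundary

print(share_huge_page(text, data_packed))   # same huge page as the text
print(share_huge_page(text, data_aligned))  # separate huge pages
```

In the packed layout the writable data lands on the same huge page as the text, which is the case where mapping that page huge would mean giving up sharing the text segment.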