Q: Producer/Consumer ring is a common pattern for high perf comm between 2 CPU cores or CPU core & device. Thus, I expected Intel to have non-temporal store instruction that write to LLC without polluting L1/L2. Useful also with device DDIO. But MOVNT* also bypass LLC. Why? (1/3)
-
- Još 7 drugih odgovora
Novi razgovor -
-
How can you be sure client and server are sharing L2 cache? It limits portability of code. If you're adding an instruction, make it "send to ACPIC_ID, mem", which will make sure the write is visible to the other CPU w/ minimal perf hit. See also:https://twitter.com/AviKivity/status/1139469089527750656 …
-
Cool, I didn't remember that
@AviKivity brought up similar question. Writing directly to target's L1/L2 cache in case of core2core comm is indeed an even better non-temporal store for that case. For devices, you want to write to LLC DDIO-accessible ways. But yes idea is the same. - Još 1 odgovor
Novi razgovor -
Čini se da učitavanje traje već neko vrijeme.
Twitter je možda preopterećen ili ima kratkotrajnih poteškoća u radu. Pokušajte ponovno ili potražite dodatne informacije u odjeljku Status Twittera.