- 2 times faster CPU share verification (11 -> 5 ms)
- 1.5 times faster light cache initialization
- Use `popcnt` instruction only when it's supported
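
For illustration only, a minimal sketch of how a runtime `popcnt` check with a software fallback might look. This is not the project's actual code: the names `popcount_soft`, `popcount_hw` and `select_popcount` are hypothetical, and the sketch assumes GCC or Clang on x86 (other targets fall back to the portable routine).

```cpp
#include <cstdint>

// Portable fallback: counts set bits without using the popcnt instruction.
static uint32_t popcount_soft(uint64_t x)
{
    x = x - ((x >> 1) & 0x5555555555555555ULL);
    x = (x & 0x3333333333333333ULL) + ((x >> 2) & 0x3333333333333333ULL);
    x = (x + (x >> 4)) & 0x0F0F0F0F0F0F0F0FULL;
    return static_cast<uint32_t>((x * 0x0101010101010101ULL) >> 56);
}

#if (defined(__GNUC__) || defined(__clang__)) && (defined(__x86_64__) || defined(__i386__))
// popcnt code generation is enabled for this one function only,
// so the rest of the binary still runs on CPUs without it.
__attribute__((target("popcnt")))
static uint32_t popcount_hw(uint64_t x)
{
    return static_cast<uint32_t>(__builtin_popcountll(x));
}

static bool has_popcnt()
{
    __builtin_cpu_init(); // safe to call even if already initialized
    return __builtin_cpu_supports("popcnt") != 0;
}
#else
// Assumption: on other compilers/targets, always use the portable routine.
static uint32_t popcount_hw(uint64_t x) { return popcount_soft(x); }
static bool has_popcnt() { return false; }
#endif

using popcount_fn = uint32_t (*)(uint64_t);

// Pick the implementation once; callers keep the returned pointer.
static popcount_fn select_popcount()
{
    return has_popcnt() ? popcount_hw : popcount_soft;
}
```

Selecting the function pointer once at startup keeps the CPU feature check out of the hot path.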