I see there are numerous threads like this one. I took the suggestion of setting the receive window size to the default of 32kb. After that, using the same 3 queues, instances are running on average at 0% CPU usage. System process is also at 0%.
It is unfortunate that the higher receive window uses so much CPU in v4. I was able to drastically increase download speeds with the high receive window from certain locations. I was using a 512kb window originally.
Any news on this would be nice!