Simply put : You need fastest single thread available (since I doubt old software will use more than two cores/threads).
If you want to go cheap and be 100% Win XP compatible, a good start would be OC'ed Phenom II (with biggest L3 cache), or Core i5's 1-st gen (Quad Core only, because i5 6xx series and lower have slower IMC's than full quads).
From newer CPU's :
Pentium G3258 OC'ed, should be a great and cheap CPU for this (but Win XP support on it... is "tricky" at best 🙁).
Best CPU would be the newest/fastest Core i5 (currently i5 6600k is "the top dog").
The big question is : How much $$$ you want to sink into project like this, and do you want to OC the CPU (if OC is out of the question, search for highest clocked locked versions).
To clarify :
I DID NOT tested how those CPU's behave in software render.
I only pointed out those that (to me), have best chance at doing what needs to be done.
PS. Fast RAM is higly recommended (lowest timings with highest clock speed, I don't khow how what is more important for old soft render, ie. Bandwidth or Latency).
Last thing... Windows 10 HATES DX9 software renderer from 3DMark 03 and 05.