VOGONS


First post, by ElBrunzy

User metadata
Rank Oldbie
Rank
Oldbie

I just received my terminator card for this dell gxpro computer. I used to have some 200mhz ppro cpu: 2x256kb, 1x512kb and 1x1mb. This computer used to run debian sarge and win2000, now it's hosting dos6.22 and win98se so it felt kind of a waste to have SMP 2x256 on those OS. The obvious choice was to use the 1024kb cache version but I was curious about the TDP that intel declare of them cpu and how dos and win98 handle l2 cache size difference. In virtual 8086 mode the results where kind of dull, but in real mode some bench in dos started to show difference, so I kept those numbers. I think my choice should be the 512kb as I found it has the best tradeoff power usage/speed, luckily I was able to redress the pins, this cpu was all beaten up and left for dead where I found it. My guess about the 1024kb poor performance is that intel has to somehow slow down the l2 cache of the 1mb version of the ppro to add more and this was not necessary on the 512kb version, something related to row/column delay and that 16bit os didnt care that much about huge amount of l2. I might be wrong, it's just a tough ... so here is the number:

2 x 256kb:
off: 3.6w
windows idle: 90w
windows nu SI bench: 97w, score: 84.1
dos idle: 93w
nssi cpu: 94w score 95162Dhrystones
nssi fpu: 95w score 41760KWhetstones
pctools sysinfo overall: 95w performance index: 42.2
nu8 sysinfo: Maint processor: 95w pentium 225mhz: 498.8
nu8 sysinfo overall: 95w performance index: 128.0

1 x 256kb:
off: 3.6w
windows idle: 47w
windows nu SI bench: 72w, score: 86
dos idle: 69.3w
nssi cpu: 69.2w score 95171Dhrystones
nssi fpu: 68.8w score 42080KWhetstones
pctools sysinfo overall: 69w performance index: 45.4
nu8 sysinfo: Maint processor: 70w pentium 225mhz: 498.9
nu8 sysinfo overall: 70w performance index: 129.5

1 x 512kb:
off: 3.6w
windows idle: 47.3w
windows nu SI bench: 78w, score: 97.9
dos idle: 75w
nssi cpu: 76.5w score 95171Dhrystones
nssi fpu: 76.2w score 41931KWhetstones
pctools sysinfo overall: 76w performance index: 43.6
nu8 sysinfo: Main processor: 77.2w pentium 225mhz: 498.9
nu8 sysinfo overall: 77.5w performance index: 129.6

1 x 1mb:
off: 3.6w
windows idle: 84.5
windows nu SI bench: 85w, score: 98.5
dos idle: 83w
nssi cpu: 84w score 95207Dhrystones
nssi fpu: 84.2w score 42044KWhetstones
pctools sysinfo overall: 84w performance index: 39.9
nu8 sysinfo: Main processor: 86.6w pentium 225mhz: 498.8
nu8 sysinfo overall: 85w performance index: 128.0

Last edited by ElBrunzy on 2017-01-02, 02:15. Edited 2 times in total.

Reply 2 of 28, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

Interesting! I've always wondered about these 3 variants of PPro.

Not sure about the 1MB variant being slower, perhaps it was really meant to do very heavy server types of data handling and were kinda over-optimized specifically for that?

Whats missing in your collections?
My retro rigs (old topic)
Interesting Vogons threads (links to Vogonswiki)
Report spammers here!

Reply 3 of 28, by ElBrunzy

User metadata
Rank Oldbie
Rank
Oldbie
F2bnp wrote:

Nice thread!
Would you be willing to try out some games as well? Wonder if the added cache would make any of them run faster in Software mode or even 3D accelerated!

of course I would, and I in fact wanted to do so, but time was of the essence (the beer store close at ten) and I couldnt think of any demos or games that would give you an fps. But I know some does, do you have any title in mind ? Maybe big red racing ?

Tetrium wrote:

Not sure about the 1MB variant being slower, perhaps it was really meant to do very heavy server types of data handling and were kinda over-optimized specifically for that?

on other tough maybe intel had to add an extra cache controller to handle the second 512kb half and that would slow down the whole process when using only the first few kb of cache. Because, of course, I dont think nssi, nu8 and pctools benchmark are complex enough to take care of l2 cache difference, but then again, who is ?

Reply 5 of 28, by ElBrunzy

User metadata
Rank Oldbie
Rank
Oldbie

I did some read in hope to understand why the cache made so little difference. I have not find any satisfactory information but ideas for some more win98se bench software. Also I did reflect on the F2bnp idea and came to the same conclusion as amadeus7779999. I must have spent an hour to try to make fps appear on timedemo, so finally I used another method that give same results in a fraction of the time. Spoiler : the results are even more boring than my first post. It start to make no sence at all... I'm still open to idea, but unless I can boot a nt4/2000 on that machine I see no point in continuing my investigation.

Quake 1.08
msdos6.22 clean boot
software rendering on matrox g450 pci
quake108 -nosound -nocdaudio
~viewsize 100
~timerefresh at quake gates

1 x 256kb:
watts: 83w
320x200 fps: 43.9
360x400 fps: 24.8
640x480 fps: 8.7
1280x1024 fps: 2.28

1 x 512kb:
watts: 90w
320x200 fps: 46.5
360x400 fps: 25.3
640x480 fps: 8.7
1280x1024 fps: 2.28

1 x 1024kb:
watts: 97w
320x200 fps: 47.5
360x400 fps: 27.1
640x480 fps: 8.8
1280x1024 fps: 2.29

PCMark2002
win98se

1 x 256kb:
watts: 90w
CPU score: 548

1 x 512kb:
watts: 97w
CPU score: 557

1 x 1024kb:
watts: 105w
CPU score: 561

WinBench97
win98se

1 x 256kb:
watts: 88w
CPUMark16: 351
CPUMark32: 514

1 x 512kb:
watts: 93.5w
CPUMark16: 361
CPUMark32: 588

1 x 1024kb:
watts: 102w
CPUMark16: 366
CPUMark32: 601

7zip bench
win98se 7-Zip 9.20 (default benchmark)

256kb

       speed	cpu usage	rating
comp 162KBs 100% 158MIPS
deco 1998KBs 100% 180MIPS

512kb

       speed	cpu usage	rating
comp 168KBs 100% 164MIPS
deco 2023KBs 100% 182MIPS

1024kb

       speed	cpu usage	rating
comp 170KBs 100% 166MIPS
deco 2029KBs 100% 183MIPS

Reply 6 of 28, by feipoa

User metadata
Rank l33t++
Rank
l33t++

It would be neat to see the CPU cost per fps increase. You could also add the cost of the extra power consumption.

Plan your life wisely, you'll be dead before you know it.

Reply 7 of 28, by ElBrunzy

User metadata
Rank Oldbie
Rank
Oldbie
feipoa wrote:

It would be neat to see the CPU cost per fps increase. You could also add the cost of the extra power consumption.

I cannot beleive you can be serious, you must be pulling my legs

Reply 10 of 28, by gerwin

User metadata
Rank l33t
Rank
l33t
mrau wrote:

isnt that somewhat low for quake?

Regarding Quake 1 in VGA mode; Swaaye's pentium pro 1MB does 42,7 FPS when underclocked to 133MHz (2x66). source. But it is not really fair because i think he enabled write combining for the VGA range, which is not a reliable setting IMO. Contrary to VESA video modes, where it is surely advisable to enable LFB range write combining, with mtrrlfbe or a similar tool.

--> ISA Soundcard Overview // Doom MBF 2.04 // SetMul

Reply 11 of 28, by mrau

User metadata
Rank Oldbie
Rank
Oldbie
gerwin wrote:
mrau wrote:

isnt that somewhat low for quake?

Regarding Quake 1 in VGA mode; Swaaye's pentium pro 1MB does 42,7 FPS when underclocked to 133MHz (2x66). source But it is not really fair because i think he enabled write combining for the VGA range, which is not a reliable setting IMO. Contrary to VESA video modes, where it is surely advisable to enable LFB write combining with mtrrlfbe or a similar tool.

im talking about 640 resolution in particular; iirc someone here on a low end p6 got really fluent quake with a matrox vard (might even have been an older one than g450)

Reply 12 of 28, by gerwin

User metadata
Rank l33t
Rank
l33t

Well, then the last sentence of my previous post applies: 640 resolution in Quake uses a VESA mode, which preferably uses a Linear Frame Buffer (LFB). Write combining greatly speeds up the LFB memory access -> more FPS. Does not matter which graphics chipset, it is a CPU feature.

--> ISA Soundcard Overview // Doom MBF 2.04 // SetMul

Reply 13 of 28, by mrau

User metadata
Rank Oldbie
Rank
Oldbie
gerwin wrote:

Well, then the last sentence of my previous post applies: 640 resolution in Quake uses a VESA mode, which preferably uses a Linear Frame Buffer (LFB). Write combining greatly speeds up the LFB memory access -> more FPS. Does not matter which graphics chipset, it is a CPU feature.

no no, write combining speeds up by over 100%? also i remember explicitly the statement that the function in question comes from the GPU

Reply 15 of 28, by mrau

User metadata
Rank Oldbie
Rank
Oldbie
gerwin wrote:

100%, seemingly it goes that far: Benchmarks write combining.

noice, however ive seen many people do not reach that much of an increase; is this similar/complementary/whatever to uncacheable speculative write combining?

Reply 17 of 28, by mrau

User metadata
Rank Oldbie
Rank
Oldbie
gerwin wrote:

I don't know. For starters, where are these other people's results?

on this forum, but don't force me to search, i kinda cannot do this effectively; i'm telling this from memory, so don't crucify me either

Reply 18 of 28, by gerwin

User metadata
Rank l33t
Rank
l33t

It is no problem 😀. Just give MTRRLFBE from Rayer a spin when you are chanced. Other then that, I found it is difficult to keep benchmark results comparable. There is often some detail or setting which complicates it.

I really wonder why BIOS developers generally did not include LFB write combining for DOS. Or maybe DOS games should have enabled this themselves. I found out about it way too late... Maybe in the pentium 2 era DOS was already out of favour for gaming.

Last edited by gerwin on 2017-01-15, 17:25. Edited 1 time in total.

--> ISA Soundcard Overview // Doom MBF 2.04 // SetMul

Reply 19 of 28, by mrau

User metadata
Rank Oldbie
Rank
Oldbie
gerwin wrote:

It is no problem 😀. Just give MTRRLFBE from Rayer a spin when you are chanced. Other then that, I found it is difficult to keep benchmark results comparable. There is often some detail or setting which complicates it.

I really wonder why BIOS developers generally did not include LFB write combining for DOS. Or maybe DOS games should have enabled this themselves. I found out about it way too late...

would this work on an amd fx 6200 cpu? im not sure how id get dos running here