VOGONS


First post, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

-Start Edit- You can download the 2 versions of superpi I used in my 3rd post!!-End Edit-

Some time ago I had this 'contest' with a friend of mine about who has the most computational power in the house.

The challenge: Could I build enough old singlecore systems to beat my friends new Intel Quadcore?

During the course of several months I build computers from parts I had laying around and ran superpi on every one of them.
Many of these systems didn't even have a case and were put together with parts I wouldn't think of everyday use.

In the end this challenge also did provide me with a small but interesting database of how fast a certain processor could do calculations.
I chose superpi as it's benchmarking and results are pretty straightforward.

I didn't use any optimised versions of superpi as these would taint the results. I wanted to have an untainted database.
Also superpi is able to run on anything that has windows 95 and newer. The only problem was finding a superpi that calculated in factions of seconds and could run on any windows. As this proved to be impossible I used a modded version of superpi for any windows prior to 2k and XP which did calculate fractions of a second.

I did try googling up results for older and newer systems but most of these used many different hardware and software, many of them overclocked and thus useless to me.

On all the systems I build I used the 1M calculation
Calc/s is calculations per second (1M is 1 million calculations)
Score is the calculation time inversed, then multiplied by a million
(Calculating a score makes so you can add up scores to see how fast all computers are compared to a single modern system)
Because I have so many results (and I'm dead tired as I'm writing this) I formatted the results in this way:

Intel Tetrium ($$$mhz), motherboard and OS used (along with additional info if applicable)
$$$$$s [=Superpi 1M calculation in seconds] (calculations per mhz), score:"the score" calculated by inverting $$$$$s and then multiplying by a million (=1000000 😁 )

Here are my results!
Processor (Mhz)...................Motherboard........Seconds (1M calc)..Calc/s.........OS..........Score

i486dx4 w/t (99.5mhz) AP43 socket 3, Sis496, 64MB FPM, Windows 95B
10960,278s (0,917), score:91,2

Celeron Mendocino (401mhz), MS-6159 i440LX, 192MB SDRAM 66 cl2, Windows ME
444s (5,617), score:2252,3

K6-3-400 Sharptooth (±400mhz), ALI s7 GA-5AX, 256MB SDRAM PC-100 cl2, Windows ME (Currently my Voodoo 2 rig)
429s (5,826), score:2330,6

Pentium 3 Coppermine (551mhz), ASUS P2B, i440BX, 512MB SDRAM 100, Windows ME (My very first pc, completely filled with lots of old junk)
243s (7,469), score:4115,2

Pentium 3 Coppermine (±667mhz), slot 1 VIA chipset, SDRAM PC133 cl3, Windows ME
240,145s (6,243), score:4164,2

Celeron Coppermine (±800mhz), s370 i440BX, 256MB SDRAM, Windows ME
222s (5,631), score:4504,5

Pentium 3 Coppermine (551mhz), slot 1 i440BX, SDRAM, W2K
206,777s (8,777), score:4836,1

Tualeron (±1300mhz), s370 (system given to me, benched it as it was), Windows XP
206,637s, score:4839,4

Duron Spitfire (800mhz), Abit Kv7-V POS, DDR, Windows ME (Couldn't get XP on the bugger no matter what I tried)
193,554s, score:5166,5

Sempron Thoroughbred 2600+ (±1090mhz), same Abit KV7-V as above
126,738s, score:7890,3

Sempron Thoroughbred 2600+ (±1833mhz), same Abit again
82,810s, score:12075,8

Pentium 3 Coppermine (864mhz), s370 i815 oem board, SDRAM PC133, Windows XP
150,596s (7,686), score:6640,3

Athlon Thunderbird(B?) (±1000mhz), s462 mATX board with iirc a VIA KT133, windows XP
144,418s (6,924), score:6924,3

Pentium 3 Coppermine (±1000mhz), s370 i815 (My 2nd work pc, briefly replacing my 1st pc before I got my A64)
141,264s (7,078), score:7078,9

Pentium 3 Coppermine (985mhz), s370 i815 Fujitsu Siemens oem board iirc D1219, FS Windows XP
141,003s (7,200), score:7092,0

Pentium 3 Tualatin (±1200mhz), s370 VIA 694T SDRAM
133,212s, score:7506,8

Athlon Thunderbird B (±1100mhz), ASUS A7V, VIA KT133, SDRAM, Windows XP
130,518s, score:7661,8

Athlon Thunderbird C (±1000mhz), ASUS A7V133, VIA KT133A, SDRAM, Windows XP
128,305s, score:7793,9

Athlon XP Palomino 2000+ (±1666mhz), ASUS A7V133, SDRAM, Windows XP
83,007s (7,231), score:12046,9

Athlon XP Palomino 1800+ (±1533mhz), ASUS A7V133, SDRAM PC133 cl2, Windows XP
81,267s, score:12305,1

Athlon XP-M Thoroughbred 2000+ (±1753mhz) 167x10.5, ASUS A7V8X-X, DDR, Windows XP
66,579s, score:15019,8

Athlon XP Barton 3200+ (2200mhz), VIA KT600 Fujitsu Siemens oem motherboard (by Gigabyte), DDR400, FS Windows XP
50,844 (8,940), score:19668,0

Athlon 64 Venice 3200+ s754 (±2200mhz), VIA K8M800, DDR400, Windows XP (ordered this board by mistake, decided to keep it and build another computer from parts laying around)
41,016s (11,082), score:24380,7

Athlon 64 Venice 3500+ s939 (2200mhz), VIA K8T800 (Pro?), DDR400 DC. Windows XP (My current working computer)
40,64s (11,185), score:24606,3

I know it looks messy, I'll inprove the formatting tomorrow when I'm not as tired and I know some results are missing, mostly different cpu's used on my GA-5AX

If anyone else can be arsed to run superpi on their retro computer(s), please post your serults in this thread so we can start comparing some data!

Last edited by Tetrium on 2010-07-06, 12:05. Edited 2 times in total.

Reply 1 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

Some interesting notes about these results:

The Palomino 2000+ scores lower then the Palomino 1800+ even though they both use the same ASUS A7V133.
This might be because the 2000+ uses normal PC133 whilst the 1800+ uses PC133 cl2.
Seems higher bandwidth will improve performance of these processors
This seems to be confirmed by the Thunderbird 1000 C having a slightly better score then the Thunderbird 1000 B

The P6 celerons seem to have about 70% the computational strength of their respective bigger P3 brothers.

I did the same (testing cl3 and cl2) with the Katmai 550 (later replaced with a coppermine), I tested first some PC100 cl3 and then the exact same configuration at cl2. Didn't make a significant difference.

To my surprise the K6-3 400 scored a little better then the Celeron 400 even though the K6-3 is supposed to have a crappy FPU.

Overall the 64 bit Athlons score about 25% higher then their equally fast XP's.

And my badly configured 486 seemed to score WAY lower then I had anticipated. It would be interesting to get more results of this generation of processors.
As I was testing without a mouse (for some reason it wouldn't work anymore after having moved) and having to do all the windows functions with the keybaord I just couldn't be bothered to try other configurations

Reply 3 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

EDIT --> SUPERPI DOWNLOAD INCLUDED IN THIS POST <--

I could either upload them somewhere. I remember for XP I used the superpi mod 1.5 version.
The 9x version I used is the only version I found that gives results in fractions of a second.

I'll go search for the downloads and edit in the links in this or the original post 😉

And the more benchmark results the better 😁

Edit:Uploaded them to here, these are the ones I use.

There is a small bug in the 9x version though. After having run for the 1st time, it will crash the system after it has written the result to the result-file.
After reboot it will have written the data to the file and superpi will work perfectly fine afterwards.
It will NOT crash if theres already a superpi benchmark result written to the result-file.

What I do is this: I run superpi 32k on some random system, save the result (yes it will crash the 1st time) and after reboot, I distribute the calculated result with superpi to any other computer and just ignore the 32k result from the other computer. NO CRASHES 😀

Btw, whenever I run superpi I always run in this order: 16k, 64k, 256k and finally 1M

Attachments

  • Filename
    super_pi_mod-1.5.zip
    File size
    60.44 KiB
    Downloads
    493 downloads
    File comment
    2k xp version of superpi
    File license
    Fair use/fair dealing exception
  • Filename
    Super Pi Modded 9x!!!.zip
    File size
    82.18 KiB
    Downloads
    659 downloads
    File comment
    9x version of superpi
    File license
    Fair use/fair dealing exception

Reply 5 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

Thanks Davros!

A friend of mine has the exact same cpu and I benchmarked it. His time was 21,86 seconds so yours is a little bit speedier 😁

I got my GA-5AX results coming up, will edit this post

Edit: I tested a couple different cpu's on my GA-5AX super 7 board. OS is Windows ME

K6-3 233-66, 693.0s
K6-3 250-100, 549,1s
k6-3 400-66, 491,1s
k6-3 400-100, 429,1s
k6-3 400-100 tweaked, 424,3s

k6-2 400-100 cl2, 515,0s
k6-2 400-66, 631,4s
k6-2 400-66 cl2 tweaked, 616,0s

P1mmx 250-100 cl2, 603.2s
p1mmx 233-66, 731,8s
p1mmx 233-66 cl2, 720,1s

Cyrix M2 233-66, 824,5s

Comparing K6-3, P1mmx and Cyrix M2 we get:
K6-3 233-66, 693.0s
p1mmx 233-66, 731,8s
Cyrix M2 233-66, 824,5s

The Cyrix at the same frequency is about 19% slower then a K6-3 and about 13% slower then a P1mmx.
You can clearly see it's FPU is somewhat lacking. I tend to just ignore their PR rating but I'm pretty much expecting their integer performance to be relatively better.

Comparing K6-2 and K6-3 I found the K6-3 about 21% faster then the K6-2. This was measured on a 100mhz fsb.

When lowering the fsb of the K6-3 and K6-2 from 100 to 66, it seems to degrade the performance of the K6-2 by almost 20% while the K6-3 suffers a less then 16% loss in performance by limiting the fsb.
I did measure this with memory set to cl2.

Increasing the fsb for the P1mmx from 66 to 100 and lowering the multi to x2.5 will result in a slight overclock of 7% while superpi performance jumps up by 21%.
Strangely letting the K6-3 go from 233mhz to 250mhz yielded a performance increase of 26%! Quite strange imo, the 256kb cache should normally compensate somewhat for the lower fsb

Reply 7 of 34, by 5u3

User metadata
Rank Oldbie
Rank
Oldbie
Tetrium wrote:

5u3, could you please run superpi on your DX4? I'd like to know if it's as slow as mine 😜

Problem is, I don't have Win9x installed on this machine. Perhaps I'll have time to do it tomorrow. I have to fix this machine anyways, just found out that the L2 cache is unstable at 40 MHz and high temperatures...

Reply 8 of 34, by retro games 100

User metadata
Rank l33t
Rank
l33t

Forgive me for this worthless remark, but I'd love to contribute to this interesting thread, and also to post much more on Vogons, but I don't have access to my retro hardware at the moment as I'm "out of town", and probably won't for some time. 🙁
So, to Tetrium and all other Vogons retro enthusiasts - keep up the excellent work and valuable comments. 😀

Reply 9 of 34, by elianda

User metadata
Rank l33t
Rank
l33t

I think the Mod Version requires a Pentium for the MSR Timer.

And Retro PCs may not have the required memory for the dataset, so you would bench more or less HDD performance.

Anyway here we go:
supi386.jpg

Retronn.de - Vintage Hardware Gallery, Drivers, Guides, Videos. Now with file search
Youtube Channel
FTP Server - Driver Archive and more
DVI2PCIe alignment and 2D image quality measurement tool

Reply 10 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

You ran superpi on windows 3?? Didn't know that was possible.

harddrive may influence results on systems with little ram. One more reason for me to test all 4 16k, 64k, 256k and 1M.

Reply 11 of 34, by elianda

User metadata
Rank l33t
Rank
l33t

Original SuperPi runs on Windows 3. The screenshot shows Windows 3.11 for Workgroups.
The usual modded version were changed that they do not run anymore on Win 3.x. This is a real drawback for benching older systems.

Though optimal would be a DOS version utilizing Unrealmode for larger sets, that can be run from the commandline.

Though back to the known version. SuperPi has no check if its dataset is completely in memory and not swapped out. As you see on the screenshot, if I would double it to 512K set, it would be beyond the free Real memory, resulting in swapping. So 512K and 1M would give results with no FPU performance representation on this machine.

So back to the old systems:
You probably know that Win98 requires a 486DX (FPU included). Win95 runs on a 386 without FPU already, Win 3.x also.
So for programs that utilize the 80x87 and have no own x87-Emulation included these systems trap the x87 Code and redirect the execution to a emulation dll. This was a really common way. This means, that SuperPi, while benching mainly FPU performance runs also without a real FPU.
Though be aware that this takes much longer, maybe ~25 times longer as with f.e. a real 387.

Retronn.de - Vintage Hardware Gallery, Drivers, Guides, Videos. Now with file search
Youtube Channel
FTP Server - Driver Archive and more
DVI2PCIe alignment and 2D image quality measurement tool

Reply 12 of 34, by sliderider

User metadata
Rank l33t++
Rank
l33t++

Interesting. The window above says Pi calculation is done but how can calculation of an irrational number ever be "done"? The last time I checked it has been calculated to several hundred billion places with no end yet in sight.

Reply 13 of 34, by elianda

User metadata
Rank l33t
Rank
l33t

I just wanted to add that the FPU used in the calculation was a 387, no 387DX.
If I apply the same recipe you used for the 1M calc, to this 256K calc I get a score of 63.1
It would be nice to have some systems with 1M and 256K calcs, as the 486DX4 to get some overlap. 1M calculations on slower systems would exceed any senseful timeframe.

I just tried on my 486SX2-66, though it is way slower than the 387-16. (thats on 256K)

Reply 14 of 34, by Anonymous Freak

User metadata
Rank Member
Rank
Member

Interesting. The window above says Pi calculation is done but how can calculation of an irrational number ever be "done"? The last time I checked it has been calculated to several hundred billion places with no end yet in sight.

Obviously, one can never "finish calculating Pi". What SuperPi does is calculate Pi to a certain number of decimal places. By being "done", it means it has completed calculating to that precision. An oft-used number is how long it takes to calculate 1 million places. elianda's screenshot shows how long it takes to calculate to 256,000 decimal places.

One source has it that an Apple II takes 40 hours to calculate to 1000 decimal places. My current workstation takes 41 seconds to do 2 million places; or 3.3 seconds to do the 256,000 places it took elianda's 386 4.5 hours to complete. (And that's only using one of eight cores, as SuperPi is single-threaded.)

Reply 15 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++
elianda wrote:
Original SuperPi runs on Windows 3. The screenshot shows Windows 3.11 for Workgroups. The usual modded version were changed that […]
Show full quote

Original SuperPi runs on Windows 3. The screenshot shows Windows 3.11 for Workgroups.
The usual modded version were changed that they do not run anymore on Win 3.x. This is a real drawback for benching older systems.

Though optimal would be a DOS version utilizing Unrealmode for larger sets, that can be run from the commandline.

Though back to the known version. SuperPi has no check if its dataset is completely in memory and not swapped out. As you see on the screenshot, if I would double it to 512K set, it would be beyond the free Real memory, resulting in swapping. So 512K and 1M would give results with no FPU performance representation on this machine.

So back to the old systems:
You probably know that Win98 requires a 486DX (FPU included). Win95 runs on a 386 without FPU already, Win 3.x also.
So for programs that utilize the 80x87 and have no own x87-Emulation included these systems trap the x87 Code and redirect the execution to a emulation dll. This was a really common way. This means, that SuperPi, while benching mainly FPU performance runs also without a real FPU.
Though be aware that this takes much longer, maybe ~25 times longer as with f.e. a real 387.

Could you please upload your version of superpi for us so we all are using the same versions?
Btw, would it be possible to make a win 3.1 bootdisk and run superpi from there? That could MASSIVELY ease benchmarking, we wouldn't need to install, or swap from another working system, windows everytime we might want to try something new.

I never tried that as I thought superpi wouldn't run on windows 3

For old slower systems I reckon running 256k or maybe just 64k or 16k would suffice. Using superpi to benchmark harddisk write performance is indeed a bad way to calculate FPU performance. I wouldn't have any problems with that.
asmof, I used 16k, 64k, 256k and 1M with all benchmarks that I've run. I only posted the 1M results here 😉

I'll go dig up all results from my 486...

Edit: Here it is
superpidx4totaal.jpg

Edit2:I just remembered another visual bug. In this pic you can see it, It counts the hours correctly but it forgets to detract 60 minutes for every hour it counts.
What I do it, I simply ignore the hours 😉

Reply 16 of 34, by ratfink

User metadata
Rank Oldbie
Rank
Oldbie

Here's my results:

AMD 5x86-75, 32mb ram, Windows 95, Abit BN4 -
22.213 seconds [16K]
113.15 seconds [64K]
709.44 seconds [256K]
3519.7 seconds [1M]

Intel E4300 at 1.8ghz, Windows 7 - 32.594 seconds [1M]

Phenom X4 9350e, Debian 5.0 - 30.741 seconds [1M]
[using version 2.0 of super_pi for linux]

Last edited by ratfink on 2010-07-07, 14:58. Edited 1 time in total.

Reply 17 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++
ratfink wrote:
Here's my results: […]
Show full quote

Here's my results:

AMD 5x86-75, 32mb ram, Windows 95, Abit BN4 -
22.213 seconds [16K]
113.15 seconds [64K]
709.44 seconds [256K]
3519.7 seconds [1M]

Intel E4300 at 1.8ghz, Windows 7 - 32.594 seconds [1M]

Nice formatting!
My 486 in seconds now:

00069,2s [16k]
00374,8s [64k]
02087,1s [256k]
10960,3s [1M]

It's 3 times as slow 🤣! But I know it can probably do better if I fix it.

Reply 18 of 34, by elianda

User metadata
Rank l33t
Rank
l33t

The official superpi version can be grabbed here: ftp://pi.super-computing.org/pub/exec_windows/super_pi.zip
I think SuperPi requires Win32s in Win3.x.

Since numbers are hard to read for comparison, could you do some nice graphs?
You might have to scale the time axis logarithmically. Add the CPU/MHz shortcuts to the data points.
MATLAB, Octave or some similar program will do fine.

SUPI486SX66.GIF

Be aware that the Integer speed of the 486SX2-66 is about 11.5x faster than the 386DX-16.
This are 42601 seconds or a score of 23.5.

So a 486SX2-66 using Integer FPU-Emulation is 2,7x slower than a 386DX-16 with 387-16. Scaling the Integer performance of the 486 to the 386,
then a 386DX-16 with 387-16 FPU would be 30.8x faster than a
386DX-16 using Integer FPU emulation.

Reply 19 of 34, by Tetrium

User metadata
Rank l33t++
Rank
l33t++

Frankly, I have no idea how to make nice little graphs, but it could help make it easier to compare results.
If you know how to make nice graphs, feel free to present us with a nice demonstration of what you have in mind 😀

I still haven't edited my OP yet, will do later tonight as right now I don't have the time.

Edit: I can't reach the superpi website for some reason, I'll look into that when I get back in a couple hours