A brief comparison of 386 FPUs

Reply 120 of 154, by rasz_pl

Posted on 2022-03-22, 15:38

Rank: l33t
Posts: 4685
Joined: 2017-06-04, 00:57

JohnBourno wrote on 2022-03-22, 12:58:

What is interesting is that without an FPU, Duke Nukem 3D sometimes slows down significantly. It drops from 20 frames per second to 7 fps in some scenes. It turns out that Duke3D uses floating-point operations for rendering slopes. So without an FPU, every time you have some slopes in your viewport the CPU has to do all the extra work and the game gets almost unplayable. Just by adding an IIT FPU, the fps only drops to 15 instead of 7. So quite playable.

A 90 MHz NexGen Nx586 drops to 10 fps on the first screen with a sloped roof for that very reason:
https://youtu.be/41O2bNG2qKA?t=234

I'm sure that today someone could come up with a faster integer or lookup-table method of calculating those slopes.
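To give a rough idea of what an integer/lookup-table approach could look like: the sketch below replaces a per-pixel divide with a precomputed fixed-point reciprocal table, so each result costs one lookup, one multiply and one shift. This is not the Build engine's actual slope code (I have not checked how it does it); the 16.16 format, the table size and the names `RECIP`, `fixed_div` and `project_column` are all assumptions made up for illustration.

```python
FRAC_BITS = 16      # assumed 16.16 fixed-point format
TABLE_SIZE = 2048   # hypothetical largest integer depth we need to handle

# RECIP[z] holds round(2**16 / z) as an integer. Index 0 is a placeholder
# so the table can be indexed directly by depth; depth 0 is never valid.
RECIP = [0] + [((1 << FRAC_BITS) + z // 2) // z for z in range(1, TABLE_SIZE)]


def fixed_div(num, z):
    """Approximate num / z with a table lookup, a multiply and a shift."""
    return (num * RECIP[z]) >> FRAC_BITS


def project_column(heights, depths):
    """Scale each height by 1/depth using only integer operations."""
    return [fixed_div(h, z) for h, z in zip(heights, depths)]
```

For small operands the result stays within one unit of a true integer divide, which is usually invisible on a 320x200 screen. Whether this would actually beat the FPU path on a given CPU would have to be measured.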

https://github.com/raszpl/sigrok-disk FM/MFM/RLL decoder
https://github.com/raszpl/FIC-486-GAC-2-Cache-Module (AT&T Globalyst)
https://github.com/raszpl/386RC-16 ram board
https://github.com/raszpl/Zenith_ZBIOS Zenith Z-386 MFM-300 ZBIOS disassembly

Reply 121 of 154, by BitWrangler

Posted on 2022-03-22, 15:45

Rank: l33t++
Posts: 8915
Joined: 2017-10-11, 00:55
Location: Ontario

Next time I find/boot my U5S CPU, I should try it out on DN3D with one of the 387 emulators to see if there's a speed boost.

Unicorn herding operations are proceeding, but all the totes of hens teeth and barrels of rocking horse poop give them plenty of hiding spots.

Reply 122 of 154, by galanopu

Posted on 2022-05-24, 14:11

Rank: Member
Posts: 102
Joined: 2020-10-28, 08:45
Location: EU

So... here is an update.
My 387DX FPU collection has increased by a lot.
All overclocked to the max (up to 55MHz). Enjoy:
https://www.youtube.com/watch?v=Zvisa_-uEqI

Let's mod everything! Check my youtube channel:
https://www.youtube.com/channel/UCZ6ULBqIKhxuNslAbqFNJUg
Interested in my devices? Check my store:
https://migron-electronics.com

Reply 123 of 154, by Ailicec

Posted on 2022-06-02, 03:38

Rank: Newbie
Posts: 34
Joined: 2011-06-12, 20:48

On the 4x4 instruction: you load the 4x4 transformation matrix into the alternate register sets, then leave it resident while you load the fairly small 4x1 vector. Typically the transformation matrix is reused many times, so this saves a huge amount of communication overhead. For the time, a very big win. You also presumably (I haven't tried it) get a pretty good chunk of time during which the FPU is busy and you might be able to do something useful with the integer unit. That is easier than trying to work a couple of integer instructions in between normal FPU instructions.

Often you can get away with a 3x3 matrix for 3D graphics, which would be a bit faster, but they didn't support that.

A downside: operating systems don't know about it, so they don't save the extra register sets. It didn't matter much at the time.
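The usage pattern described above, one matrix kept resident and many small vectors streamed through it, looks roughly like this in plain Python. It only models the data flow, not the IIT instruction or its register banks; `transform_points` and `TRANSLATE` are names invented for this sketch.

```python
def transform_points(matrix, points):
    """Apply one 4x4 matrix (a list of 4 rows) to many 4-component vectors."""
    # "Load" the matrix once; it stays resident for the whole batch.
    rows = [tuple(row) for row in matrix]
    out = []
    for x, y, z, w in points:
        # Only the small 4x1 vector changes per iteration.
        out.append(tuple(a * x + b * y + c * z + d * w for a, b, c, d in rows))
    return out


# Translation by (10, 20, 30) in homogeneous coordinates.
TRANSLATE = [
    [1, 0, 0, 10],
    [0, 1, 0, 20],
    [0, 0, 1, 30],
    [0, 0, 0, 1],
]
```

The translation column is what the fourth row and column buy you. A 3x3 matrix covers rotation and scaling only, which is why it is often enough when translation is handled separately.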

Deunan wrote on 2021-04-27, 08:44:

Personally I see one big problem with the 4x4 operation: it needs all the data to be fed to the FPU and then the results to be read back. AFAIR you need to start with an empty stack and use IIT-specific stack extension instructions, and some of the input arguments are overwritten to store the result. This, coupled with the rather slow CPU-NPU comm channel, limits the usefulness of such instructions. Weitek worked around that by having their NPU register space memory-mapped, at the cost of even lower compatibility with typical x87 code.

Long story short: it took MMX to finally have some direct access to FPU register space, and even that was flawed due to the cost of switching between MMX and x87 modes. SSE finally made the FPU on the x86 family somewhat saner by today's standards.

Reply 124 of 154, by Sphere478

Posted on 2022-12-15, 04:08

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

I'm going to be building a 386, er, '486' system soon, using the Texas Instruments SXL2-66 on the new interposer.

I was wondering what the best FPU to pair with it would be.

This will be the slowest system that I have ever built, so it will be kinda fun new territory for me.

I'll have to post the exact mobo model later as I can't seem to remember it, but I think it is a Shuttle HOT with 16-bit and 8-bit ISA slots. I seem to recall it having an AMD-40 soldered on and a Weitek socket as well as a CPU upgrade socket.

Sphere's PCB projects.
-
Sphere’s socket 5/7 cpu collection.
-
SUCCESSFUL K6-2+ to K6-3+ Full Cache Enable Mod
-
Tyan S1564S to S1564D single to dual processor conversion (also s1563 and s1562)

Reply 125 of 154, by pshipkov

Posted on 2022-12-15, 07:33

Rank: l33t
Posts: 2293
Joined: 2018-10-11, 05:08

For a base frequency of up to 40-45 MHz, that would be a combination of a Weitek Abacus 3167 plus either a Cyrix FasMath (gray top) or a ULSI DLC.
The ULSI is faster only in Quake 1; the FasMath is faster at everything else.
At a 50 MHz base you will most likely be going with a Cyrix FasMath (black top) or a ULSI DLC. No idea if the 3167 can manage that speed.

retro bits and bytes | DOS media library

Reply 126 of 154, by Sphere478

Posted on 2022-12-15, 07:38

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

The SXL2-66 CPU might be operating at as much as 80 MHz.

Reply 127 of 154, by pshipkov

Posted on 2022-12-15, 08:11

Rank: l33t
Posts: 2293
Joined: 2018-10-11, 05:08

Looks like specific units can hit 90.
At that frequency they can start encroaching on BL3@100.

Reply 128 of 154, by kixs

Posted on 2022-12-15, 10:50

Rank: l33t
Posts: 3596
Joined: 2013-01-31, 02:08
Location: Slovenia

Sphere478 wrote on 2022-12-15, 04:08:

I'm going to be building a 386, er, '486' system soon, using the Texas Instruments SXL2-66 on the new interposer.

I was wondering what the best FPU to pair with it would be.

This will be the slowest system that I have ever built, so it will be kinda fun new territory for me.

I'll have to post the exact mobo model later as I can't seem to remember it, but I think it is a Shuttle HOT with 16-bit and 8-bit ISA slots. I seem to recall it having an AMD-40 soldered on and a Weitek socket as well as a CPU upgrade socket.

For an x2 CPU I'd use an x2 FPU, like the ULSI 66 or IIT 50.

Visit my AmiBay items for sale (updated: 2026-03-23). I also take requests 😉
https://www.amibay.com/members/kixs.977/#sales-threads

Reply 129 of 154, by pshipkov

Posted on 2022-12-15, 15:28

Rank: l33t
Posts: 2293
Joined: 2018-10-11, 05:08

Don't know if the (very rare) 2x models can tick reliably at 80, 90 or 100 MHz.
Also, IIT 387 FPUs are actually very slow.

Reply 130 of 154, by feipoa

Posted on 2022-12-15, 20:37

Rank: l33t++
Posts: 10506
Joined: 2011-03-07, 13:54
Location: Canada

Sphere, a run-of-the-mill black-top Cyrix FasMath at 40 MHz is your best bet for an SXL2 at 80 MHz. There is only a benefit to the ULSI DX2-66 if you are running your SXL2 at 66 MHz. I doubt the ULSI DX2 can do 80 MHz.
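The clock arithmetic behind that, as a small sketch. It assumes the SXL2 doubles the bus clock internally, that a normal 387 runs at the bus clock, and that an x2 FPU doubles it again; those are assumptions read from the posts above, and the function names are made up for illustration.

```python
def fpu_clock_mhz(cpu_core_mhz, cpu_multiplier=2, fpu_multiplier=1):
    """Internal FPU clock for a clock-multiplied CPU sharing the bus clock."""
    bus_mhz = cpu_core_mhz / cpu_multiplier   # assumed: FPU socket sees the bus clock
    return bus_mhz * fpu_multiplier


def overclock_percent(actual_mhz, rated_mhz):
    """How far above its rating a part would be running, in percent."""
    return (actual_mhz / rated_mhz - 1.0) * 100.0


# SXL2 at 80 MHz with a plain 40 MHz FasMath: FPU runs at its rated 40 MHz.
fasmath_clock = fpu_clock_mhz(80)
# SXL2 at 80 MHz with a doubled FPU rated for 66 MHz: FPU core would see 80 MHz.
x2_fpu_clock = fpu_clock_mhz(80, fpu_multiplier=2)
x2_fpu_overclock = overclock_percent(x2_fpu_clock, 66)
```

Under those assumptions the doubled FPU would be running about 21% over its rating at an 80 MHz CPU clock, while the plain 40 MHz part stays in spec.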

Plan your life wisely, you'll be dead before you know it.

Reply 131 of 154, by Sphere478

Posted on 2022-12-15, 23:25

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

Here is the intended motherboard

Reply 132 of 154, by Anonymous Coward

Posted on 2022-12-16, 14:20

Rank: l33t++
Posts: 5061
Joined: 2008-03-20, 05:37
Location: Shandong, China

Nice forex.

"Will the highways on the internets become more few?" -Gee Dubya
V'Ger XT|Upgraded AT|Ultimate 386|Super VL/EISA 486|SMP VL/EISA Pentium

Reply 133 of 154, by Sphere478

Posted on 2022-12-16, 18:04

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

So the ULSI DX2 is faster but can't do 80 MHz.

So a 40 MHz black top is the fastest option for an overclocked SXL2,

but for a stock-speed SXL2 the fastest would be the ULSI DX2?

What would be the fastest overall combo?

ULSI and SXL2 at 75 MHz?

I’m going for maximum upgrade here.

Reply 134 of 154, by feipoa

Posted on 2022-12-17, 14:44

Rank: l33t++
Posts: 10506
Joined: 2011-03-07, 13:54
Location: Canada

I think it is best to forget about the ULSI DX2-66. This FPU takes rare to a new level. Even if the ULSI DX2-66 ran at 75 MHz, which is doubtful, an SXL2 system at 80 MHz coupled with a Cyrix FasMath at 40 MHz will be faster.

Reply 135 of 154, by Sphere478

Posted on 2022-12-17, 17:56

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

And it will for sure work in that mobo? It says Weitek?

Reply 136 of 154, by Sphere478

Posted on 2022-12-17, 18:03

Rank: l33t++
Posts: 6004
Joined: 2021-01-13, 04:45

Got one of these. Is it the correct one?

Reply 137 of 154, by feipoa

Posted on 2022-12-17, 22:23

Rank: l33t++
Posts: 10506
Joined: 2011-03-07, 13:54
Location: Canada

It can use a Weitek or a regular 387 FPU, but the Weitek only works for very select applications.

Reply 138 of 154, by galanopu

Posted on 2022-12-17, 23:02

Rank: Member
Posts: 102
Joined: 2020-10-28, 08:45
Location: EU

Sphere478 wrote on 2022-12-17, 18:03:

Got one of these. Is it the correct one?

Check my last video again.
Assuming testing at the same clock, the fastest FPU is the Cyrix -KN.
No question about that. This one is a bit rare though.

Now benches are nice, but in practice 387 FPUs are a bit pointless these days.
Games that need one just run too slowly on a system of this class.
CAD programs are just too old and pointless.

So in the end, you will not really notice a 5% difference in FPU performance.
And the Cyrix you got is also a good one.

Reply 139 of 154, by Anonymous Coward

Posted on 2022-12-18, 15:11

Rank: l33t++
Posts: 5061
Joined: 2008-03-20, 05:37
Location: Shandong, China

The "KN" is the older design with the grey top, right? Don't those have compatibility issues with Cyrix's own SLC/DLC CPUs?
How about a CX83D87-40HP? Those are also very rare. I assume the only difference from the standard black top is just the packaging, but has anyone ever benched one?
