DOSBox-X branch

Reply 1740 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 08:57

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

I'm building a binary now. Thanks for the tip.

Reply 1741 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:00

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

I added fpu_control.h from your web link. It is not in mingw32, at least in a usual directory.

Reply 1742 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 09:01

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

hail-to-the-ryzen wrote:
I added fpu_control.h from your web link. It is not in mingw32, at least in a usual directory.

Hm... perhaps I should just copy the macro from the header then. MinGW Windows is one of the build targets I make for DOSBox-X.

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1743 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:04

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

I also commented out the include for features.h file in fpu_control.h. It seems to build ok and I didn't see any reference to definitions outside fpu_control.h.

Reply 1744 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:09

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

It's built and the log shows the correct fpu header file was included:

1 FPU:FPU core: long double FPU
2 FPU:FPU32 selftest passed
3 FPU:FPU64 selftest passed
4 FPU:FPU80 selftest passed

I tested the Explora demo and have same result as you reported. I will test Quake next.

Reply 1745 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:15

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

Quake seems to work fine. Compared to the previous non-x86 fpu core in speed, it seems the cost of long doubles is low, perhaps ~5%.

Reply 1746 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 09:17

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

hail-to-the-ryzen wrote:
That is a good plan. It may work to only adapt a small set of fpu instructions for long double precision. Testing with the x86 f […]
Show full quote
That is a good plan. It may work to only adapt a small set of fpu instructions for long double precision. Testing with the x86 fpu code shows the rendering artifacts are reproduced by reduced precision in the following two functions (tested against the beginning of the Explora demo):
1diff -rupN dosbox-Orig//src/fpu/fpu_instructions_x86.h dosbox//src/fpu/fpu_instructions_x86.h
2--- dosbox-Orig//src/fpu/fpu_instructions_x86.h
3+++ dosbox//src/fpu/fpu_instructions_x86.h
4@@ -1121,11 +1121,15 @@ static void FPU_FDIVR_EA(Bitu op1){
5 }
6 
7 static void FPU_FMUL(Bitu op1, Bitu op2){
8+	FPU_SetCW(0x37F);
9 	FPUD_ARITH1(fmulp)
10+	FPU_SetCW(0x3FF);
11 }
12 
13 static void FPU_FMUL_EA(Bitu op1){
14+	FPU_SetCW(0x37F);
15 	FPUD_ARITH1_EA(fmulp)
16+	FPU_SetCW(0x3FF);
17 }
18 
19 static void FPU_FSUB(Bitu op1, Bitu op2){

I have some questions about those values.

According to this source: http://home.agh.edu.pl/~amrozek/x87.pdf

Bits 6-7 are not defined (0x7F and 0xFF), and bits 9-8 define the precision (0x300), so how does that affect the demo exactly?

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1747 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:19

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

It should have changed bit 9 to 1 from 0, so bit 8 and 9 are 11 for extended precision.

Edit: 0x37f should reflect doubles and 0x3ff flipping the bit for long doubles, at least from my binary to hex calculator. :}

Reply 1748 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 09:47

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

The difference between 0x37F and 0x3FF is bit 7 (0x80), which is not listed to contain anything.

The latest commit adds code to update the FPU control word for ADD, SUB, MUL, DIV, which in the long double FPU code seems to fix the Explora glitches.

Adding the same to the non long-double non-x86 FPU code didn't fix anything. How can it when FPU operations in that code always truncate to 53-bit mantissa (64-bit) precision?

EDIT: However the control word code is probably not going to compile or work properly on my Raspberry Pi (arm7) so further changes will need to be done.

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1749 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:51

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

My calculation must have been off, so it must have changed another fpu parameter, such as rounding. 🙁

It does suggest, however, that the multiply is the main factor, but it would have to be confirmed (although not needed now).

Thank you for the commit! I'm building a binary now to test.

Note I added the 0x37F and 0x3FF to the x86 FPU code to test, not the non-x86 code. I was trying to cause the rendering artifact - is that what you asked?

Reply 1750 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 09:55

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

When using the x86 FPU code, the only test to fail in Intel's i387 program is the transcendental test (SIN, COS, TAN, etc if I remember correctly).

EDIT: In the comprehensive test, the only failure is 35 cases of the "Scale" test (FSCALE instruction?)

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1751 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:58

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

That's interesting on the trigonometry functions! I also verified that I counted bit 7 incorrectly. So, I caused the fpu to reset to doubles both before and after instead of just before. However, it showed that the fmul is the reason, or at least one of the causes of the artifacts.

Reply 1752 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 09:59

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

Does PCem show the same problem with the Scale test?

Reply 1753 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 10:01

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

hail-to-the-ryzen wrote:
Does PCem show the same problem with the Scale test?

That's a test someone else here is going to have to do, I don't have PCem here on Linux.

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1754 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 10:02

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

I'll try it. Can you point me to the software? Also, verified that Explora runs well. 😀

Edit: I found it.

Reply 1755 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 10:04

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

TheGreatCodeholio wrote:
jmarsh wrote:
The problem may not be related to just the difference in precision, fpu_instructions.h is a very minimal x87 implementation that doesn't always take FP exceptions when it should or set flags correctly for many instructions. Using "long double" won't fix that.

Intel made a utility to test for correct x87 operation: https://winworldpc.com/product/386sx-math-coprocess/10
Dosbox fails every test unless using fpu_instructions_x86.h, which is obviously limited to (32-bit) x86 only as it uses real x87 instructions.

That's a good find, I actually have an old 386SX system I acquired some time ago where someone had installed the i387 in it, I should try that utility on it.

Link in the comment.

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1756 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 10:13

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

Both (custom builds) of pcem and dosbox-x (interpreter cpu cores) are showing many errors in all categories.

Edit: core=dynamic is not generating those errors. In fact no errors that way.

Reply 1757 of 2419, by TheGreatCodeholio

Posted on 2019-02-04, 10:19

TheGreatCodeholio Offline

Rank Oldbie

Rank: Oldbie
Posts: 819
Joined: 2011-08-18, 20:15
Location: Seattle, WA

It seems anything other than C_FPU_X86 with DOSBox-X fails the tests completely.

DOSBox-X project: more emulation better accuracy.
DOSLIB and DOSLIB2: Learn how to tinker and hack hardware and software from DOS.

Reply 1758 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 10:21

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

It doesn't make sense that it is correct. Perhaps another software to use for testing?

Reply 1759 of 2419, by hail-to-the-ryzen

Posted on 2019-02-04, 10:22

hail-to-the-ryzen Offline

Rank Member

Rank: Member
Posts: 441
Joined: 2017-03-09, 01:34

Then are the scaler errors from 64 bit versus 32 bit build? Is that possible?

Main menu

Topic actions

Reply 1740 of 2419, by hail-to-the-ryzen

Reply 1741 of 2419, by hail-to-the-ryzen

Reply 1742 of 2419, by TheGreatCodeholio

Reply 1743 of 2419, by hail-to-the-ryzen

Reply 1744 of 2419, by hail-to-the-ryzen

Reply 1745 of 2419, by hail-to-the-ryzen

Reply 1746 of 2419, by TheGreatCodeholio

Reply 1747 of 2419, by hail-to-the-ryzen

Reply 1748 of 2419, by TheGreatCodeholio

Reply 1749 of 2419, by hail-to-the-ryzen

Reply 1750 of 2419, by TheGreatCodeholio

Reply 1751 of 2419, by hail-to-the-ryzen

Reply 1752 of 2419, by hail-to-the-ryzen

Reply 1753 of 2419, by TheGreatCodeholio

Reply 1754 of 2419, by hail-to-the-ryzen

Reply 1755 of 2419, by TheGreatCodeholio

Reply 1756 of 2419, by hail-to-the-ryzen

Reply 1757 of 2419, by TheGreatCodeholio

Reply 1758 of 2419, by hail-to-the-ryzen

Reply 1759 of 2419, by hail-to-the-ryzen