Here is the analyse of my problem from ChatGPT, I don't know if it's true but could be an explanation :
---
🧩 Summary: Low “Float iSSE” Memory Bandwidth on Pentium III Coppermine + Intel 440BX
I was testing an old PC with a Pentium III 800 MHz (Coppermine), GA-6BXC motherboard (Intel 440BX chipset), and 256 MB SDRAM CAS2.
In SiSoft Sandra 2002 and 2004, the memory benchmark shows normal “Integer” bandwidth, but the “Float iSSE” score is about 40 % lower than expected.
Older Sandra 2001 (which doesn’t use SSE in its memory test) reports normal and balanced Integer/Float results.
After checking everything — BIOS cache settings, RAM timings, chipset drivers — the cause turned out to be the 440BX chipset itself, not the CPU or RAM.
---
🔍 What’s Really Happening
Starting with Sandra 2002, the memory benchmark uses SSE (iSSE) routines that move data in 128-bit blocks (MOVAPS/MOVNTPS).
The 440BX memory controller, designed before SSE existed, only optimizes 64-bit burst transfers and doesn’t handle 128-bit SSE memory access patterns efficiently.
As a result, SSE memory throughput (the “Float iSSE” test) is about 30–40 % slower than expected.
The Integer test is fine, because it still uses regular 64-bit memory operations which the 440BX handles perfectly.
---
🧠 Why Real-World Performance Isn’t Affected
Games and real applications don’t stream large blocks of floats directly from main memory like Sandra’s synthetic test.
Most data fits in the L1/L2 cache, and game engines use SSE arithmetic on cached data, not on big raw memory transfers.
So this bandwidth drop is only visible in synthetic benchmarks, not in actual 3D or physics performance.
---
⚙️ Katmai vs. Coppermine
With a Pentium III Katmai, Sandra doesn’t use full 128-bit SSE memory moves (the Katmai only supports a limited SSE subset).
→ Memory results are normal.
With a Coppermine, full SSE 128-bit instructions are used.
→ The 440BX can’t burst them efficiently, so the “Float iSSE” score drops.
---
✅ Conclusion
It’s not a defect — just a chipset limitation.
The 440BX predates full SSE support and can’t optimize 128-bit memory transfers.
Everything works as it should; it only affects Sandra’s “Float iSSE” synthetic score.
Newer chipsets like the i815E or VIA Apollo Pro 133A don’t show this issue.
DOS/98SE : PIII-S 1400/C3 1200, GA-6BXC 2.0, 256MB PC133 C2, GeForce 2 Ultra, Voodoo2 12MB, Audigy1 Platinum Ex, ESS1688, SSD 128GB
XP/7 : i5 3750k, 8GB DDR3 1600, GTX 280 1GB, Radeon X1950 XTX, SSD 512GB