Just some updates here:
I tinkered a bit with the "port.exe" tool to optimize memory and cache performance. (I didn´t care about PCI-performance so far.)
To keep it simple I run
1. "port.exe -c0x50=0x88"
2. "port.exe -c0x5e=0x80"
to enable WB L1 and WB L2 with one Tag-bit for dirty TAG.
and
3. "port.exe -c0x22=0x40" for the tightest RAS/CAS settings possible.
When I add
4. "port.exe -c0x51=0xc" (for 256K L2 cache, double bank, 2-1-1-1/2-1-1-1)
for fastest L2 cache timings there´s no improvement, so I think BIOS sets "2-1-1-1 / 2-1-1-1" by itself for 33MHz when "Turbo" is set in BIOS.
CTCM, Speedsys and Cachecheck results:
The attachment 486_vip_io1.JPG is no longer available
The attachment 486_vip_io2a.JPG is no longer available
The attachment 486_vip_io3.JPG is no longer available
This seems to be the limit that can be reached with this chipset. The results for this VIA "VT82C486A" mainboard are identical with those on mainboards with the later "VT82C496G".
I get the same results with my Jetway "J-433-A"
Faster FSBs and CPU speeds improve L1 performance but heavily decrease L2 and memory throughput.
Oh, and a big Thank You to Yftul for providing the tool for setting the chipset registers!