[Libre-soc-bugs] [Bug 858] SVP64 Primer Documentation
bugzilla-daemon at libre-soc.org
bugzilla-daemon at libre-soc.org
Sat Jun 18 11:21:04 BST 2022
https://bugs.libre-soc.org/show_bug.cgi?id=858
--- Comment #10 from Luke Kenneth Casson Leighton <lkcl at lkcl.net> ---
important features and benefits to mention:
* The v3.1 Specification is not altered in any way.
* Predication, an often-requested feature, is added cleanly to the
Power ISA (without modifying the v3.1 Power ISA)
* In-registers arbitrary-sized Matrix Multiply is achieved in three
instructions (without adding v3.1 Power ISA instructions)
* Full DCT and FFT RADIX2 Triple-loops are achieved with dramatically
reduced instruction count, and power consumption expected to greatly
reduce. Normally found only in high-end VLIW DSPs (TI MSP, Qualcomm
Hexagon)
* Fail-First Load/Store allows strncpy to be implemented in around 14
instructions (Optimised VSX assembler is 240).
* Inner loop of MP3 implemented in under 100 instructions
(gcc produces 450 for the same function)
All areas investigated so far consistently showed reductions in executable
size, which as outlined in {SIMD_HARM} has an indirect reduction in
power consumption due both to less I-Cache/TLB pressure and Issue remaining
idle.
--
You are receiving this mail because:
You are on the CC list for the bug.
More information about the libre-soc-bugs
mailing list