[libre-riscv-dev] [isa-dev] Re: FP transcendentals (trigonometry, root/exp/log) proposal
lkcl
luke.leighton at gmail.com
Tue Aug 13 05:32:55 BST 2019
On Tuesday, August 13, 2019 at 2:05:23 AM UTC+1, MitchAlsup wrote:
>
>
>
> The alternative is to designate a few OpCodes in a sequence as a single
> result producer, with the intermediate result kept larger than register
> width and fed back to the in-sequent instruction (preserving accuracy.)
>
yehyeh, i totally get it / like it, as a design concept. the implications
however are for context-switches (which you can "fudge" in a dedicated GPU
as it's not going to be dealing with other general-purpose tasks, but you
can't really do in a Hybrid CPU/GPU), the intermediate registers would need
saving/restoring.
that in turn puts pressure on the register file size, which is already big
for a GPU. or requires a "special" register file, other than FP, INT (and
in some cases V as well). which in turn requires _more_ opcodes to... yeh,
you get the idea :)
which is why i said that starting from RISC-V, for GPU purposes, isn't
necessarily the smartest thing to do / be constrained by.
ho hum :)
l.
More information about the libre-riscv-dev
mailing list