[libre-riscv-dev] [isa-dev] FP reciprocal sqrt extension proposal
huffman at cadence.com
Thu Jul 11 18:25:04 BST 2019
Avoiding doing both a square root and a divide to get this is a
worthwhile goal. I do wonder if it's better to have a slightly looser
accuracy requirement. The instruction can be considerably faster if
it's required to be within 1-ulp than if it's required to be correctly
rounded. On the other hand, that means different implementations get
different answers, which is not so good.
On 7/11/19 3:45 AM, Jacob Lifshay wrote:
> I propose a Zfrsqrt extension that consists of the frsqrt.s, frsqrt.d,
> frsqrt.q, and frsqrt.h instructions, where the 32-bit, 64-bit,
> 128-bit, and 16-bit versions require the corresponding F, D, Q, etc.
> If only the F and Zfrsqrt extensions are supported, then only the
> frsqrt.s instruction is supported.
> If only the F, D, and Zfrsqrt extensions are supported, then only the
> frsqrt.s and the frsqrt.d instructions are supported. Likewise for
> frsqrt.q and frsqrt.h requiring the corresponding extensions enabled.
> The operation implemented by frsqrt.* is a correctly-rounded
> implementation of the IEEE 754-2008 rSqrt operation, with all the
> usual FP rounding modes supported.
> For the encoding, I think using an encoding similar to both the
> fsqrt.* and fdiv.* encodings is a good idea, since frsqrt is similar
> to both fdiv and fsqrt; Therefore, as an initial proposal, I think
> using a funct7 value of 0111100 and the rest of the instruction
> identical to fsqrt is a good idea, since, as far as I'm aware, that
> doesn't conflict with anything currently.
> We (libre-riscv.org) are currently planning on implementing frsqrt in
> our libre GPU, since frsqrt is a common operation in 3D graphics (used
> for vector normalization, among other things).
> Comments, modifications, etc. welcome.
> Jacob Lifshay
More information about the libre-riscv-dev