[libre-riscv-dev] [isa-dev] Re: FP transcendentals (trigonometry, root/exp/log) proposal

lkcl luke.leighton at gmail.com
Sun Sep 8 15:56:00 BST 2019


https://gitlab.freedesktop.org/panfrost/mali-isa-docs/blob/master/Midgard.md

Transcendental opcodes in ARM's MALI ISA include:

E8 - fatan_pt2
F0 - frcp (reciprocal)
F2 - frsqrt (inverse square root, 1/sqrt(x))
F3 - fsqrt (square root)
F4 - fexp2 (2^x)
F5 - flog2
F6 - fsin
F7 - fcos
F9 - fatan_pt1

pt1 stands for "pi times 1" and pt2 should be obvious.
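
For anyone wanting to cross-reference the list above programmatically, here is a minimal sketch of it as a lookup table. The opcode values and mnemonics are copied from the list; the dict name and the mnemonic() helper are hypothetical, invented purely for illustration, and are not part of any Panfrost tooling.

```python
# Opcode -> mnemonic, copied from the Midgard list above.
# MIDGARD_TRANSCENDENTAL_OPS and mnemonic() are hypothetical names
# used only for this illustration.
MIDGARD_TRANSCENDENTAL_OPS = {
    0xE8: "fatan_pt2",
    0xF0: "frcp",    # reciprocal
    0xF2: "frsqrt",  # inverse square root, 1/sqrt(x)
    0xF3: "fsqrt",   # square root
    0xF4: "fexp2",   # 2^x
    0xF5: "flog2",
    0xF6: "fsin",
    0xF7: "fcos",
    0xF9: "fatan_pt1",
}


def mnemonic(opcode: int) -> str:
    """Return the mnemonic for an opcode, or a placeholder if it is not listed."""
    return MIDGARD_TRANSCENDENTAL_OPS.get(opcode, f"unknown_{opcode:02X}")
```

mnemonic(0xF6) gives "fsin"; any opcode not in the list above comes back as "unknown_XX".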

NVIDIA CUDA transcendentals

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#mathematical-functions-appendix

Further internal documentation is hard to find.

AMD VEGA

http://developer.amd.com/wordpress/media/2017/08/Vega_Shader_ISA_28July2017.pdf

sin, cos, exp, log, rcp, rsqrt, all in FP16/32/64.

Interestingly, no tan, atan, arc or hypot.
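
As a rough illustration of how the missing functions could be composed from the primitives that are listed, here is a sketch in Python. It assumes rcp and rsqrt behave like 1/x and 1/sqrt(x), uses the standard math module as a stand-in for the hardware sin/cos, and all function names are hypothetical. It makes no claim about how AMD's shader compiler actually handles this, and the naive hypot ignores overflow and underflow of x*x + y*y.

```python
import math


def rcp(x: float) -> float:
    """Stand-in for a hardware reciprocal opcode (assumed to compute 1/x)."""
    return 1.0 / x


def rsqrt(x: float) -> float:
    """Stand-in for a hardware inverse-square-root opcode (assumed 1/sqrt(x))."""
    return 1.0 / math.sqrt(x)


def tan_from_sin_cos(x: float) -> float:
    """tan(x) composed as sin(x) * rcp(cos(x))."""
    return math.sin(x) * rcp(math.cos(x))


def hypot_from_rsqrt(x: float, y: float) -> float:
    """hypot(x, y) composed as s * rsqrt(s), where s = x*x + y*y.

    Naive: no scaling, so s can overflow or underflow for extreme inputs.
    """
    s = x * x + y * y
    if s == 0.0:
        return 0.0  # avoid dividing by zero inside rsqrt
    return s * rsqrt(s)
```

Each composite costs several dependent operations where a dedicated opcode would need one.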

If these are standard opcodes in commercial GPUs, requesting a specific, individual quantitative analysis of each one is puzzling in the extreme. Their inclusion is so obviously critical for commercial success in 3D and HPC that it is equivalent to asking for a quantitative analysis of "integer add" or "mul" for a DSP.

L.

