[libre-riscv-dev] [isa-dev] Re: FP transcendentals (trigonometry, root/exp/log) proposal
lkcl
luke.leighton at gmail.com
Sun Sep 8 15:56:00 BST 2019
https://gitlab.freedesktop.org/panfrost/mali-isa-docs/blob/master/Midgard.md
Transcendental opcodes in ARM's MALI ISA include
E8 - fatan_pt2
F0 - frcp (reciprocal)
F2 - frsqrt (inverse square root, 1/sqrt(x))
F3 - fsqrt (square root)
F4 - fexp2 (2^x)
F5 - flog2
F6 - fsin
F7 - fcos
F9 - fatan_pt1
pt1 stands for "pi times 1" and pt2 should br obvious.
NVIDIA CUDA transcendentals
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#mathematical-functions-appendix
Further internal documentation is hard to find.
AMD VEGA
http://developer.amd.com/wordpress/media/2017/08/Vega_Shader_ISA_28July2017.pdf
sin, cos, exp, log, rcp, rsqrt, all in FP16/32/64.
Intesestingly no tan, atan, arc or hypot.
If these are standard opcodes in commercial GPUs, requesting specific and individual quantitative analysis is puzzling in the extreme. Their inclusion is so obviously critical for commercial success in the field of 3D and HPC that it is the equivalent of asking for quantitative analysis of "integer add" or "mul" for a DSP.
L.
More information about the libre-riscv-dev
mailing list