Luna::Simd::scaleadd_f4
float4 scaleadd_f4(float4 a, f32 b, float4 c)
Scales packed single-precision (32-bit) floating-point elements in a
using one single-precision (32-bit) floating-point element b
, add the intermediate result to packed elements in c
, and stores the results in dst
.
dst.x = (a.x * b) + c.x
dst.y = (a.y * b) + c.y
dst.z = (a.z * b) + c.z
dst.w = (a.w * b) + c.w