Luna::Simd::load_f4

float4 load_f4(f32 const *mem_addr)

Loads 128-bits (composed of 4 packed single-precision (32-bit) floating-point elements) from memory into dst.

dst.x := MEM[mem_addr:mem_addr+31]
dst.y := MEM[mem_addr+32:mem_addr+63]
dst.z := MEM[mem_addr+64:mem_addr+95]
dst.w := MEM[mem_addr+96:mem_addr+127]

Valid Usage

  • mem_addr must be aligned on a 16-byte boundary or a general-protection exception may be generated.