Multiply the unsigned values in xs1 to xs2, and store the upper half of the result in xd. The lower half is thrown away. If both the upper and lower halves are needed, it suggested to use the sequence: --- mulhu xdh, xs1, xs2 mul xdl, xs1, xs2 --- Microarchitectures may look for that sequence and fuse the operations.
mulhu xd, xs1, xs2
Type: R