Multiply the signed values in xs1 and xs2, and store the upper half of the double-width product in xd. The lower half is discarded.

If both the upper and lower halves are needed, the suggested sequence is:
---
mulh xdh, xs1, xs2
mul xdl, xs1, xs2
---
Microarchitectures may detect this sequence and fuse the two operations into a single multiply.
mulh xd, xs1, xs2
Type: R
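
The following is a minimal Python sketch of the behavior described above, not a normative definition. It models register contents as unsigned bit patterns of width `xlen`. The helper names `to_signed`, `mulh`, and `mul`, and the `xlen` parameter with its default of 64, are assumptions made for illustration only.

```python
def to_signed(value: int, xlen: int) -> int:
    """Interpret the low xlen bits of value as a two's-complement integer."""
    value &= (1 << xlen) - 1
    return value - (1 << xlen) if value >> (xlen - 1) else value


def mulh(xs1: int, xs2: int, xlen: int = 64) -> int:
    """Upper xlen bits of the signed product, returned as a bit pattern."""
    product = to_signed(xs1, xlen) * to_signed(xs2, xlen)
    # Python's >> on a negative int is an arithmetic shift, so the sign
    # of the double-width product carries into the upper half.
    return (product >> xlen) & ((1 << xlen) - 1)


def mul(xs1: int, xs2: int, xlen: int = 64) -> int:
    """Lower xlen bits of the product (identical for signed and unsigned)."""
    return (xs1 * xs2) & ((1 << xlen) - 1)


if __name__ == "__main__":
    a = 0xFFFF_FFFF_FFFF_FFFF  # -1 when read as a signed 64-bit value
    b = 0x0000_0000_0000_0002  # 2
    hi, lo = mulh(a, b), mul(a, b)
    # Concatenating the halves gives the full 128-bit product (-2 here).
    print(hex(hi), hex(lo), to_signed((hi << 64) | lo, 128))
```

In this sketch, calling `mulh` and `mul` on the same operands mirrors the two-instruction sequence: the results play the roles of xdh and xdl, and together they hold the full double-width product.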