c - Flipping sign on packed SSE floats -

- May 15, 2015

I am looking for the most efficient way to flip the sign on all the four floats packed in the SSE register.

I have not found any internal input to do this in the Intel Architecture Software Dev manual. Below are the things that I have already tried

For each case, I omitted the code 10 billion times and indicated the timing of the wall I am trying at least 4 seconds , It takes my non-SIM approach, which is just using the unique minus operator.

[48 seconds]
_mm_sub_ps (_mm_setzero_ps) (), Vec);

[32 seconds]
_mm_mul_ps (_mm_set1_ps (-1.0F), VCC);

[9 seconds]

Union negative mask {int intRep; Float flip rip; } NGMsk; NegMask.intRep = 0x80000000; _mm_xor_ps (_mm_set1_ps (negmask.fltRep), vec);

The compiler is with GCC 4.2-O3. The CPU is an Intel Core 2 Duo.

Just about the underlying vectors to complete their answer through the GCC documentation:

  This type of defined type can be used with a subset of normal operation. At present, GCC will allow the following operators to use these types: `+, - *, /, unary minus, ^, |, And, ~ '

Possible when it is possible to always paste it is a good idea The will of the common GCC always the most efficient code for the SSE stuff.

For your compiler options, add something more specific to your architecture, such as -march = native in most cases.

Search This Blog

Add s econ

c - Flipping sign on packed SSE floats -

Comments

Post a Comment

Popular posts from this blog

paypal - How to know the URL referrer in PHP? -

oauth - Facebook OAuth2 Logout does not remove fb_ cookie -

wpf - Line breaks and indenting for the XAML of a saved FlowDocument? -