pctechguide.com

  • Home
  • Guides
  • Tutorials
  • Articles
  • Reviews
  • Glossary
  • Contact

SSE4 – Intel’s enhanced multimedia focussed CPU instruction set

SSE4 is a set of instructions released in conjunction with Intel’s Penryn processor. SSE4, built upon the Intel 64 Instruction Set Architecture, represented Intel’s first major change to its instruction set for some time, and followed smaller changes introduced (in the guise of SSE3) with the Prescott (horizontal add/subtract) and Core 2 Duo (absolute value and double-width align) processors.

Intel believes that SSE4 offers the greatest change to the x86 instruction set in five years and allows the Penryn clock to run at higher frequencies than its Core 2 parents but within the same cool thermal envelope. While this only benefits applications that are able to use SSE4 (like media encoding), the speed improvements are reported to be in the order of 40%.

There are around 50 new instructions in the SSE4 set, the majority of which are instructions to utilize parallelised code and data-structures, making it easier to take full advantage the Penryn’s multi-core processors and its multi-threading capability.

SSE is an acronym for ‘Streaming SIMD Extensions’. The general concept behind these instructions is to combine certain common operations into one smooth operation: rather than a series of x instructions required for, say, discovering the dot product of two vectors, SSE provides one dedicated instruction. SSE reduces complex operations into native instructions, and this can greatly improve the efficiency of the processor in certain applications.

SSE4 made 47 new instructions available with the Penryn processor. Most of the new instructions are related to vector operations, which are the staple of graphics and multimedia processing. Also included are primitives that increase the speed of streaming and improves access to device memory. Intel groups the instructions into two sets: ‘Vectorizing Compiler and Media Accelerators’ and ‘Efficient Accelerated String and Text Processing’. The table below is summarised from the Intel SSE4 Programming Reference, and provides a brief summary of the new instructions and their benefits:

Instruction Description
BLENDPD, BLENDPS, BLENDVPD, BLENDVPS, PBLENDVB, PBLENDW Blend Packed Double and Single Precision Floating-Point Values
CRC32 Accumulate CRC32 Value
DPPD, DPPS Dot Product of Packed Double and Single Precision Floating-Point Value
EXTRACTPS, INSERTPS Extract and Insert Packed Single Precision Floating-Point Value
MOVNTDQA Load Double Quadword Non-Temporal Aligned Hint
MPSADBW Compute Multiple Packed Sums of Absolute Difference
PACKUSDW Pack with Unsigned Saturation
PCMPESTRI, PCMPISTRI Packed Compare Explicit and Implicit Length Strings, Return Index
PCMPESTRM, PCMPISTRM Packed Compare Explicit and Implicit Length Strings, Return Mask
PCMPEQQ, PCMPGTQ Compare Packed Data For Equal or Greater Than
PEXTRB, PEXTRD/PEXTRQ, PEXTRW Extract Byte, Dword/Qword, and Word
PHMINPOSUW Packed Horizontal Word Minimum
PINSRB, PINSRD/PINSRQ Insert Byte and Dword/Qword
PMAXSB, PMAXSD, PMAXUD, PMAXUW, PMINSB, PMINSD, PMINUD, PMINUW Find Minimum and Maximum of Packed Signed, Unsigned, Dword and Word-length Integers
PMOVSX, PMOVZX Packed Move with Sign and Zero Extend
PMULDQ, PMULLD Multiply Packed Signed Dword Integers and Store Low Result
POPCNT Return the Count of Number of Bits Set to 1
PTEST Logical Compare
ROUNDPD, ROUNDPS, ROUNDSD, ROUNDSS Round Packed and ScalarDouble and Single Precision Floating-Point Values

The 47 instructions available on Penryn represented the initial SSE4.1 release, with a further 7 instructions constituted Intel’s SSE4.2 release.

Previous versions of SSE have been licensed to AMD for use on its chips. It was unclear at the time of release whether SSE4 would be licensed in the same way.

In order to utilise the additional instructions fully code needs to be developed with it in mind from the very start. In particular compilers need to be modified to take advantage of the new native instructions and to this end Intel released a new version of its C compiler to coincide with Penryn’s debut.�As is usually the case with processor enhancements, either in hard, firm or software, the benefits are unlikely to manifest themselves immediately and it is the job of software engineers to realise SSE4’s potential.

  • Intel CoreDuo
  • Intel CoreSolo
  • Intel Core2
  • Guide to the Intel Core 2 Quad and Extreme processors
  • Technical guide to the Intel Penryn Core 2 Processor
  • SSE4 – Intel’s enhanced multimedia focussed CPU instruction set

Filed Under: Intel Core CPUs Tagged With: Intel, Penryn, SIMD, SSE4, Streaming SIMD Extensions

Latest Articles

What is BitCoin

Bitcoin is a digital currency in which encryption techniques are used to mine/generate new coins as well as verify the transfer of funds. If you want to get started on bitcoin, get advice at xCoins. Why is bitcoin Worth Anything? 1.Belief in the System:  Peoples willingness to accept the currency … [Read More...]

LP to CD Recording

Software to facilitate the digitalisation of analogue music is increasingly widely available. The Windows Sound Recorder is not suitable, since it doesn't allow for recording of more than 60 seconds. The application we're going to use in this tutorial is Audacity. This is quite a sophisticated … [Read More...]

Are You Aware of the Extent of Risks Facing Your iPhone

The market for iPhones is still booming after all these years. Apple iPhones have captured the third largest share of the smartphone market. Around 11% of all smartphones sold around the world are iPhones. Despite the popularity of iPhones, there are still some major concerns that people often … [Read More...]

20 Cool Creative Commons Photographs About the Future of AI

AI technology is starting to have a huge impact on our lives. The market value for AI is estimated to have been worth $279.22 billion in 2024 and it … [Read More...]

13 Impressive Stats on the Future of AI

AI technology is starting to become much more important in our everyday lives. Many businesses are using it as well. While he has created a lot of … [Read More...]

Graphic Designers on Reddit Share their Views of AI

There are clearly a lot of positive things about AI. However, it is not a good thing for everyone. One of the things that many people are worried … [Read More...]

Redditors Talk About the Impact of AI on Freelance Writers

AI technology has had a huge impact on our lives. A 2023 survey by Pew Research found that 56% of people use AI at least once a day or once a week. … [Read More...]

11 Most Popular Books on Perl Programming

Perl is not the most popular programming language. It has only one million users, compared to 12 million that use Python. However, it has a lot of … [Read More...]

10 Exceptional Books on ChatGPT that Will Blow Your Mind

ChatGPT is a powerful new AI tool that is taking the world by storm. You are going to find a lot of amazing books that will teach you how to make the … [Read More...]

Guides

  • Computer Communications
  • Mobile Computing
  • PC Components
  • PC Data Storage
  • PC Input-Output
  • PC Multimedia
  • Processors (CPUs)

Recent Posts

FBI Money Pak Virus

FBI MoneyPak is a malware client that holds your computer for ransom until you pay a fine. As stated this is malware, a computer virus that infected … [Read More...]

V90 Technology

The V.90 standard is neither x2 nor K56Flex, although it does use techniques from both. It is actually two standards in … [Read More...]

Guidelines on Troubleshooting Python Code

Whenever we write a computer program we need to verify that it works as expected. Usually we execute the code, if necessary we provide some inputs, … [Read More...]

[footer_backtotop]

Copyright © 2025 About | Privacy | Contact Information | Wrtie For Us | Disclaimer | Copyright License | Authors