Hardware Secrets
Home | Camera | Case | CE | Cooling | CPU | Input | Memory | Mobile | Motherboard | Networking | Power | Storage | Video | Other
Content
Articles
First Look
Gabriel's Blog
News
Reviews
Tutorials
Main Menu
About Us
Awarded Products
Compare Prices
Datasheets
Dictionary
Download
Drivers
Forums
Links
Manufacturer Finder
Newsletter
On The Web
RSS Feed
Test Your Skills
Twitter
Newsletter
Subscribe today!
Search




Recommended
The Winn L. Rosch Hardware Bible, 6th Edition (2 Vol. Set)
The Winn L. Rosch Hardware Bible, 6th Edition (2 Vol. Set), by Winn L Rosch (Que), starting at $2.32
Home » CPU
Inside Intel Nehalem Microarchitecture
Author: Gabriel Torres
Type: Tutorials Last Updated: August 26, 2008
Page: 7 of 7
$ Check REAL-TIME pricing for Intel Celeron E1500 Socket 775 65mm Dual-Core Processor with 2.2GHz 800MHz FSB 512KB L2 Cache Model BX80557E1500 Retail Box Free Ground Shipping $.
Directron: $49.99 Newegg: $49.99
CircuitCity: $49.99 TheNerds: $54.99

Other Features

Now that we covered all main features brought by the new Nehalem core, we are going to explain a little bit more about two important features, Hyper-Threading and the optimization done to deal with unaligned SSE instructions.

Hyper-Threading technology allows each CPU core to be recognized as two CPUs. Thus if you have a Core i7 with four cores, the operating system will recognize it as having eight cores. This technology is based on the fact that when the CPU core is running there are certain circuits inside that are idle and thus can be used. Originally released for the Pentium 4 CPU this is the first time this technology is available on a 6th generation Intel CPU. This technology is also called SMT or Simultaneous Multi-Threading (SMT). This technology does not provide the same performance gain as if “real” CPU cores were used instead (i.e. a CPU with 8 cores is faster than a CPU with 4 cores and HT technology, provided that they both work under the same clock rate and are based on the same architecture), however you are gaining these extra “CPU cores” for free.

There are two kinds of SSE instructions that access memory, aligned and unaligned (also called misaligned). Aligned instructions required the requested data to be inside 16-byte (128-bits) address boundaries, while unaligned instructions don’t. See Figure 9 for an illustration.

Aligned vs unaligned (misaligned) instructions
click to enlarge
Figure 9: Aligned vs. unaligned instructions.

O.k. we know that this sounds cryptic for you, so let’s translate into English.

Imagine a system with dual-channel memory. The memory controller will access the memory 128 bits at a time. So the memory will be divided into 128-bit (16 bytes) blocks. So in theory the address that you request must start at the beginning of each block, so you can make a 128-bit read (or write) and get what you want at just one request. This is the aligned request shown on top of Figure 9.

But suppose that you issue a command to read a data from the memory but instead of using the first address inside the block you ask for the address in the middle of the block. Since you are requesting a 128-bit data, what will happen is that half of the data will be on the first block and the other half of the data will be on the next block – this is shown on the bottom of Figure 9. Since the data you requested will be split into two different blocks the memory controller will have to read two memory blocks, not just one as it happened on the previous example. On the first read you will get back half of the data you want and on the second read you will get the remaining of the data.

Although aligned requests are more efficient they are more difficult for programmers because they need to know the memory organization. Because of that most programmers end up using only unaligned instructions.

Previous Intel CPUs were optimized for aligned instructions and unaligned ones were slower and were translated into multiple micro-ops – in other words, unaligned instructions were easier for the programmer but ran slower. Nehalem-based CPUs are optimized for unaligned instructions, achieving the same speed as aligned instructions. The slide on Figure 10 summarizes this.

Nehalem Core i7
click to enlarge
Figure 10: Nehalem is optimized for unaligned SSE instructions.

Pages (7): « 1 2 3 4 5 6 [7]
Print Version | Send to Friend | | Bookmark Article | Comments (2)

Related Content
  • Details on Intel’s Forthcoming 45 nm Manufacturing Technology
  • Core 2 Duo E6750 Review
  • Core 2 Duo E7200 CPU Review
  • Core 2 Duo, Core 2 Quad, Phenom X3 and Phenom X4: Which One is the Best USD 200 CPU?
  • Everything You Need to Know About The QuickPath Interconnect (QPI)

  • Recommended Deal.
    Phenom II 550 Black Edition Phenom153 ProcessorAMD Phenom II X2 550 Black Edition Dual Core Processor - 3.10GHz Socket AM3 6MB Cache 2000MHz 4000 MT/s Retail Unlocked Multiplier HDZ550WFGIBOX


    CircuitCity: $109.99 CDW: $135.99
    Newegg: $102.00 Directron: $104.99

    RSSLatest News
    Spire Announces CoolNess Laptop Cooler
    November 6, 2009 - 3:07 PM PST
    Mushkin Launches Memories with Copper-made Heatsink
    November 5, 2009 - 3:49 PM PST
    Super Talent Launches USB 3.0 Flash Memories
    November 5, 2009 - 3:47 PM PST
    VIA Announces Nano 3000 Processor Series
    November 5, 2009 - 3:42 PM PST
    Sapphire Announces Vapor-X HD 5870 and HD 5750 Video Cards
    November 5, 2009 - 3:38 PM PST
    Gelid Unveils Tranquillo CPU Cooler
    November 5, 2009 - 3:36 PM PST
    Noctua Intros NH-D14 Premium CPU Cooler
    November 3, 2009 - 8:14 PM PST
    Transcend Unveils DDR3-1333 Memory Kits
    November 3, 2009 - 7:57 PM PST
    EVGA Launches GeForce GTX 275 CO-OP PhysX Edition
    November 3, 2009 - 7:51 PM PST
    Akasa Launches Freedom Xone Mid-tower Case
    November 2, 2009 - 6:05 PM PST
    .:: More News ::.

    RSSLatest Content
    SilverStone Grandia GD04 Case Review
    Can We Trust the 80 Plus Certification?
    NZXT Gamma Case Review
    Patriot Box Office Media Player Review
    ASUS U-75HA 750 W Power Supply Review
    MSI P55-GD80 Motherboard
    Thermaltake Element V Case Review
    Nokia 7705 Twist Cell Phone Review
    Cooler Master Hyper TX3 CPU Cooler Review
    Some Pictures from Our Office
    Antec Two Hundred Case Review
    Corsair TX950W Power Supply Review
    XFX Radeon HD 5770 Video Card Review
    XFX Radeon HD 5750 Video Card Review
    Scythe Big Shuriken CPU Cooler Review

    Our Most Popular Articles
    Maximum CPU Temperature
    1,070,946 views
    How to Find Out Your Motherboard Manufacturer and Model
    695,324 views
    nVidia Chips Comparison Table
    671,491 views
    Connecting Two PCs Using a USB-USB Cable
    585,891 views
    How To Correctly Apply Thermal Grease
    556,019 views
    AMD ATI Chips Comparison Table
    554,163 views
    ATI Radeon X1300 Pro Review
    485,829 views
    ATI Radeon X1600 XT Review
    474,888 views
    How To Perform a BIOS Upgrade
    389,448 views
    Sempron vs. Athlon XP
    336,581 views

    Latest Threads in Our Forums
    help deciding cpu's here
    by shadixmax
    Is it a vga problem or motherboard has shocked?
    by ftomsuk
    Patriot Box Office Media Player Review
    by cchjde
    Is it possible LCD Monitor leaked by itself ?
    by delta32
    Suddenly death syndrome and pendrives
    by Desert Fox
    better cpu cooler?
    by sam_wade07
    Video Transfer camcorder to PC
    by fjs559
    Spire Announces CoolNess Laptop Cooler
    by Hardware Secrets Team
    Can We Trust the 80 Plus Certification?
    by Merman
    SilverStone Grandia GD04 Case Review
    by Olle P
    .:: Visit Our Forums ::.


    © 2004-9, Hardware Secrets, LLC. All rights reserved.
    Advertising | Legal Information | Privacy Policy
    All times are Pacific Standard Time (PST, GMT -08:00)