
JF-AMD wrote:Since it is the holiday season I'll paraphrase from "A Christmas Story". The clarification was so good that I am exempt from ever having to do another one.
Now where is my daisy red ryder BB gun?

brutis wrote:JF-AMD wrote:Since it is the holiday season I'll paraphrase from "A Christmas Story". The clarification was so good that I am exempt from ever having to do another one.
Now where is my daisy red ryder BB gun?
You'll shoot your eye out with it

JF-AMD wrote:the addition of the second integer core in each module only adds ~5% of circuitry to the total die.



JF-AMD wrote:There have been some questions about the Bulldozer architecture so this should help clear up any confusion.
First, Bulldozer is based on a modular architecture where two integer cores are teamed up with an extra-large FPU to create what we call a Bulldozer module. Bulldozer modules are the basis of all of the designs that will be coming from this architecture, and it’s modular nature not only allows us to build processors with different sized core counts but also provides flexibility for future designs that could allow other modular components like GPUs to be added into the designs. The Bulldozer module is a concept and part of an architectural design, it is not something that the user will come in contact with. For instance, when an Interlagos system boots up, the hardware will see 16 integer cores, not 8 modules. When the OS loads, it will see 16 integer cores, not 8 modules, and the applications will see 16 cores as well. Because of this extremely consistent manner by which the whole system sees the integer core (and not modules), it is only natural that Interlagos will be marketed as a 16-core processor. It would actually be more confusing to call it an 8-core processor, because there is no point where a customer would see 8-cores.
Secondly, there was a question about the amount of die space that is consumed by having 2 integer cores in a module versus just one. Bulldozer was designed to be a modular architecture where 2 integer cores are able to share certain resources where it makes sense (in order to reduce power consumption) yet still retain discrete components in order to ensure great performance and no bottlenecks. It was never designed as a single integer core in each module, so dissecting the module components becomes a bit more tricky. Some have compared this to SMT and made statements that SMT customers could see a modest increase in performance for only a fraction of die space. We believe that our Bulldozer architecture will provide far greater performance gains than SMT with up to 80% greater expected throughput when running 2 threads simultaneously compared to a single thread running on a single integer core. Our engineers estimate that the amount of discrete circuitry that is added to each Bulldozer module in order to allow for a second integer thread to run is ~12% at the core level, but because the integer cores are only a portion of the overall die space , the addition of the second integer core in each module only adds ~5% of circuitry to the total die. We believe this is an excellent balance of greater performance with a very small silicon cost.
Finally, there are those that have suggested that the two integer cores in the Bulldozer module could potentially be merged together into a single core. This is not true. Perhaps they are confusing the functionality of the FPU, which is flexible enough to be split between the two cores in the module, giving each a 128-bit FMAC simultaneously, OR can be combined into a 256-bit FMAC for one integer core to use exclusively if the second integer core does not need any FPU commands in that cycle.
We hope this clarifies the questions that seem to be most prevalent.

superrugal wrote:Thanks JF, this message is very helpful !! Maybe the "50% additional area" statement is still unofficial or completely not right ?!
vsary6968 wrote:Thank you so much for the clarification. It very clear this time.
Will the Bulldozer arrive before 2011? It will be nice if it arrive 2H2010.
superrugal wrote:Maybe the "50% additional area" statement is still unofficial or completely not right ?!

JF-AMD wrote:Our engineers estimate that the amount of discrete circuitry that is added to each Bulldozer module in order to allow for a second integer thread to run is ~12% at the core level, but because the integer cores are only a portion of the overall die space , the addition of the second integer core in each module only adds ~5% of circuitry to the total die.





piesquared wrote:So JF, I guess that means that Interlagos and Valencia are Bulldozer are 8 and 4 modules on the same die (not package) connected by HT? That still shouldn't affect yield, correct? Maybe you can't answer that though.

abinstein wrote:piesquared wrote:So JF, I guess that means that Interlagos and Valencia are Bulldozer are 8 and 4 modules on the same die (not package) connected by HT? That still shouldn't affect yield, correct? Maybe you can't answer that though.
...
My guess is Interlagos is two 8-core connected in the same way as Magny-Cours. Valencia should be a 8-core die by itself.
Of course, this allows better yield than making a single die of 16 cores.


abinstein wrote:piesquared wrote:So JF, I guess that means that Interlagos and Valencia are Bulldozer are 8 and 4 modules on the same die (not package) connected by HT? That still shouldn't affect yield, correct? Maybe you can't answer that though.
...
My guess is Interlagos is two 8-core connected in the same way as Magny-Cours. Valencia should be a 8-core die by itself.
Of course, this allows better yield than making a single die of 16 cores.
CarlosTex wrote:JF your explanation was the most clarifying about bulldozer so far. Personally i think i´ve understood pretty much the idea and the concept of bulldozer, it will be a multi-threading monster for sure while saving A LOT of die space, performance per watt will be dramatically improved.
But i have a Big concern, i´m not an engineer but everytime i look into a bullduzer module and i think about it i get nervous. I know that the future is multi-threading but i´m kinda afraid that single thread integer performance won´t be amazing. OK i know that integer execution will be improved against K10.5 but IMHO the logic on these integer cores doesn´t look like it will be enough to rival Intel SandyBridge. I mean Nehalem already has got an edge against K10.5 on integer execution and i don´t know if these improvements on bulldozer will be enough to counter Intel´s SandyBridge. Is everybody sure that 4 pipelines per integer core will be enough? The beauty of this micro-architecture is that it´s easy to throw more hardware power to the design without having to redesign the whole thing. Maybe when bulldozer gets a 22nm shrink they can think of putting at least 6 pipelines per integer core and double the size of L1 caches both instruction and data, increasing the load/store capabilities to keep those pipelines working, and bigger L2 and L3. I don´t think this is so difficult to make because a redesign isn´t necessary, and at least AMD could keep improving its architectures with each shrink. For Intel this is a "tock" to a "tick".
Another small concern is clock speed and how well will the performance scale with clock increase. However i really really hope those lonely 4 integer pipelines will be enough. When i look at them in the module diagrams that are out there i always think of the first superscalar architectures i mean that looks to simple to hope for the performance increase we are expecting.
Remember that i´m talking about single-threaded integer performance, i´m not talking about the FPU.
Oh and makes sense that those two integer cores in a module can´t work together because they are separated but feels like were wasting that circuitry power when doing single-thread. Maybe if the L1 data was shared they could work together i don´t know but i feel that it´s a waste.
Anyway if you totally disagree with my concern or share it the same way i do, please comment!
Thanks


Return to K10: Barcelona, Shanghai, Quad-Core Opteron, Phenom
Users browsing this forum: Bing [Bot], Yahoo [Bot] and 2 guests