Username:
Password:
Remember Me?
   Lost your password?
Search



Nvidia's next-gen Compute architecture

G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots

At the GTC Keynote Nvidia has announced its next generation GPU and Cuda architecture - Fermi (G300). The Californians concentrated on flexible usability and high workload of the 512 Shader ALUs - DirectX 11 was only an aside.
High-Level Diagram of the G300/Fermi
 
High-Level Diagram of the G300/Fermi [Source: view picture gallery]
The architecture of the G300 is code-named Fermi and features about 3 billion transistors, 512 ALUs, up to 6 GiByte GDDR5 RAM and a 384-bit memory interface. Nvidia has not anything revealed about clock rates yet - therefore all details about capability are meant by clock which does not necessarily show the performance ratio of the final products to its predecessors.

With the Fermi architecture Nvidia more and more focuses on GPU computing and also uses those terms in their presentation. The former texture units turned into Load/Store units, the shader ALUs (which Nvidia formerly called stream processors) are Cuda core or Cuda processors now. Certainly chips basing on the Fermi architecture will be DirectX 11-compatible but Nvidia doesn't talk about that much.

Picture gallery  (enlarge to view source)


Fermi Streaming Multiprocessor orSIMD
 
Fermi Streaming Multiprocessor orSIMD [Source: view picture gallery]
Specifications: G300 Fermi
A total of 512 Cuda cores will be on the G300 chip, organized in 16 SIMD units. So every SIMD has 32 ALUs which share the 16 loading and memory units (LS units, ex-TMUs). Up to now Nvidia has not revealed details about the capabilities of the LS units. The presented specifications have not given hints about the power of texture filtering yet. Values between 4 and 16 texture filters per SIMD could be possible.

Special Function Units (SFUs) execute transcendental instructions such as sin, cosine, reciprocal and square root. Each SFU executes one instruction per thread, per clock; a warp executes over eight clocks. The SFU pipeline works independently from the dispatch unit, allowing the dispatch unit to issue to other execution units while the SFU is occupied.

Double precision arithmetic has been improved as well. It does not only fulfill the IEEE 754-2008 floating-point standard (formerly IEEE 754-1985 was used) with the more precise FMA (Fused Multiply-Add, which AMD offers with the HD 5800 series and Nvidia with the GT200 only for DP), but also the DP output increases by factor 8 compared with the GT200 (per clock cycle!). Every SIMD (called Streaming Multiprocessor) can execute 16 FMA operations and 256 per chip respectively - the GT200 was only capable of 30 DP-MADs.

The SIMDs thread in groups of 32 parallel threads called warps. Each SIMD has two warp schedulers and two instruction dispatch units, allowing two warps to be issued and executed concurrently. Every warp scheduler activates either a group of sixteen cores, sixteen load/store units or four SFUs. Since the warps work independently, Fermi's scheduler does not need to check for dependencies from within the instruction stream.

Speeds & Feeds
 
Speeds & Feeds [Source: view picture gallery]


Fermi Cache and RAM
A clever trick is the cache division of the SIMDs. Physically there are 64 kiByte per SIMD/Streaming Multiprocessor. 16 kiByte are fix configurated Shared Memory (as it was with G80 and GT200) and 16 kiByte are configured as Level1 cache - the rest of the 32 kiByte can be freely used.

Furthermore the G300 with Fermi architecture offers a unified Level 2 cache with 768 kiByte capacity.

In our gallery you will also see brand-new screenshots from physics and raytracing techdemos. At the GDC the already known GPU Raytracing demo with a Bugatti Veyron was shown in an updated version and it uses a much more realistic illumination based upon Global Illumination. Other demos show a physically correct water simulation used in films as well as particle effects and a destruction simulation.

Fermi will probably not launch until 2010. Nvidia is still working on details so it will take another couple of months before the graphics cards will hit the retail market.




--
Author: Carsten Spille (Oct 01, 2009)


You’re looking for more information related to AMD/ATI products like Phenom, Athlon and Radeon? Then please check our special AMD landing page with news, reviews and guides.




Advertisement

Comments (16)

Comments 6 to 16  Read all comments here!
connos Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
02.10.2009 22:34
Some more on the "fake" Fermi.

www.semiaccurate.com/2009...
chizow Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
02.10.2009 19:37
Quote: (Originally Posted by natr0n)
latest word on the e-street is saying that actual card fermi was a prop/fake card. Meaning that ray traced car wasnt rendered on the fermi.

oops

Fuad confirming the card shown on stage was a mock-up, but the ASIC running the N-body simulation was Fermi. Reasons for not showing the working unit were obvious, its apparently a mess of wires and PCB. He was promised a picture of the working GPU later. Also the Bugatti ray-tracing demo was done on a GT200-based card, that was stated during the conference. Most logical reason why, they don't have working drivers beyond the very basic yet for Fermi.

www.fudzilla.com/content/...
connos Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
02.10.2009 19:11
Nvidia does thinks like that but who knows, for sure it wasn't real time. They were trying in a harry for sure to stall the ATI sales as if it was possible to render it in real time and demonstrate it they will probably do it.

Also you may think that ATI spread the rumors due that Nvidia didn't demonstrate it in real time.
natr0n Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
02.10.2009 17:35
latest word on the e-street is saying that actual card fermi was a prop/fake card. Meaning that ray traced car wasnt rendered on the fermi.

oops
Yapa Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
01.10.2009 18:43
Well I hope your right but that article and author did make some valid points, it made sense that nvidia wanted to move away from PC gaming.

Also why did Nvidia totally ignore any news on DX11 and gaming in their opening showcase? It seemed to all be focused on the features which help with HPC?
chizow Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
01.10.2009 18:43
Quote: (Originally Posted by Yapa)
So how long before we can install Windows without an Intel/AMD CPU?

Just found this interesting article:
arstechnica.com/hardware/...

It's kind of scary but what he says is true... Nvidia might be moving away from gaming video cards... this could be the end of PC gaming?


Not even close to signaling the end of PC gaming, they're the first to admit they piggy-back their GPGPU/cGPU goals on the back of their 3D graphics budget, that's the only way they could've developed CUDA so quickly on their architecture. Jensen said himself a GeForce was a supercomputer with a day job, 3D games during the day, and HPC at night.

Simply put the economics are not even close to the point it would make sense or be fiscally responsible for Nvidia to neglect their consumer GPU business. Keep in mind, Nvidia still generates 75% of their revenue directly from their GPU business. 50% of that is from their Desktop GPU segment. 25% is from their professional solutions, but only 1% of that is Tesla (the other 24% being Quadro). That was from the last quarterlies I checked, but they also mentioned that yesterday saying with only ~40m total revenue from Tesla. That's nothing given they have ~4 billion in annual revenue and a 1 billion per year R&D budget.

Anyways, from their presentation yesterday, I can see them moving away from x86 and Intel some day, maybe not for windows, but certainly for their HPC/cGPU/Tesla business. They actually talked about that some yesterday when comparing their Heterogeneous Computing model. Eventually they want to get to the point they only need a single fast CPU and the rest would be accelerated in parallel on the GPU.

Personally I think this shift will start on the mobile and handheld market with Tegra and perhaps ARM processors. Its well-known Microsoft wants to get into this potentially lucrative market and in order to do so, they need to trim down their OS and adapt it away from its reliance on x86. Once that happens, and these "alternative" processors get faster....you never know. They may just get to the point they're "fast enough" to run serial code well enough to run an OS and keep a monster brain of a Nvidia cGPU fed and happy......
Yapa Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
01.10.2009 17:52
Quote: (Originally Posted by Unregistered)


Both intel (with Larrabee) and AMD (with Fusion) plan to do something like what nvidia did, but nvidia was faster.


So how long before we can install Windows without an Intel/AMD CPU?

Just found this interesting article:
arstechnica.com/hardware/...

It's kind of scary but what he says is true... Nvidia might be moving away from gaming video cards... this could be the end of PC gaming?
Yapa Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
01.10.2009 17:38
Interesting, I think we will see something really special here.

Some points of concern from the pics... is that the actual retail card shown?

Look at the power connectors, 1x6pin and 1x8pin.... thats ok BUT they put the 8pin connector on the REAR of the card! How stupid is that design, have they gone backwards? 8800GTX had them on the top and so do most of the high end Nvidia range.... why would they now put it on the rear and one on the top!

I hope thats not the final design, as many cases will not fit it... dont like that at all!

Also, why does it only have 1 x DVI connectors? I think this might be some kind of developer / test card... hopefully.

Yapa
connos Re: G300-Fermi: Nvidia focuses on GPU Computing - Impressive Raytracing demo shots
Senior Member
01.10.2009 17:24
This article and others said that the way the architecture is, its easy to scale it down. On paper it sounds impressive and very powerful. We need to see it in action though. I am sure it will be fast but for me how fast I want to play my games? How much will this cost? I am still using 19 monitor in 1280 1024.

Nvidia must go this way as they will be out of business and I like that they try to innovate and create a lot of pressure for ATI, this benefits the consumer.

I oversees that ATI will come with a totally new architecture in their next Graphics Card as Nvidia made them a lot of damage with their 8800 series so they couldn't come this time with a hole new card.
dried Re: G300-Fermi: Nvidia focuses on GPU Computing wi...
Member
01.10.2009 16:03
OK. I fail to see the point of this.

Are they really targeting the gamer community or is it something more?

Performance wise, Fermi seems promising. Price? not so much.
A 5870 in Xfire can already make a processor be the bottleneck in a lot of situations. What will 2x Gtx (380?) do ?

Correct me if im wrong, but i believe that the real gaming market is at the price and level of the 5850. Will Nvidia have anything to compete against it (dx11)?

Maybe its too soon to say anything. Ill just wait for more details

Copyright © 2010 by Computec Media AG      About/Imprint  •  Terms/Conditions