There is an ungreen revolution taking place in enthusiast game PC circles.
A 1,250-watt power supply--this one from Cooler Master--is the largest a game PC maker will install today.
(Credit: Cooler Master)
The eye-opening graphics possible on today's game PCs come at a cost: light-dimming power consumption. The trend, rooted in the perennial quest for more speed, bucks the overall greening of the PC industry.
Green PC designs have become more than just practical; they're cool. Power-sipping Netbooks are in, as are small desktops like the Dell Studio Hybrid and Hewlett-Packard Pavilion Slimline.
This is not the case for high-end gaming PCs, where bigger is better. How far this trend can go isn't clear, but a seminal event in Apple's history may offer a lesson. In 2001, Apple unveiled one of the first dual-processor consumer systems, based on the overheating-prone IBM PowerPC G4 processor. The original Apple tower design had a Rube Goldberg feel to it, with a host of fans straining to rid the system of heat. A noise like that emitted by a wind tunnel, generated by the power supply and fans, forced Apple to redesign the system.
This symbolized why Apple eventually abandoned PowerPC: The platform wasn't efficient with power.
Fast-forward to 2008. Game rig makers are cramming as many as four graphics chips into high-end boxes that are notable not only for performance but also for the power they consume. As a consequence, big power supply units are in vogue. Today, bragging rights extend to the units themselves: some systems boasting boutique brand names such as Cooler Master and SilverStone draw 1,200 watts--roughly three times the power requirements of game systems a few years ago.
It's an ominous trend, according to box makers. "If this trend does continue, then, yes, it will give us problems," said George Yang, an engineer at Los Angeles-based game rig maker IBuyPower. "A regular home user would have to have an electrician come in, get the outlet out, and plug in a higher breaker," Yang said. Today, some of the higher-end systems with big power supplies require a special wall power socket, according to Yang.
Other game rig makers are equally concerned. "I swore that I'd never break 1,000 (watts)," said Kelt Reeves, president of game PC maker Falcon Northwest. "Unfortunately, that's been the solution for the past several years. Bigger, bigger, bigger power supplies."
Reeves says that 1,200 watts is now essential for gaming systems based on multiple boards from Nvidia or AMD's ATI graphics unit. "With three GTX 280s or two of the R700 cards, we're recommending they go with a 1,200-watt power supply," Reeves said, referring to the newest graphics chips from Nvidia and ATI respectively.
This is just about the limit, he said. "We can't go too much more over that before--if you actually pull that (power)--you start tripping the client's household circuit breaker."
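A rough sketch of why 1,200 watts sits near the ceiling Reeves describes. None of the electrical figures below come from the article: the 120-volt, 15-amp circuit, the 80 percent continuous-load guideline, and the 80 percent power supply efficiency are all assumptions chosen for illustration.

```python
# Back-of-the-envelope check of the breaker limit described above.
# Assumptions (not from the article): a 120 V / 15 A household circuit,
# an 80% continuous-load guideline, and a hypothetical 80%-efficient PSU.

def wall_draw_watts(dc_load_watts: float, psu_efficiency: float) -> float:
    """Power pulled from the wall for a given DC load on the power supply."""
    return dc_load_watts / psu_efficiency

def circuit_capacity_watts(volts: float, amps: float, continuous_factor: float = 1.0) -> float:
    """Power a branch circuit can deliver, optionally derated for continuous load."""
    return volts * amps * continuous_factor

peak = circuit_capacity_watts(120, 15)             # breaker rating
continuous = circuit_capacity_watts(120, 15, 0.8)  # derated for sustained load
draw = wall_draw_watts(1200, 0.80)                 # fully loaded 1,200 W supply

print(f"Circuit rating: {peak:.0f} W, continuous guideline: {continuous:.0f} W")
print(f"Wall draw for a fully loaded 1,200 W supply: {draw:.0f} W")
print("Exceeds continuous guideline:", draw > continuous)
```

Under these assumptions a supply delivering its full 1,200 watts would pull more from the wall than the circuit's continuous guideline allows, which fits Reeves' caveat "if you actually pull that (power)."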
Neither Nvidia nor ATI shows any signs of slowing down, according to Reeves. "Eventually these chips get so hot that their own heat becomes a barrier to performance," he said.
Nvidia admits that its chips are drawing more power than before. "If we go back about three years, our graphics card power was in the 120- to 130-watt range," said Jason Paul, product manager in charge of enthusiast GPUs (graphics processing units) at Nvidia. "The GTX 280 which we launched a couple of months back, it's around 230 watts (of) graphics card power," he said.
But Paul claims the performance per watt is the key yardstick, not raw power. "Where you see a little under 2X increase in maximum power, you've seen probably 3-times or 4-times (the) increase in the level of performance. So, overall we see a substantial improvement in performance per watt. This is the big metric we track to ensure we're delivering efficient architectures."
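Paul's figures can be turned into a rough performance-per-watt estimate. The sketch below uses only the numbers he quotes (120 to 130 watts three years ago, about 230 watts now, and a three- to fourfold performance gain); the function name is illustrative, and the performance multiple is his estimate rather than a benchmark result.

```python
# Rough performance-per-watt estimate from the figures Paul quotes.
# Power: 120-130 W three years ago vs. about 230 W for the GTX 280.
# Performance: "probably 3-times or 4-times" higher (his estimate).

def perf_per_watt_gain(perf_gain: float, old_watts: float, new_watts: float) -> float:
    """Ratio of new to old performance per watt."""
    power_gain = new_watts / old_watts
    return perf_gain / power_gain

low = perf_per_watt_gain(3.0, old_watts=120, new_watts=230)   # pessimistic case
high = perf_per_watt_gain(4.0, old_watts=130, new_watts=230)  # optimistic case

print(f"Power increase: {230 / 130:.2f}x to {230 / 120:.2f}x")
print(f"Performance per watt improves roughly {low:.2f}x to {high:.2f}x")
```

On these numbers, efficiency improves by somewhere between about one and a half and two and a quarter times, even as absolute power draw nearly doubles.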
Paul says Nvidia has implemented power savings techniques on its GTX 280 that keep the power down when it's not running at top performance loads. "With the GTX 280 at idle, that card runs at about 25 watts, which is one-tenth of its absolute worst-case power," he said. Nvidia also offers hybrid graphics technology that turns off all the power-sucking boards when they're not in use.
Dell XPS 730 game box uses special liquid cooling to control heat.
(Credit: Dell Computer)
Moreover, Paul says that the multiboard systems are limited to a small niche at the very top of the market. "There's definitely a segment of the market that wants more and more performance. Remember, however, that this is the ultimate performance (segment)."
But game box makers ship many--if not most--of their systems to the very niche that Paul is describing. "We're all about the high end. The higher-end the graphics card is, and the more expensive, the more we sell," said Reeves.
And the trend in power supplies exemplifies how this market has changed. "The power supply used to be just a silver box, and nobody gave it a second thought," he said. "(But) as graphics cards have evolved, they have forced the power supply makers to keep providing more and more power pipes--or cabling--to the graphics cards"--increasing the unit's complexity, he said.
Reeves cites GPUs, not CPUs from Intel, as the culprit. "The latest CPUs use very little wattage. If you overclock a 3GHz Intel CPU to 4GHz, you might pull 40 more watts. Whereas a graphics card, you put three of them in a system, they'll pull 800 watts running some of the higher-end games," he said.
Update at July 15, 3:00 a.m. PDT with additional information and corrections concerning the Intel-Nvidia dispute.
Nvidia said Monday that its multichip technology will be architected to work on Intel's upcoming Nehalem chip platform.
Nvidia SLI technology supports multiple graphics boards.
(Credit: Nvidia)
This announcement may help Nvidia to work around a standoff with Intel over whether Nvidia can make chipsets that work with Intel's next-generation Nehalem platform, due later this year. It also demonstrates that, despite Nvidia CEO Jen-Hsun Huang's rhetoric, Nvidia must cooperate with Intel in order to thrive.
Monday's announcement has no relation to separate licensing negotiations, according to sources familiar with the discussions. In other words, Nvidia is not announcing a chipset for Nehalem--which would require a license. It is simply a statement that Nvidia will support Nehalem with its nForce 200 Scalable Link Interface (SLI) chip.
However, some reports say Nvidia has reached an agreement with Intel to license Intel's QuickPath Interconnect (QPI) technology, paving the way for Nvidia to design chipsets for Nehalem.
The nForce 200 chip will work with Intel's "Bloomfield" line of Nehalem processors and the accompanying Intel chipset. SLI allows Nvidia to use multiple graphics boards in one system.
Upcoming SLI motherboards will use Nvidia nForce 200 SLI silicon, Intel Bloomfield processors, and Intel Tylersburg (X58) chipsets, Nvidia said in a statement.
"The nForce 200 SLI processor features patented SLI technology for graphics bandwidth management and multi-GPU peer-to-peer communications, both required to optimize graphics performance," Nvidia said. GPU stands for graphics processing unit.
Future systems "can be powered by one, two, or even three Nvidia GeForce GPUs, including the new...GeForce GTX 280 and GTX 260 GPUs," according to Nvidia.
Nvidia included statements from system suppliers in the Monday release. "It's great to see that Nvidia opted to enable SLI on the future Intel Bloomfield platform," said Rahul Sood, CTO Voodoo Business Unit, HP. "Make love not war I say...and Nvidia's (enabling) of Intel chipsets to support SLI will make our jobs much easier."
Nvidia claims that nForce 200 SLI silicon with Intel's new Bloomfield processor and Tylersburg core logic chipset will deliver up to a 2.8X performance boost over traditional single graphics card platforms.
Motherboards and PC systems that will use the Nvidia nForce 200 SLI chip, Nvidia GeForce GPUs, and Nvidia SLI technology will be available from companies such as Acer, ASUS, Dell, Falcon Northwest, Legend, and Velocity Micro.
Nvidia has slashed the price of products with its newest GTX 260 and 280 graphics processors only a few weeks after it launched the chips, in response to stiffer competition from Advanced Micro Devices' ATI unit.
Nvidia said Sunday night that the GeForce GTX 280 is now available for $499 and the GTX 260 for $299. The high-end GTX 280 was originally $649, while the 260 was priced previously at $399. Both products were rolled out less than a month ago.
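For a sense of scale, the cuts can be expressed as percentages of the launch prices. This is a minimal sketch using only the four prices quoted above; the function name is illustrative.

```python
# Size of the price cuts, using the launch and current prices given above.

def price_cut_percent(launch_price: float, new_price: float) -> float:
    """Percentage reduction from the launch price."""
    return (launch_price - new_price) / launch_price * 100

cuts = {
    "GeForce GTX 280": price_cut_percent(649, 499),
    "GeForce GTX 260": price_cut_percent(399, 299),
}
for card, pct in cuts.items():
    print(f"{card}: {pct:.1f}% lower than at launch")
```

Both cards end up close to a quarter cheaper than they were at launch.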
Nvidia's graphics boards are now more in line with ATI's newest offerings. At $299, the GTX 260 price now matches that of ATI's comparable HD 4870.
There's more to come from ATI too. Later this quarter, ATI is expected to launch the 4870 X2, which combines two chips on one board. This will be ATI's high-end offering for the enthusiast gaming market. The lower $499 price for Nvidia's high-end GTX 280 should bring it close to 4870 X2 pricing.
ATI appears to be faring well in this round of graphics chip competition, putting more pricing pressure than usual on Nvidia. Not only are its individual chips more competitive than previous generations, but its strategy of building smaller, lower-cost chips is paying off. Instead of building one large, expensive graphics processor as Nvidia does, ATI is building less power-hungry chips for the mid-range market, then ganging them together to boost performance for the high-end enthusiast market.
Advanced Micro Devices' bet on a new approach to graphics chip design appears to be paying off, according to analyst Jon Peddie. This could put AMD's ATI graphics chip unit on top again--or at least on equal footing with Nvidia, the graphics leader over the last few years.
Peddie heads Tiburon, Calif.-based Jon Peddie Research, which specializes in graphics chip market research.
Test reports on AMD's and Nvidia's newest graphics chips are pouring in. Both companies are racking up good scores. (See Diamond Viper Radeon HD 4850 review here.)
But beyond the day-to-day test scores, AMD's ATI graphics chip unit may be winning the longer strategic battle. ATI has gambled on a radically different strategy for its latest series of chips--the HD 4850, HD 4870, and upcoming dual-chip R700.
"(AMD) is starting in the middle of the market and scaling up. That's a break with tradition," said Peddie. "We always started at the very, very tippy-top and build the most powerful thing you could and then let it scale down over time."
But Peddie said this traditional approach just isn't practical anymore. "The chipsets keep getting larger and larger despite the fact that we were going to smaller and smaller (manufacturing) process nodes. The chips grew faster than the process nodes shrunk and the consequences of that is that the power consumption went up, the costs went up, and it got to the point where it's kind of impractical to continue along that way," he said.
In essence, AMD's ATI unit strategy is to build smaller, less power-hungry chips and then gang them up to get better performance. Nvidia's strategy has been to build one large, extremely fast--and extremely power hungry--chip.
For ATI, the execution of this chip-ganging strategy is the key. And this is where ATI appears to have been successful. "The inter-processor communications. Getting that to work has been the trick. This is what ATI has done. They've come up with this stellar way of doing inter-processor communications so they can in fact get the scaling," according to Peddie.
And there's more than meets the eye. ATI has also cut in half the number of bits in the memory interface, Peddie said: down to 256 bits while Nvidia has remained at 512. "That has the benefit to ATI of reducing a big hunk of the power consumption."
Peddie said in the past this kind of approach would have been suicidal because it would have decimated ATI's test scores. "The argument against this is that graphics performance is a function of memory," he said. "Typically you want wider and wider (bit width)."
But ATI has countered this by using the fastest memory standard available. "So to compensate for shrinking down the bit width, ATI has jumped to the next-generation in memory design called GDDR5. GDDR5 is approximately three times faster than GDDR3--which is what Nvidia is still using and what ATI uses on their smaller cards," according to Peddie.
"So with three times the speed but half the width, they end up with 1.5 times the processing capability with the memory."
"A very clever thing that they did but mind you it was a gamble that looks like it's going to pay off," he said.
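Peddie's arithmetic follows from a simple relationship: relative memory bandwidth is the per-pin speed ratio multiplied by the bus width ratio. The sketch below works through his numbers; the threefold GDDR5-versus-GDDR3 figure is his approximation, not a measured specification, and the function name is illustrative.

```python
# Relative memory bandwidth = relative per-pin data rate x relative bus width.
# The 3x GDDR5-vs-GDDR3 speed figure is Peddie's approximation, not a spec.

def relative_bandwidth(speed_ratio: float, width_bits: int, baseline_width_bits: int) -> float:
    """Bandwidth relative to a baseline with a different speed and bus width."""
    return speed_ratio * (width_bits / baseline_width_bits)

ati_vs_nvidia = relative_bandwidth(speed_ratio=3.0, width_bits=256, baseline_width_bits=512)
print(f"ATI memory bandwidth relative to a 512-bit GDDR3 design: {ati_vs_nvidia:.1f}x")
```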
ATI has more processing units than Nvidia inside its chip too. "The other thing is that ATI has 800 processors in their chip and Nvidia has 240. That has a processor count advantage," Peddie said.
Though it remains to be seen if this advantage is borne out in testing over time, he added. "Nvidia and ATI keep improving their drivers so they'll seesaw back and forth with their scores, almost from week to week."
But in the long run, Nvidia may be forced to adopt ATI's strategy to keep pace in these week-to-week battles. "If ATI is successful, as we expect that to be, then Nvidia will have no choice but to adopt (ATI's) approach, just out of practicality," Peddie said. "It just makes a whole lot of sense."
AMD-ATI's upcoming R700 (rumored to be called the 4870 X2) two-chip graphics board will be the ultimate test of this strategy.
"It's a new proprietary inter-processor communication technology. If they put these two chips on one board and it does scale properly, then they have pulled off a coup," he said.
"When you gang up graphics chips (using the traditional Scalable Link Interface or CrossFire technologies) they roll off pretty fast." ("Roll off" implies that performance doesn't scale up well.) "So when you put two boards in, you don't get twice the performance but you (only) get one and a half. You put four boards in and you (only) get about 1.7, 1.8. What ATI is saying is that with two chips using (their) proprietary inter-bus, they will get 1.8 (the performance) with two chips. If that's true, you can expect to see four of them giving you something around 2.5."
Getting 2.5 times the performance from four boards would be a masterstroke for ATI.
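One way to compare these figures is per-GPU scaling efficiency: the speedup divided by the number of graphics processors. The sketch below applies that to Peddie's numbers, taking the upper end of his "1.7, 1.8" estimate for four boards; the labels and function name are illustrative.

```python
# Scaling efficiency = speedup / number of GPUs, using Peddie's figures.

def scaling_efficiency(speedup: float, gpu_count: int) -> float:
    """Fraction of ideal (linear) scaling actually achieved."""
    return speedup / gpu_count

figures = [
    ("SLI/CrossFire, 2 boards", 1.5, 2),
    ("SLI/CrossFire, 4 boards", 1.8, 4),   # upper end of "1.7, 1.8"
    ("ATI claim, 2 chips", 1.8, 2),
    ("Peddie's projection, 4 chips", 2.5, 4),
]
for label, speedup, count in figures:
    efficiency = scaling_efficiency(speedup, count)
    print(f"{label}: {speedup}x speedup, {efficiency:.1%} of linear scaling")
```

Even the projected 2.5 times from four chips is well short of linear scaling, but it would be a clear step up from what the traditional links deliver with four boards.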
The previous ATI dual-chip solution was very different, Peddie said. "The HD 3870 X2 was not a proprietary bus but a CrossFire connection. The CrossFire connection and the SLI connection are at the very, very end of the pipeline. Not the most efficient place to do an inter-processor communication. That's one of the reasons ATI has abandoned it."
AMD's ATI unit is also better positioned than it was before in manufacturing. "Part of the reason that Nvidia has been ahead is that ATI has been suffering over the last three or four years with manufacturing problems. It's not that ATI didn't have a good chip, the problem was that ATI couldn't build enough of them."
This should change with the newest series of chips. "This (design)--so they say--will really go into high-volume production." Though he cautioned this still remains to be seen.
"The (new ATI chip) is a really efficient, tight design. They used to do this all the time but they kind of got off that trail. And now they're back on it."
Nvidia has posted a video that involves the new GTX 280 chip, overclocking, lots of liquid nitrogen, and the Nvidia labs. Oh, and no quad-core processor. Get the point?
Nvidia NForce 790i Ultra SLI motherboard
(Credit: Nvidia)
"A lot of people believe you need an Intel quad-core or Intel quad-core Extreme to build an extreme PC," says the post by "Steffee" on the Nvidia Web site. "Today I'm going to build a gaming PC using the Intel Core 2 Duo. That's duo. Got that? Duo, two cores."
I think the point the blogger is trying to make is that the test system has only two cores, though I could be mistaken.
Here's an excerpt from the blog: "Think you need a quad-core CPU for an extreme gaming PC with impressive 3DMark Vantage numbers and gaming performance?...I overclocked and hyper-cooled an SLI gaming rig using two of our latest and greatest GPUs--GeForce GTX 280...and a sub-$200 CPU."
Intel, of course, has a different take on this. "Most of what people do today with their computers requires powerful processors. Examples of processor-intensive applications include: creating content, viewing/editing high definition video, using social media sites, office tools, downloading music, and editing photos," Intel said in a statement.
In the test, the blogger uses an NForce 790i Ultra SLI motherboard and an Intel Core 2 Duo E8400, in addition to the GTX 280 graphics card.
With the single GTX 280 board, the score is a 3DMark Vantage "X4796."
Then the blogger (an Nvidia employee) takes a hike to the Nvidia lab. "Now we're going to take a field trip to Nvidia labs to do some serious overclocking," she says. After adding what seems to be prodigious amounts of liquid nitrogen and adding another GTX 280 SLI board, the 3DMark score jumps to "X10,282."
Intel could respond (which it hasn't) by saying that the E8400 is not a slow processor: it runs at 3.0GHz and has 6MB of cache. And overclocked with liquid nitrogen, it would probably get some pretty good scores too. And then, of course, it might be simpler to just get a quad-core Extreme CPU.
In the test, the Nvidia GTX 280 core clock was overclocked to 727MHz and the shader to 1458MHz. The core clock is normally 602MHz and the shader clock 1296MHz.
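To put those clock speeds and scores in proportion, here is a minimal sketch that computes the overclock percentages and the ratio between the two 3DMark Vantage scores reported above. It uses only figures from the post; the function name is illustrative. Note that the second score reflects both a second card and the overclock, so the ratio cannot be attributed to either one alone.

```python
# Overclock margins and score ratio, from the figures reported in the post.

def percent_increase(stock: float, overclocked: float) -> float:
    """Percentage gain of the overclocked value over the stock value."""
    return (overclocked - stock) / stock * 100

core = percent_increase(602, 727)      # core clock, MHz
shader = percent_increase(1296, 1458)  # shader clock, MHz
score_ratio = 10282 / 4796             # two overclocked cards vs. one stock card

print(f"Core clock: +{core:.1f}%")
print(f"Shader clock: +{shader:.1f}%")
print(f"3DMark Vantage score ratio: {score_ratio:.2f}x")
```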
System specifications:
--2× NVIDIA GeForce GTX 280 graphics cards running in SLI
--nForce 790i Ultra SLI motherboard
--Intel Core 2 Duo E8400 CPU
--4 GB SLI-ready Corsair DDR3 memory
--PC Power and Cooling TurboCool 1200W power supply
--Windows Vista 32-bit operating system
UPDATE: On Monday, Advanced Micro Devices and Nvidia are launching graphics chips based on distinctly different design philosophies.
Nvidia chips are big and powerful
(Credit: Nvidia)
Nvidia's GTX 280 and GTX 260 are designed to deliver the biggest performance bang per chip. A so-called "monolithic" approach packs 1.4 billion transistors and 240 processing cores onto one piece of silicon.
(See Peter Glaskowsky's review of the GTX 280.)
AMD's modular approach tends toward less is more: smaller, less power-hungry chips that can be strung together to achieve higher performance. The company plans to implement this strategy with the HD 4850 and HD 4870 graphics processing units (GPUs) that are being introduced on Monday. (See "Notes" below.)
This design philosophy is based on the same multicore strategy now employed by AMD (and Intel) for the CPU, the main PC processor. Instead of making one chip bigger and faster, AMD boosts performance by stringing together multiple chips. This approach also allows a chipmaker to avoid the time and expense of designing separate processors for the midrange and very-high-end segments.
(Clarification: In the case of a CPU, a core refers to a "scalar" x86 processor core found, for example, in a quad-core Intel or AMD chip. The GPU core is a "parallel processing core." GPUs these days typically contain hundreds of these cores.)
"The beauty of this design is that it's scalable. You can put one or two (chips) on a board," said Matt Skynner, vice president of marketing at AMD's Graphics Products Group.
In the midrange segment ($200 to $300) AMD uses a single chip--for example, an HD 4870. At the high-end ($500 and above), it adds another chip to scale up to better performance. This dual-chip design--code-named the R700--will be marketed as the 4870 X2.
On the other hand, Nvidia says its emphasis on a single, very-high-performance chip is necessary to keep it out front. "At the high end, there is no prize for second place," Ujesh Desai, general manager for GeForce products at Nvidia, said in an interview with Nanotech: The Circuits Blog last month.
(Nvidia also offers multichip designs, but it puts each chip on a separate board, while AMD puts two chips on one board.)
And Nvidia is trying to raise the bar with GTX 280. "We're rendering about 3 million triangles per frame," Curtis Beason, an engineer at Nvidia, said last month at an event where Nvidia previewed the GTX 280 chip.
AMD targets smaller chips that can be strung together to get better performance. AMD chip on left, Nvidia chip on right.
(Credit: AMD)
"With (the previous-generation) GeForce 8800, what we achieved is a very photorealistic character. Very detailed skin. But it was a single character," Jason Paul, the GeForce product manager, said at the Nvidia event last month. "With GTX 200 what we're moving to is multiple highly realistic characters."
Nvidia is also boasting that a dual-core GTX 280 can convert a high-definition movie into iPod video format in 35 minutes, compared to about five hours for a quad-core CPU system with low-end integrated graphics.
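Taken at face value, Nvidia's two times imply the following speedup. This is a minimal sketch using only the quoted figures; "about five hours" is treated as exactly 300 minutes, which is an approximation, and the function name is illustrative.

```python
# Implied speedup from Nvidia's transcoding claim.
# "About five hours" is treated as exactly 300 minutes (an approximation).

def speedup(baseline_minutes: float, accelerated_minutes: float) -> float:
    """How many times faster the accelerated run is than the baseline."""
    return baseline_minutes / accelerated_minutes

cpu_minutes = 5 * 60   # quad-core CPU system with integrated graphics
gpu_minutes = 35       # GTX 280
transcode_speedup = speedup(cpu_minutes, gpu_minutes)

print(f"Implied transcoding speedup: about {transcode_speedup:.1f}x")
```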
Hewlett-Packard's Voodoo unit will be one of the first to adopt the new Nvidia GPUs. "We are excited to be one of the first companies in the world to offer the technology in the new Exhilaration Edition of the...HP Blackbird 002," said Rahul Sood, chief technology officer, HP Voodoo Business Unit.
Notes: updated 6/16, 12:10 PM:
--AMD 4800 series processors will be available starting next week at Best Buy, according to AMD VP Rick Bergman, speaking Monday at an AMD event. "In just a little over a week from today. You'll be able to walk into a Best Buy and buy this chip (4800 series) on a graphics board for about $200. A teraflop for $200," Bergman said. He added that systems will also be available from Falcon Northwest, Velocity Micro, and IBuyPower. "We're also introducing a system that can take four of these boards," he said. "That's almost five teraflops of performance in a personal computer."
--Both AMD and Nvidia say their GPUs can achieve about one teraflops (trillion floating point operations per second) of performance.
--Nvidia's GeForce GTX 280 will retail for $649 and be available on graphics boards starting Tuesday. The GeForce GTX 260 will be priced at $399, with availability slated for June 26.
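The teraflops figures in these notes can be roughly reproduced from processor counts and clock speeds. In the sketch below, the processor counts (800 for AMD, 240 for Nvidia) and the GTX 280's 1296MHz shader clock are cited elsewhere in this coverage. The AMD engine clocks (625MHz for the HD 4850, 750MHz for the HD 4870) and the operations-per-clock factors are assumptions drawn from published specifications, not from the article. The function name is illustrative.

```python
# Peak single-precision throughput = processors x FLOPs per clock x clock rate.
# Assumptions (not from the article): engine clocks of 625 MHz (HD 4850) and
# 750 MHz (HD 4870); 2 FLOPs per processor per clock for AMD (one multiply-add)
# and 3 for the GTX 280 (multiply-add plus multiply).

def peak_teraflops(processors: int, flops_per_clock: int, clock_mhz: float) -> float:
    """Theoretical peak in trillions of floating point operations per second."""
    return processors * flops_per_clock * clock_mhz * 1e6 / 1e12

hd4850 = peak_teraflops(800, 2, 625)
hd4870 = peak_teraflops(800, 2, 750)
gtx280 = peak_teraflops(240, 3, 1296)
four_boards = 4 * hd4870

print(f"HD 4850: {hd4850:.2f} teraflops")
print(f"HD 4870: {hd4870:.2f} teraflops")
print(f"GTX 280: {gtx280:.2f} teraflops")
print(f"Four HD 4870 boards: {four_boards:.1f} teraflops")
```

Under these assumptions, four HD 4870 boards would land just under five teraflops of theoretical peak, consistent with Bergman's "almost five teraflops." These are theoretical ceilings, not measured performance.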
Advanced Micro Devices' ATI graphics chip unit doesn't want to build "huge" chips like rival Nvidia, an executive says.
But an Nvidia exec says smaller isn't always better or more efficient.
Such statements will help define how the two chip giants do battle at the high end of the graphics chip market in the coming years.
One of the largest graphics chips yet will be Nvidia's upcoming high-end GTX 280. This is the kind of chip that high-end gaming enthusiasts crave. But great performance often means a large transistor count. And the GTX 280 is expected to have both.
Here is an Asus board using AMD-ATI 3870 X2 that will be superseded by the new X2 board.
(Credit: Asus)
AMD, of course, also intends to deliver extreme graphics technology with its upcoming X2, a follow-on to the current 3870 X2 series. And AMD wants to be clear: its strategy is fundamentally different than Nvidia's.
"We took two chips and put it on one board (X2). By doing that we have a smaller chip that is much more power efficient," said Matt Skynner, vice president of marketing for the graphics products group at AMD.
"We believe this is a much stronger strategy than going for a huge, monolithic chip that is very expensive and eats a lot of power and really can only be used for a small portion of the market," he said. "Scaling that large chip down into the performance segment doesn't make sense--because of the power and because of the size."
Skynner said that AMD tries to design GPUs (graphics processing units) for the mainstream segment of the market, then ratchet up performance by adding GPUs rather than designing one large, very-high-performance chip.
Nvidia's "strategy is to design for the highest performance at all cost. And we believe designing for the sweet spot and then leveraging for the extreme enthusiast market with multiple GPUs is the preferred approach," Skynner said.
This applies to memory too. AMD thinks support for technologies like GDDR5 memory is another way to deliver good performance at a reasonable cost. "You don't need a huge chip with a huge data path to get the bandwidth. You can utilize a technology like GDDR5 to get that bandwidth," Skynner said.
Nvidia tends to favor very-fast, single-chip solutions.
Nvidia, of course, has a different take on why it chooses to develop big, fast chips.
"If you take two chips and put them together, you then have to add a bridge chip that allows the two chips to talk to each other...And you can't gang the memory together," said Ujesh Desai, general manager for GeForce products at Nvidia.
"So when you add it all up, you now have the power of two GPUs, the power of the bridge chip, and the power that all of that additional memory consumes. That's why it's too simplistic of an argument to say that two smaller chips is always more efficient."
Desai takes this argument a bit further. "They don't have the money to invest in high-end GPUs anymore. At the high end, there is no prize for second place. If you're going to invest a half-billion dollars--which is what it takes to develop a new enthusiast-level GPU--you have to know you're going to win. You either do it to win, or you don't invest the money."
(Note: Nvidia does offer GeForce 9800 GX2 technology but the GX2 uses a dual-board design--two 9800 chips, one on each board--rather than putting two chips on a single board as with AMD's Radeon HD 3870 X2.)