AnandTech Article Channel

Rosewill Blackhawk Ultra Case Review: Were It Not For Competition
HandBrake to Get QuickSync Support
Intel's PixelSync & InstantAccess: Two New DirectX Extensions for Haswell
FCAT: The Evolution of Frame Interval Benchmarking, Part 1

Rosewill Blackhawk Ultra Case Review: Were It Not For Competition

Posted: 27 Mar 2013 09:01 PM PDT

We've long maintained that Rosewill's Thor v2 is one of the best deals floating around for enthusiasts. In that enclosure, Rosewill has a product that's fairly feature rich, quiet, and offers stellar performance. Yet the Thor v2 isn't the flagship of their enclosure line, but today we have that flagship in house. Given its predecessor's stellar performance, expectations are pretty high for the Blackhawk Ultra.

HandBrake to Get QuickSync Support

Posted: 27 Mar 2013 06:01 PM PDT

The latest version of Intel's Media SDK open sourced a key component of the QuickSync pipeline that would allow the open source community to begin to integrate QuickSync into their applications (if you're not familiar with QS, it's Intel's hardware accelerated video transcode engine included in most modern Core processors). I mentioned this open source victory back at CES this year, and today the HandBrake team is officially announcing support for QuickSync.

The support has been in testing for a while, but the HandBrake folks say that they expect to get comparable speedups to other QuickSync enabled applications.

No word on exactly when we'll see an official build of HandBrake with QuickSync support, although I fully expect Intel to want to have something neat to showcase QuickSync performance on Haswell in June. I should add that this won't apply to OS X versions of HandBrake unfortunately, enabling that will require some assistance from Apple and Intel - there's no Media SDK available for OS X at this point, and I don't know that OS X exposes the necessary hooks to get access to QuickSync.

Intel's PixelSync & InstantAccess: Two New DirectX Extensions for Haswell

Posted: 27 Mar 2013 06:00 PM PDT

As Intel continues its march towards performance relevancy in the graphics space with Haswell, it should come as no surprise that we're hearing more GPU related announcements from the company. At this year's Game Developer Conference, Intel introduced two new DirectX extensions that will be supported on Haswell and its integrated GPU. The first extension is called PixelSync (yep, Intel branding appears alive and well even on the GPU side of the company - one of these days the HD moniker will be dropped and Core will get a brother). PixelSync is Intel's hardware/software solution to enable Order Independent Transparency (OIT) in games. The premise behind OIT is the quick sorting of transparent elements so they are rendered in the right order, enabling some pretty neat effects in games if used properly. Without OIT, game designers are limited in what sort of scenes they can craft. It's a tough problem to solve, but one that has big impacts on game design.

Although OIT is commonly associated with DirectX 11, it's not a feature of the API but rather something that's enabled using the API. Intel's claim is that current implementations of OIT require unbounded amounts of memory and memory bandwidth (bandwidth requirements can scale quadratically with shader complexity). Given that Haswell (and other integrated graphics solutions) will be more limited on memory and memory bandwidth than the highest end discrete GPUs, it makes sense that Intel is motivated to find a smaller footprint and more bandwidth efficient way to implement OIT.

The hardware side of PixelSync is simply the enabling of programmable blend operations on Haswell. On PC GPU architectures, all frame buffer operations flow through fixed function hardware with limited flexibility. Interestingly enough, this is one area where the mobile GPUs have moved ahead of the desktop world - NVIDIA's Tegra GPUs reuse programmable pixel shader ALUs for frame buffer ops. The Haswell implementation isn't so severe. There are still fixed function ROPs, but the Haswell GPU core now includes hardware that locks and forces the serialization of memory accesses when triggered by the PixelSync extension. With 3D rendering being an embarrassingly parallel problem, having many shader instructions working on overlapping pixel areas can create issues when running things like OIT algorithms. What PixelSync does is allows the software to tell the hardware that for a particular segment of code, that any shaders running on the same pixel(s) need to be serialized rather than run in parallel. The serialization is limited to directly overlapping pixels, so performance should remain untouched for the rest of the code. This seemingly simple change goes a long way to enabling techniques like OIT, as well as giving developers the option of creating their own frame buffer operations.

The software side of PixelSync is an evolved version of Intel's own Order Independent Transparency algorithm that leverages high quality compression to reduce memory footprint and deliver predictable performance. Intel has talked a bit about an earlier version of this algorithm here for those interested.

Intel claims that two developers have already announced support for PixelSync, with Codemasters producer Clive Moody (GRID 2) appearing in the Intel press release excited about the new extension. Creative Assembly also made an appearance in the PR, claiming the extensions will be used in Total War: Rome II.

The second extension, InstantAccess is simply Intel's implementation of zero copy. Although Intel's processor graphics have supported unified memory for a while (CPU + GPU share the same physical memory), the two processors don't get direct access to each others memory space. Instead, if the GPU needs to work on something the CPU has in memory, it needs to make its own copy of it first. The copy process is time consuming and wasteful. As we march towards true heterogeneous computing, we need ways of allowing both processors to work on the same data in memory. With InstantAccess, Intel's graphics driver can deliver a pointer to a location in GPU memory that the CPU can then access directly. The CPU can work on that GPU address without a copy and then release it back to the GPU. AMD introduced support for something similar back with Llano.

FCAT: The Evolution of Frame Interval Benchmarking, Part 1

Posted: 27 Mar 2013 06:00 AM PDT

In the last year stuttering, micro-stuttering, and frame interval benchmarking have become a very big deal in the world of GPUs, and for good reason. Through the hard work of the Tech Report’s Scott Wasson and others, significant stuttering issues were uncovered involving AMD’s video cards.

At the same time however, as AMD has fixed those issues, both AMD and NVIDIA have laid out their concerns over existing testing methodologies, and why these may not be the best methodologies. To that end, today NVIDIA will be introducing a new too evaluating frame latencies: FCAT. As we will see, the Frame Capture Analysis Tool stands to change frame interval benchmarking as we know it, and as we will see it will be for the better.

Auto Car Full Design

Thursday, March 28, 2013

AnandTech Article Channel

AnandTech Article Channel

No comments:

Post a Comment

Popular Posts

Blog Archive

Followers