Introduction SIGGRAPH 2002- State of the Art in Hardware Shading Course Notes RADEON 9700

格式：pdf
大小：1.61 MB
文档页数：39

下载文档原格式

/ 39

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

RADEON™ 9700 Shading

Jason L. Mitchell

JasonM@

3D Application Research Group Lead

ATI Research

Introduction

In this supplement to Chapter 3 in the bound course notes, we will outline the shading capabilities of the new RADEON™ 9700 graphics processor and illustrate its power with a series of examples. After a brief overview of vertex shading on the RADEON™ 9700, we will spend the bulk of the time on the fragment (aka pixel) shader programming model. For much of this chapter, we will use Direct3D® syntax due to its brevity and readability, but all of this functionality (as shown in the section on homomorphic factorization of BRDFs) is available through multivendor and ATI-developed OpenGL extensions.

RADEON™ 9700 Vertex Shaders

The vertex shading model in the RADEON™ 9700 has advanced beyond the programming model found in previous graphics processors such as the RADEON™ 8500 by adding constant-based flow control capabilities. The motivation for this extension to the earlier programming models is to cut down on the number of shaders that must be created and managed by application developers. In previous vertex shader programming models such as the one found in DirectX® 8.1, developers generally had to create and use large numbers of shaders from fragments of shader code. This was necessary to handle the large numbers of permutations of drawing state (i.e. with/without environment mapping, with/without skinning, different numbers of and types of lights etc). The vertex shading model found in the RADEON™ 9700 enables developers to create a much smaller set of domain-specific über shaders which can contain constant-based loops, subroutines and branches to manage the permutation problem.

Vertex Shader Assembly Language

Like the vertex shading model in the previous generation of graphics processors, the RADEON™ 9700 can read from a bank of 4D float vectors as well as the input vertex components. A bank of read/write temps is available for intermediate computations and a bank of write-only output registers define the output from the vertex shader. Specific register counts for the RADEON™ 9700 are listed below.

Type Name Count

Float Constant c[n] 256 read-only vectors

Temporary r n12 read/write vectors

Input v n16 read-only vectors

RADEON™ 9700 ALU Register Counts

In addition to the ALU-related inputs and temps above, the RADEON™ 9700 supports a set of constants and a counter to control the flow of execution as shown below.

Type Name Count

Integer Constant I n16 read-only vectors

Address A

read/write

vector

Loop Counter aL 1 scalar

Boolean Constant B 16 read-only bits

RADEON™ 9700 Flow Control Register Counts

These constants are set through API calls and persist until set again. In this way, an application can program the control flow by setting a shader once and updating a few flow-control constants as appropriate. A vertex shader on the RADEON™ 9700 can be up to 256 instructions long, though more instructions than this may be executed due to looping, subroutines etc. The ALU instruction set supports the usual functionality such as ADD, DP3, DP4, EXP, FRAC, LOG, MAD, MAX, MIN, MOV, MUL, RCP, RSQ, SGE (set greater than or equal to) and SLT (set less than). The control flow instructions include CALL, LOOP, ENDLOOP, JUMP, JNZ, LABEL, REPEAT, ENDREPEAT and RETURN.

Introduction SIGGRAPH 2002- State of the Art in Hardware Shading Course Notes RADEON 9700

合集下载

文档推荐

最新文档