rygorous · December 15, 2015 05:39 · darksylinc · Mar 21, 2013 · darksylinc · Mar 21, 2013
diff --git a/gistfile1.txt b/gistfile1.txt
 Weird rendering problem:

 We need to render a 3D object such that the z values getting passed on to depth test/write for all pixels
 are all exactly the same value (constant per batch), and we need to be able to choose that value freely.

 This is what we'd like to do, but it doesn't work:

  // at the end of the VS
  out.pos.z = ourZValue * out.pos.w;

 Because of round-off error, this is only *approximately* the same value at all vertices, not exactly the
 same like we need.

 Here's the ways we've come up with to solve the problem:

 1. Do the perspective divide in the vertex shader

  // at the end of the VS
  float oneOverW = 1.0f / out.pos.w;
  out.pos.xy *= oneOverW;
  out.pos.z = ourZValue;
  our.pos.w = 1.0f;

 With this, we can exactly control the depth value that gets written, but we lose perspective
 correction for interpolated quantities. We could pass multiply all attributes by oneOverW, pass
 oneOverW as extra attribute, and then do the perspective interpolation ourselves in the pixel shader,
 so now we need every pixel shader to be specialized for this, and we do manual perspective correction.

 Ugh.

 2. Pass ourZValue to the pixel shader (as constant / attribute), write it to oDepth.

 This is reasonably straightforward, but it involves writes to oDepth, and again having variants of the
 pixel shaders that do this. This is less "ugh" in terms of amount of code but still requires having
 basically 2x the pixel shaders and lots of ugly code paths.

 3. Massive depth bias abuse.

 We set ourZValue = 0 - this always ends up exact. Then, we set the actual Z value we want as a depth
 bias. This is nice in that it involves absolutely no modifications to any of the shaders, it's just a
 weird projection matrix we send to the VS with a z=0 row. It should also work fine with most rendering
 APIs we support.

 The problem is that on D3D10+, the depth bias is part of the rasterizer state, and in our case it
 changes per batch. So we'd probably end up creating (and destroying) a bunch of rasterizer state objects
 per frame. This is fairly iffy.

 4. Massive depth range / viewport abuse

 Set a depth range that has both the min and max end at ourZValue. Now, no matter what the VS outputs, we
 get ourZValue back, or at least should in theory!

 But now we're calling glDepthRange (GL) or *SetViewport (D3D) for all affected batches. There's no reason
 this cannot be fast - but it's extremely weird so I also wouldn't be surprised if it's a slow path
 regardless.

 5. ???

 If you have other ideas, please ping me: @rygorous on Twitter!
	Weird rendering problem:

	We need to render a 3D object such that the z values getting passed on to depth test/write for all pixels
	are all exactly the same value (constant per batch), and we need to be able to choose that value freely.

	This is what we'd like to do, but it doesn't work:

	// at the end of the VS
	out.pos.z = ourZValue * out.pos.w;

	Because of round-off error, this is only approximately the same value at all vertices, not exactly the
	same like we need.

	Here's the ways we've come up with to solve the problem:

	1. Do the perspective divide in the vertex shader

	// at the end of the VS
	float oneOverW = 1.0f / out.pos.w;
	out.pos.xy *= oneOverW;
	out.pos.z = ourZValue;
	our.pos.w = 1.0f;

	With this, we can exactly control the depth value that gets written, but we lose perspective
	correction for interpolated quantities. We could pass multiply all attributes by oneOverW, pass
	oneOverW as extra attribute, and then do the perspective interpolation ourselves in the pixel shader,
	so now we need every pixel shader to be specialized for this, and we do manual perspective correction.

	Ugh.

	2. Pass ourZValue to the pixel shader (as constant / attribute), write it to oDepth.

	This is reasonably straightforward, but it involves writes to oDepth, and again having variants of the
	pixel shaders that do this. This is less "ugh" in terms of amount of code but still requires having
	basically 2x the pixel shaders and lots of ugly code paths.

	3. Massive depth bias abuse.

	We set ourZValue = 0 - this always ends up exact. Then, we set the actual Z value we want as a depth
	bias. This is nice in that it involves absolutely no modifications to any of the shaders, it's just a
	weird projection matrix we send to the VS with a z=0 row. It should also work fine with most rendering
	APIs we support.

	The problem is that on D3D10+, the depth bias is part of the rasterizer state, and in our case it
	changes per batch. So we'd probably end up creating (and destroying) a bunch of rasterizer state objects
	per frame. This is fairly iffy.

	4. Massive depth range / viewport abuse

	Set a depth range that has both the min and max end at ourZValue. Now, no matter what the VS outputs, we
	get ourZValue back, or at least should in theory!

	But now we're calling glDepthRange (GL) or *SetViewport (D3D) for all affected batches. There's no reason
	this cannot be fast - but it's extremely weird so I also wouldn't be surprised if it's a slow path
	regardless.

	5. ???

	If you have other ideas, please ping me: @rygorous on Twitter!