URP管线下unity_LODFade的骚操作

玩 The Illustrated Nature，发现他的植物做了Lod，并且Lod的过渡并不是硬切的，有一个渐变，如下：

跟了一下代码，发现主要的逻辑就3行：

#ifdef LOD_FADE_CROSSFADE
	LODDitheringTransition( IN.clipPos.xyz, unity_LODFade.x );
#endif

上述代码里的 clipPos 即 齐次裁剪空间 的坐标，unity_LODFade 是Unity内置变量，unity_LODFade.x 表示是 当前LOD等级的权重，Unity文档对其有一个简单说明：

Unity doesn’t provide a default built-in technique to blend LOD geometries. You need to implement your own technique according to your game type and Asset production pipeline.

Unity calculates a blend factor from the GameObject’s screen size and passes it to your shader in the unity_LODFade.x uniform variable. Depending on the Fade Mode you choose, use either the LOD_FADE_PERCENTAGE or LOD_FADE_CROSSFADE keyword for GameObjects rendered with LOD fading.

再来看一下 LODDitheringTransition 函数。因为我使用的是 URP 管线，直接下载源码，看一下 LODDitheringTransition 到底做了什么。

// LOD dithering transition helper
// LOD0 must use this function with ditherFactor 1..0
// LOD1 must use this function with ditherFactor -1..0
// This is what is provided by unity_LODFade
void LODDitheringTransition(uint2 fadeMaskSeed, float ditherFactor)
{
    // Generate a spatially varying pattern.
    // Unfortunately, varying the pattern with time confuses the TAA, increasing the amount of noise.
    float p = GenerateHashedRandomFloat(fadeMaskSeed);

    // This preserves the symmetry s.t. if LOD 0 has f = x, LOD 1 has f = -x.
    float f = ditherFactor - CopySign(p, ditherFactor);
    clip(f);
}

LODDitheringTransition 函数先对 clipPos 做了一个hash操作生成一个浮点数，再把这个浮点数结合 unity_LODFade.x 做 clip 操作，丢弃一些像素以达到 平滑过渡 的效果。

下面我们就来看看 LODDitheringTransition 的实现细节。

实现细节

关于unity_LODFade.x的设置

关于LOD，如果作上面的 平滑过渡 效果，那么在过渡的阶段，存在 2个LOD等级共存 的情况，这里有一定的性能问题，并且 clip 操作在很多平台是很昂贵的。

我们姑且不考虑性能问题，LODDitheringTransition 的代码注释写的比Unity文档清楚的地方在于，他告诉了我们 unity_LODFade.x 到底是怎么设值的。

LOD0 must use this function with ditherFactor 1..0
LOD1 must use this function with ditherFactor -1..0
This is what is provided by unity_LODFade

在 平滑过渡 的过程中，LOD0 从 1过渡到0，LOD1 从 -1过渡到0，裁剪计算会依据这个权重的正负做不同的处理。

关于GenerateHashedRandomFloat

GenerateHashedRandomFloat 针对不同维度的输入，实现代码如下：

float GenerateHashedRandomFloat(uint x)
{
    return ConstructFloat(JenkinsHash(x));
}

float GenerateHashedRandomFloat(uint2 v)
{
    return ConstructFloat(JenkinsHash(v));
}

float GenerateHashedRandomFloat(uint3 v)
{
    return ConstructFloat(JenkinsHash(v));
}

float GenerateHashedRandomFloat(uint4 v)
{
    return ConstructFloat(JenkinsHash(v));
}

可以看到它主要包括2步操作： JenkinsHash 和 ConstructFloat。

JenkinsHash 首先对 clipPos 做一个简化的 Jenkins hash，考虑1维的情况，算法如下：

// A single iteration of Bob Jenkins' One-At-A-Time hashing algorithm.
uint JenkinsHash(uint x)
{
    x += (x << 10u);
    x ^= (x >>  6u);
    x += (x <<  3u);
    x ^= (x >> 11u);
    x += (x << 15u);
    return x;
}

Hash完毕后，ConstructFloat 函数利用Hash的结果再生成一个 [0,1) 区间的浮点数，用于最后的 clip 操作。

// Construct a float with half-open range [0, 1) using low 23 bits.
// All zeros yields 0, all ones yields the next smallest representable value below 1.
float ConstructFloat(int m) {
    const int ieeeMantissa = 0x007FFFFF; // Binary FP32 mantissa bitmask
    const int ieeeOne      = 0x3F800000; // 1.0 in FP32 IEEE

    m &= ieeeMantissa;                   // Keep only mantissa bits (fractional part)
    m |= ieeeOne;                        // Add fractional part to 1.0

    float  f = asfloat(m);               // Range [1, 2)
    return f - 1;                        // Range [0, 1)
}

这里是一个 骚操作，考虑一下 单精度浮点数的内存布局

ConstructFloat 风骚的利用Hash结果的 低23位 做尾数，和 0x3F800000(浮点数1.0的int表示) 做 或操作，再把得到的结果 从int转为float，就得到了 [1,2) 区间的浮点数，再减去1.0f，就得到了[0,1) 区间的浮点数结果。

关于CopySign

拿到 GenerateHashedRandomFloat 的计算结果后，最后就是根据 unity_LODFade.x 做裁剪操作。前面提到 unity_LODFade.x 的取值 有正有负，所以这里要对正负做不同的处理。

这里最主要的就是 CopySign 这个操作。

// Composes a floating point value with the magnitude of 'x' and the sign of 's'.
// See the comment about FastSign() below.
float CopySign(float x, float s, bool ignoreNegZero = true)
{
#if !defined(SHADER_API_GLES)
    if (ignoreNegZero)
    {
        return (s >= 0) ? abs(x) : -abs(x);
    }
    else
    {
        uint negZero = 0x80000000u;
        uint signBit = negZero & asuint(s);
        return asfloat(BitFieldInsert(negZero, signBit, asuint(x)));
    }
#else
    return (s >= 0) ? abs(x) : -abs(x);
#endif
}

CopySign 就是让 GenerateHashedRandomFloat 结果的符号和 unity_LODFade.x 的符号一样。

这样无论 unity_LODFade.x 是 [1,0] 还是 [-1,0]，我们都可以得到正确的裁剪结果。

关于标准管线

最后，LODDitheringTransition 是 SRP 特有的函数。

回到Unity的 标准管线，也有一个类似的操作： UNITY_APPLY_DITHER_CROSSFADE，这个留待下文介绍。

URP管线下unity_LODFade的骚操作