WebGL Clustered Deferred and Forward+ Shading

Course project #5 for CIS 565: GPU Programming and Architecture, University of Pennsylvania

(TODO) YOUR NAME HERE
Tested on: Google Chrome 62.0.3202.62 on:
- Mac OSX 10.10.5
- Processor: 2.5 GHz Intel Core i7
- Memory: 16 GB 1600 MHz DDR3
- Graphics: Intel Iris Pro 1536 MB

Project Overview

The goal of this project was to get an introduction to Clustered Deferred and Clustered Forward+ Shading in WebGL.

Live Online

Demo Video/GIF

Forward+ (100 lights)

Deferred (250 lights, with Blinn-Phong shading and gamma correction)

Features and Optimizations

- Clustered Forward+ shading
- Clustered Deferred shading with g-buffers
- Blinn-Phong shading (diffuse + specular) for point lights
- Gamma correction
- Optimized g-buffer format (by reducing the number and size of g-buffers)
  - Packing values together into vec4s
  - Using 2-component normals

Algorithm Descriptions

Forward Rendering

Forward rendering works by rasterizing each geometric object in the scene. Each object is shaded against every light in the scene according to its material and light type, which typically means one shader per material/light-type combination. As a result, every geometric object has to consider every light in the scene.

One optimization is to skip geometric objects that are occluded or that fall outside the camera's view frustum. The same idea applies to lights: you can perform frustum culling on the light volumes before rendering the scene geometry.
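As an illustration of light-volume frustum culling, here is a minimal sketch that treats each point light as a bounding sphere and tests it against a set of inward-facing planes. The plane representation, the function names, and the box-shaped stand-in frustum are assumptions made for this example; they are not taken from the project's code.

```javascript
// Each plane is { normal: [x, y, z], d } with the normal pointing into the
// frustum, so a point p is inside when dot(normal, p) + d >= 0.
function dot(a, b) {
  return a[0] * b[0] + a[1] * b[1] + a[2] * b[2];
}

// Returns false only when the sphere lies entirely behind some plane.
// This is a conservative test: spheres near frustum corners may be kept
// even though they do not actually touch the frustum.
function sphereIntersectsFrustum(planes, center, radius) {
  for (const plane of planes) {
    if (dot(plane.normal, center) + plane.d < -radius) {
      return false;
    }
  }
  return true;
}

// Keep only the lights whose bounding sphere may touch the frustum.
function cullLights(planes, lights) {
  return lights.filter((l) => sphereIntersectsFrustum(planes, l.position, l.radius));
}

// Hypothetical stand-in for a frustum: an axis-aligned box from -1 to 1.
const boxPlanes = [
  { normal: [1, 0, 0], d: 1 },
  { normal: [-1, 0, 0], d: 1 },
  { normal: [0, 1, 0], d: 1 },
  { normal: [0, -1, 0], d: 1 },
  { normal: [0, 0, 1], d: 1 },
  { normal: [0, 0, -1], d: 1 },
];

const exampleLights = [
  { position: [0, 0, 0], radius: 0.5 },   // fully inside
  { position: [3, 0, 0], radius: 1 },     // fully outside
  { position: [1.5, 0, 0], radius: 1 },   // straddles the +x face
];
const visibleLights = cullLights(boxPlanes, exampleLights);
```

With a real camera, the six planes would be extracted from the view-projection matrix instead of being hard-coded.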

Object culling and light volume culling provide limited optimizations for this technique and light culling is often not practiced when using a forward rendering pipeline. It is better to limit the number of lights that affect the entire object.

Clustered Forward+

Clustered Forward+ is a rendering technique that combines forward rendering with tiled light culling to reduce the number of lights that must be considered during shading. Forward+ primarily consists of two stages: light culling and forward rendering.

The first pass of the Forward+ rendering technique uses a uniform grid of tiles in screen space to partition the lights into per-tile lists.

Rather than using 2D tiles, we use their 3D counterparts, called "clusters". Each cluster represents a portion of the camera frustum, so the set of clusters follows the view as we move around in the scene. Lights in the scene are assigned to every cluster they overlap. The clusters are stored in a 2D texture that records, for each cluster, how many lights it contains and a list of which lights they are.
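The assignment of lights to clusters can be sketched on the CPU as follows. This is a simplified illustration, not the project's implementation: it assumes view-space sphere lights, a camera looking down -z, linear depth slices, and a conservative bounding-box test. The `camera` and `dims` values and all function names are hypothetical.

```javascript
// Hypothetical camera and grid parameters, chosen only for illustration.
const camera = { tanHalfFovY: Math.tan(Math.PI / 8), aspect: 1.0, near: 0.1, far: 100.0 };
const dims = { x: 4, y: 4, z: 4 };

// Map a view-space point (camera looks down -z) to unclamped slice coordinates.
function sliceCoords(p, cam, dims) {
  const depth = Math.min(Math.max(-p[2], cam.near), cam.far);
  const halfH = cam.tanHalfFovY * depth;                // frustum half-height at this depth
  const halfW = halfH * cam.aspect;                     // frustum half-width at this depth
  const u = (p[0] / halfW) * 0.5 + 0.5;                 // 0..1 across the frustum
  const v = (p[1] / halfH) * 0.5 + 0.5;
  const w = (depth - cam.near) / (cam.far - cam.near);  // linear depth slices (an assumption)
  return [Math.floor(u * dims.x), Math.floor(v * dims.y), Math.floor(w * dims.z)];
}

// Build one light-index list per cluster from view-space sphere lights.
function assignLights(lights, cam, dims) {
  const lists = Array.from({ length: dims.x * dims.y * dims.z }, () => []);
  const size = [dims.x, dims.y, dims.z];
  lights.forEach((light, lightIndex) => {
    const [cx, cy, cz] = light.position;
    const r = light.radius;
    // Skip lights entirely nearer than the near plane (including behind the
    // camera) or entirely beyond the far plane.
    if (-cz + r < cam.near || -cz - r > cam.far) return;
    const lo = [Infinity, Infinity, Infinity];
    const hi = [-Infinity, -Infinity, -Infinity];
    // The slice range of the sphere's bounding box is spanned by its 8 corners.
    for (const dx of [-r, r]) for (const dy of [-r, r]) for (const dz of [-r, r]) {
      const s = sliceCoords([cx + dx, cy + dy, cz + dz], cam, dims);
      for (let a = 0; a < 3; a++) {
        lo[a] = Math.min(lo[a], s[a]);
        hi[a] = Math.max(hi[a], s[a]);
      }
    }
    // Clamp to the grid; a light outside the frustum yields an empty range.
    for (let a = 0; a < 3; a++) {
      lo[a] = Math.max(lo[a], 0);
      hi[a] = Math.min(hi[a], size[a] - 1);
    }
    for (let z = lo[2]; z <= hi[2]; z++)
      for (let y = lo[1]; y <= hi[1]; y++)
        for (let x = lo[0]; x <= hi[0]; x++)
          lists[x + y * dims.x + z * dims.x * dims.y].push(lightIndex);
  });
  return lists;
}

const clusterLists = assignLights(
  [
    { position: [0, 0, -50], radius: 1 }, // in front of the camera
    { position: [0, 0, 10], radius: 1 },  // behind the camera, never assigned
  ],
  camera,
  dims
);
```

The bounding-box test over-assigns slightly compared with an exact sphere-cluster intersection, which costs some shading work but never drops a light that should contribute.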

The second pass is a standard forward rendering pass that shades the objects in the scene. Instead of looping over every dynamic light in the scene, the shader uses the current fragment's screen-space position and depth to look up the list of lights for its cluster, which was computed in the previous pass. Light culling provides a significant performance improvement over standard forward rendering because it greatly reduces the number of lights that must be iterated to correctly light each pixel.
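To make the lookup concrete, the sketch below packs per-cluster light lists into a flat RGBA float buffer and reads them back the way a shader would. The layout is an assumption for illustration: the texture is one texel wide per cluster, component 0 of each cluster holds the light count, and the following components hold light indices. `MAX_LIGHTS_PER_CLUSTER` and the function names are hypothetical.

```javascript
const MAX_LIGHTS_PER_CLUSTER = 7; // hypothetical cap; 1 count + 7 indices = 2 RGBA texels

// Offset of the k-th float belonging to a cluster. The texture is assumed to
// be numClusters texels wide, with 4 floats (RGBA) per texel, so float k
// lives in row floor(k / 4), channel k % 4.
function componentOffset(cluster, k, numClusters) {
  const row = Math.floor(k / 4);
  return 4 * (cluster + row * numClusters) + (k % 4);
}

// Pack an array of light-index lists (one per cluster) into a Float32Array
// that could be uploaded as a floating-point texture.
function packClusters(lists) {
  const numClusters = lists.length;
  const rows = Math.ceil((MAX_LIGHTS_PER_CLUSTER + 1) / 4);
  const buffer = new Float32Array(4 * numClusters * rows);
  lists.forEach((lights, c) => {
    const count = Math.min(lights.length, MAX_LIGHTS_PER_CLUSTER);
    buffer[componentOffset(c, 0, numClusters)] = count;
    for (let i = 0; i < count; i++) {
      buffer[componentOffset(c, i + 1, numClusters)] = lights[i];
    }
  });
  return { buffer, numClusters };
}

// Mirror of the per-fragment lookup: read the count, then that many indices.
function lightsForCluster(packed, cluster) {
  const { buffer, numClusters } = packed;
  const count = buffer[componentOffset(cluster, 0, numClusters)];
  const result = [];
  for (let i = 0; i < count; i++) {
    result.push(buffer[componentOffset(cluster, i + 1, numClusters)]);
  }
  return result;
}

const packedClusters = packClusters([[3, 5], [], [0, 1, 2, 4, 6]]);
```

In a fragment shader the same arithmetic would become texture coordinates, and the shading loop would run only over the returned indices rather than over every light in the scene.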

Clustered Deferred

Clustered deferred works by rasterizing all of the scene objects (without lighting) into a series of 2D image buffers (g-buffers) that store the geometric information that is required to perform the lighting calculations in a later pass. The information that is stored into the 2D image buffers can be things like:

- screen space depth
- surface normals
- diffuse color

After the g-buffer has been generated, the geometric information can then be used to compute the lighting information in the lighting pass. The lighting pass is performed by rendering each light source as a geometric object in the scene. Each pixel that is touched by the light’s geometric representation is shaded using the desired lighting equation. This is done using the same clustering technique as described in the Forward+ section above.

Advantages compared to forward rendering:

- It decouples lighting from the scene complexity
- You only transform and rasterize each object once
- The expensive lighting calculations are only computed once per light per covered pixel

Disadvantages:

- Memory bandwidth usage: must read the g-buffer for each light
- Must recalculate the full lighting equation for each light
- Can't handle transparent objects because the g-buffers only store the front-most fragment

More on transparency (from Rendering Technique Comparisons):

One of the disadvantages of using deferred shading is that only opaque objects can be rasterized into the G-buffers. The reason for this is that multiple transparent objects may cover the same screen pixels but it is only possible to store a single value per pixel in the G-buffers. In the lighting pass the depth value, surface normal, diffuse and specular colors are sampled for the current screen pixel that is being lit. Since only a single value from each G-buffer is sampled, transparent objects cannot be supported in the lighting pass.

To circumvent this issue, transparent geometry must be rendered using the standard forward rendering technique which limits either the amount of transparent geometry in the scene or the number of dynamic lights in the scene. A scene which consists of only opaque objects can handle about 2000 dynamic lights before frame-rate issues start appearing.

Another disadvantage of deferred shading is that only a single lighting model can be simulated in the lighting pass. This is due to the fact that it is only possible to bind a single pixel shader when rendering the light geometry. This is usually not an issue for pipelines that make use of übershaders as rendering with a single pixel shader is the norm, however if your rendering pipeline takes advantage of several different lighting models implemented in various pixel shaders then it will be problematic to switch your rendering pipeline to use deferred shading.

Performance Analysis

Rendering Analysis: Forward vs. Clustered Forward+ vs. Clustered Deferred

As can be seen in the graph above, deferred shading is drastically faster than forward+ and forward rendering, starting from 10 lights in the scene. As explained in the section above, by decoupling lights from the scene complexity and storing geometry information in g-buffers, rasterization is done only once per object and expensive lighting calculations are only computed once per light per covered pixel. Light culling through cluster organization also offers a huge time advantage.

Effects Analysis: Blinn-Phong shading with gamma correction

This reflection model combines diffuse reflection, specular reflection (shiny highlights), and ambient lighting (light in places that are not lit by direct light rays). It is a local lighting model for points on a surface: the result does not depend on other objects in the scene or on repeatedly reflected light rays. More information is in the Blinn-Phong Shading Model resource listed under Credits and Resources.
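The model can be written out as a short sketch. It is a CPU-side illustration in JavaScript rather than the project's shader code; it omits distance attenuation, uses a single scalar ambient factor, and assumes a display gamma of 2.2. All function and parameter names are hypothetical.

```javascript
const sub = (a, b) => [a[0] - b[0], a[1] - b[1], a[2] - b[2]];
const add = (a, b) => [a[0] + b[0], a[1] + b[1], a[2] + b[2]];
const dot = (a, b) => a[0] * b[0] + a[1] * b[1] + a[2] * b[2];
const normalize = (v) => {
  const len = Math.sqrt(dot(v, v));
  return len > 0 ? [v[0] / len, v[1] / len, v[2] / len] : v;
};

// Blinn-Phong for one point light: ambient + diffuse + specular, where the
// specular term uses the half vector between light and view directions.
function blinnPhong({ position, normal, albedo, eye, light, shininess, ambient }) {
  const N = normalize(normal);
  const L = normalize(sub(light.position, position)); // surface -> light
  const V = normalize(sub(eye, position));            // surface -> eye
  const H = normalize(add(L, V));                     // half vector
  const diffuse = Math.max(dot(N, L), 0);
  // No highlight when the light is behind the surface.
  const specular = diffuse > 0 ? Math.pow(Math.max(dot(N, H), 0), shininess) : 0;
  return albedo.map((c, i) => c * ambient + light.color[i] * (c * diffuse + specular));
}

// Convert a linear color to display space: clamp to [0, 1], then raise to 1/gamma.
function gammaCorrect(color, gamma = 2.2) {
  return color.map((c) => Math.pow(Math.min(Math.max(c, 0), 1), 1 / gamma));
}

const litColor = blinnPhong({
  position: [0, 0, 0],
  normal: [0, 0, 1],
  albedo: [0.5, 0.5, 0.5],
  eye: [0, 0, 5],
  light: { position: [0, 0, 2], color: [1, 1, 1] },
  shininess: 32,
  ambient: 0,
});
const displayColor = gammaCorrect(litColor);
```

With many lights, the body of `blinnPhong` runs once per light per fragment, which is why the extra specular work becomes more visible as the light count grows.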

As can be seen in the graph above, the extra computations needed to accomplish a Blinn-Phong shading model add time as the number of lights increases.

Optimization Analysis

The first pass of the deferred shader writes the color, normal, and fragment position data to the g-buffers. Rather than using 3 g-buffers, you can use 2 by packing the x and y components of the normal into the first 2 buffers (for example, in the otherwise unused fourth components of the color and position vec4s). Transform the normal into view space by multiplying it by the view matrix first, so that the z component of the normal can be assumed positive for visible surfaces. Because the normal is a unit vector, its magnitude is 1, so the second shader pass can decode z from the magnitude equation: z = sqrt(1 - x^2 - y^2).
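The encode and decode steps for 2-component normals can be sketched as below. This is an illustration in JavaScript rather than the project's GLSL, and the function names are hypothetical. It relies on the assumption stated above that the view-space normal has a non-negative z.

```javascript
// Keep only x and y of a unit-length view-space normal.
function encodeNormal(n) {
  return [n[0], n[1]];
}

// Recover z from |n| = 1, assuming z >= 0. The max() guards against a
// slightly negative value caused by rounding in the stored components.
function decodeNormal(xy) {
  const zSquared = 1 - xy[0] * xy[0] - xy[1] * xy[1];
  return [xy[0], xy[1], Math.sqrt(Math.max(zSquared, 0))];
}

const decodedFront = decodeNormal(encodeNormal([0.6, 0, 0.8]));
// A normal with negative z comes back mirrored, which is the known
// limitation of this encoding.
const decodedBack = decodeNormal(encodeNormal([0.6, 0, -0.8]));
```

The sign of z is lost in the round trip, so a normal pointing away from the camera decodes to its mirror image. That is the trade-off accepted in exchange for dropping a g-buffer.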

As can be seen in the graph and chart above, compacting normals provides a modest advantage (about 10 ms faster), especially when rendering more than 500 lights in the scene.

Other optimizations to consider

Other optimizations that I would like to implement:

- Using octahedron normal encoding
- Calculating the fragment position in view/camera space in the vertex shader by multiplying it with the view matrix
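For reference, octahedron normal encoding maps a unit vector onto an octahedron and unfolds it into a square, so that two components in [-1, 1] can represent any direction, including normals with negative z. The sketch below follows the commonly published formulation. It was not implemented in this project, and the function names are hypothetical.

```javascript
// Sign function that treats 0 as positive, so the fold is well defined on axes.
const signNotZero = (v) => (v >= 0 ? 1 : -1);

// Project the unit vector onto the octahedron |x| + |y| + |z| = 1, then fold
// the lower hemisphere (z < 0) over the diagonals of the [-1, 1] square.
function octEncode(n) {
  const l1 = Math.abs(n[0]) + Math.abs(n[1]) + Math.abs(n[2]);
  const px = n[0] / l1;
  const py = n[1] / l1;
  if (n[2] < 0) {
    return [(1 - Math.abs(py)) * signNotZero(px), (1 - Math.abs(px)) * signNotZero(py)];
  }
  return [px, py];
}

// Invert the mapping: rebuild z, unfold the lower hemisphere, renormalize.
function octDecode(e) {
  const z = 1 - Math.abs(e[0]) - Math.abs(e[1]);
  let x = e[0];
  let y = e[1];
  if (z < 0) {
    x = (1 - Math.abs(e[1])) * signNotZero(e[0]);
    y = (1 - Math.abs(e[0])) * signNotZero(e[1]);
  }
  const len = Math.sqrt(x * x + y * y + z * z);
  return [x / len, y / len, z / len];
}

const octRoundTrip = octDecode(octEncode([0.6, 0, -0.8]));
```

Unlike dropping z and reconstructing it with a square root, this encoding keeps the sign of z, so it does not depend on the normal facing the camera.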

Credits and Resources

Three.js by @mrdoob and contributors
stats.js by @mrdoob and contributors
webgl-debug by Khronos Group Inc.
glMatrix by @toji and contributors
minimal-gltf-loader by @shrekshao
CIS 460 lecture notes on camera frustum
Blinn-Phong Shading Model
Forward vs Deferred Rendering
glMatrix Documentation
Intro to real-time shading of many lights SIGGRAPH course notes
Practical Clustered Shading - Avalanche Studios
Rendering Technique Comparisons

Normal Compression

Other good resources (unused)

MegSesh/Project5-WebGL-Clustered-Deferred-Forward-Plus
