ITR/AP: Beyond Polygons and Pixels: New Paradigms for Real-Time, Physically-Based Rendering
Cornell University, Ithaca NY
Investigators
Abstract
ITR: Beyond Polygons and Pixels: New Paradigms for Real-Time, Physically-Based Rendering The ultimate goal of this research proposal is to provide real-time, physically accurate synthetic images, delivering breakthrough realism at fully interactive rates. We will achieve this by combining new approaches for parallel graphics rendering with advanced, feature-level psychophysical models of human vision and new image representations. Currently, the two extremes of image synthesis are physically-based rendering, where accurate simulation of light reflection gives faithful predictions of the appearance of scenes, and real-time rendering, where crude approximations to accurate simulation are tolerated to provide dynamic imagery at interactive rates. High-quality, physically accurate images incorporating indirect lighting, inter-reflections between surfaces, and color bleeding can take hours or even days to compute on today's workstations, and incremental improvements to the speed of rendering will not be enough to bridge the gap. We estimate that real-time simulations of global illumination for complex scenes might require 10 7 times more processing power than we have on multi-processor workstations today. This can only be achieved by taking a radically different approach to how we generate, encode, and display synthetic images, an approach that separates computation by light reflection components and is based on advanced psychophysical models of human vision and new image representations. Low-cost, efficiently pipelined graphics accelerator boards have become extremely popular, but these architectures only process local illumination components, and thus cannot provide global illumination effects or guarantee physical accuracy. To address this shortcoming, we have developed new algorithms that exploit the speed of these accelerator boards for direct lighting while performing global illumination computations in parallel on clusters of off-the-shelf Intel microprocessors. We have reduced computation times from hours to minutes for complex environments, but for real-time image synthesis we need another four orders of magnitude speed-up. To reach this goal, we must develop more advanced models of human visual perception. Current perceptually-driven rendering methods are based on threshold models of human vision that predict the limits of our abilities to discriminate luminance contrasts, spatial patterns, motions, and colors, but provide no guidance for optimizing the order or precision of rendering operations. For our new perception oracles, we are developing higher-level visual models to monitor the importance of scene features such as shadows and reflections to perceived image quality. These new models will then drive the allocation of parallel computing resources as well as select appropriate algorithms to provide the "steepest ascent" solutions. New data structures for pictorial representation will incorporate illumination and contrast gradients as well as pixel-by-pixel intensities to ensure optimal display of physically accurate and perceptually indistinguishable solutions under all viewing conditions. These capabilities will extend the scientific, educational, and commercial application of graphical simulations into visually critical tasks where predictive reliability and speed are paramount.
View original record on NSF Award Search →