htmlimage-processinghtml5-canvascontrast

HTML5 Canvas image contrast


I've been writing an image processing program which applies effects through HTML5 canvas pixel processing. I've achieved Thresholding, Vintaging, and ColorGradient pixel manipulations but unbelievably I cannot change the contrast of the image! I've tried multiple solutions but I always get too much brightness in the picture and less of a contrast effect and I'm not planning to use any Javascript libraries since I'm trying to achieve these effects natively.

The basic pixel manipulation code:

var data = imageData.data;
for (var i = 0; i < data.length; i += 4) {
 //Note: data[i], data[i+1], data[i+2] represent RGB respectively
data[i] = data[i];
data[i+1] = data[i+1];
data[i+2] = data[i+2];
}

Pixel manipulation example

Values are in RGB mode which means data[i] is the Red color. So if data[i] = data[i] * 2; the brightness will be increased to twice for the Red channel of that pixel. Example:

var data = imageData.data;
for (var i = 0; i < data.length; i += 4) {
 //Note: data[i], data[i+1], data[i+2] represent RGB respectively
 //Increases brightness of RGB channel by 2
data[i] = data[i]*2;
data[i+1] = data[i+1]*2;
data[i+2] = data[i+2]*2;
}

*Note: I'm not asking you guys to complete the code! That would just be a favor! I'm asking for an algorithm (even Pseudo code) that shows how Contrast in pixel manipulation is possible! I would be glad if someone can provide a good algorithm for Image Contrast in HTML5 canvas.


Solution

  • A faster option (based on Escher's approach) is:

    function contrastImage(imgData, contrast){  //input range [-100..100]
        var d = imgData.data;
        contrast = (contrast/100) + 1;  //convert to decimal & shift range: [0..2]
        var intercept = 128 * (1 - contrast);
        for(var i=0;i<d.length;i+=4){   //r,g,b,a
            d[i] = d[i]*contrast + intercept;
            d[i+1] = d[i+1]*contrast + intercept;
            d[i+2] = d[i+2]*contrast + intercept;
        }
        return imgData;
    }
    

    Derivation similar to the below; this version is mathematically the same, but runs much faster.


    Original answer

    Here is a simplified version with explanation of an approach already discussed (which was based on this article):

    function contrastImage(imageData, contrast) {  // contrast as an integer percent  
        var data = imageData.data;  // original array modified, but canvas not updated
        contrast *= 2.55; // or *= 255 / 100; scale integer percent to full range
        var factor = (255 + contrast) / (255.01 - contrast);  //add .1 to avoid /0 error
    
        for(var i=0;i<data.length;i+=4)  //pixel values in 4-byte blocks (r,g,b,a)
        {
            data[i] = factor * (data[i] - 128) + 128;     //r value
            data[i+1] = factor * (data[i+1] - 128) + 128; //g value
            data[i+2] = factor * (data[i+2] - 128) + 128; //b value
    
        }
        return imageData;  //optional (e.g. for filter function chaining)
    }
    

    Notes

    1. I have chosen to use a contrast range of +/- 100 instead of the original +/- 255. A percentage value seems more intuitive for users, or programmers who don't understand the underlying concepts. Also, my usage is always tied to UI controls; a range from -100% to +100% allows me to label and bind the control value directly instead of adjusting or explaining it.

    2. This algorithm doesn't include range checking, even though the calculated values can far exceed the allowable range - this is because the array underlying the ImageData object is a Uint8ClampedArray. As MSDN explains, with a Uint8ClampedArray the range checking is handled for you:

    "if you specified a value that is out of the range of [0,255], 0 or 255 will be set instead."

    Usage

    Note that while the underlying formula is fairly symmetric (allows round-tripping), data is lost at high levels of filtering because pixels only allow integer values. For example, by the time you de-saturate an image to extreme levels (>95% or so), all the pixels are basically a uniform medium gray (within a few digits of the average possible value of 128). Turning the contrast back up again results in a flattened color range.

    Also, order of operations is important when applying multiple contrast adjustments - saturated values "blow out" (exceed the clamped max value of 255) quickly, meaning highly saturating and then de-saturating will result in a darker image overall. De-saturating and then saturating however doesn't have as much data loss, because the highlight and shadow values get muted, instead of clipped (see explanation below).

    Generally speaking, when applying multiple filters it's better to start each operation with the original data and re-apply each adjustment in turn, rather than trying to reverse a previous change - at least for image quality. Performance speed or other demands may dictate differently for each situation.

    Mandrill contrast examples

    Code Example:

    function contrastImage(imageData, contrast) {  // contrast input as percent; range [-1..1]
        var data = imageData.data;  // Note: original dataset modified directly!
        contrast *= 255;
        var factor = (contrast + 255) / (255.01 - contrast);  //add .1 to avoid /0 error.
    
        for(var i=0;i<data.length;i+=4)
        {
            data[i] = factor * (data[i] - 128) + 128;
            data[i+1] = factor * (data[i+1] - 128) + 128;
            data[i+2] = factor * (data[i+2] - 128) + 128;
        }
        return imageData;  //optional (e.g. for filter function chaining)
    }
    
    $(document).ready(function(){
      var ctxOrigMinus100 = document.getElementById('canvOrigMinus100').getContext("2d");
      var ctxOrigMinus50 = document.getElementById('canvOrigMinus50').getContext("2d");
      var ctxOrig = document.getElementById('canvOrig').getContext("2d");
      var ctxOrigPlus50 = document.getElementById('canvOrigPlus50').getContext("2d");
      var ctxOrigPlus100 = document.getElementById('canvOrigPlus100').getContext("2d");
      
      var ctxRoundMinus90 = document.getElementById('canvRoundMinus90').getContext("2d");
      var ctxRoundMinus50 = document.getElementById('canvRoundMinus50').getContext("2d");
      var ctxRound0 = document.getElementById('canvRound0').getContext("2d");
      var ctxRoundPlus50 = document.getElementById('canvRoundPlus50').getContext("2d");
      var ctxRoundPlus90 = document.getElementById('canvRoundPlus90').getContext("2d");
      
      
      var img = new Image();
      img.onload = function() {
        //draw orig
        ctxOrig.drawImage(img, 0, 0, img.width, img.height, 0, 0, 100, 100); //100 = canvas width, height
        
        //reduce contrast
        var origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, -.98);
        ctxOrigMinus100.putImageData(origBits, 0, 0);
        
        var origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, -.5);
        ctxOrigMinus50.putImageData(origBits, 0, 0);
        
        // add contrast
        var origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, .5);
        ctxOrigPlus50.putImageData(origBits, 0, 0);
        
        var origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, .98);
        ctxOrigPlus100.putImageData(origBits, 0, 0);
        
        
        //round-trip, de-saturate first
        origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, -.98);
        contrastImage(origBits, .98);
        ctxRoundMinus90.putImageData(origBits, 0, 0);
        
        origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, -.5);
        contrastImage(origBits, .5);
        ctxRoundMinus50.putImageData(origBits, 0, 0);
        
        //do nothing 100 times
        origBits = ctxOrig.getImageData(0, 0, 100, 100);
        for(i=0;i<100;i++){
          contrastImage(origBits, 0);
        }
        ctxRound0.putImageData(origBits, 0, 0);
        
        //round-trip, saturate first
        origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, .5);
        contrastImage(origBits, -.5);
        ctxRoundPlus50.putImageData(origBits, 0, 0);
        
        origBits = ctxOrig.getImageData(0, 0, 100, 100);
        contrastImage(origBits, .98);
        contrastImage(origBits, -.98);
        ctxRoundPlus90.putImageData(origBits, 0, 0);
      };
      
      img.src = "";
      
    });
    canvas {width: 100px; height: 100px}
    div {text-align:center; width:120px; float:left}
    <script src="https://ajax.googleapis.com/ajax/libs/jquery/2.1.1/jquery.min.js"></script>
    
    <div>
      <canvas id="canvOrigMinus100" width="100" height="100"></canvas>
      -98%
    </div>
    
    <div>
      <canvas id="canvOrigMinus50" width="100" height="100"></canvas>
      -50%
    </div>
    
    <div>
      <canvas id="canvOrig" width="100" height="100"></canvas>
      Original
    </div>
    
    <div>
      <canvas id="canvOrigPlus50" width="100" height="100"></canvas>
      +50%
    </div>
    
    <div>
      <canvas id="canvOrigPlus100" width="100" height="100"></canvas>
      +98%
    </div>
    
      <hr/>
    
    <div style="clear:left">
      <canvas id="canvRoundMinus90" width="100" height="100"></canvas>
      Round-trip <br/> (-98%, +98%)
    </div>
    
    <div>
      <canvas id="canvRoundMinus50" width="100" height="100"></canvas>
      Round-trip <br/> (-50%, +50%)
    </div>
    
    <div>
      <canvas id="canvRound0" width="100" height="100"></canvas>
      Round-trip <br/> (0% 100x)
    </div>
    
    <div>
      <canvas id="canvRoundPlus50" width="100" height="100"></canvas>
      Round-trip <br/> (+50%, -50%)
    </div>
    
    <div>
      <canvas id="canvRoundPlus90" width="100" height="100"></canvas>
      Round-trip <br/> (+98%, -98%)
    </div>

    Explanation

    (Disclaimer - I am not an image specialist or a mathematician. I am trying to provide a common-sense explanation with minimal technical details. Some hand-waving below, e.g. 255=256 to avoid indexing issues, and 127.5=128, for simplifying the numbers.)

    Since, for a given pixel, the possible number of non-zero values for a color channel is 255, the "no-contrast", average value of a pixel is 128 (or 127, or 127.5 if you want argue, but the difference is negligible). For purposed of this explanation, the amount of "contrast" is the distance from the current value to the average value (128). Adjusting the contrast means increasing or decreasing the difference between the current value and the average value.

    The problem the algorithm solves then is to:

    1. Chose a constant factor to adjust contrast by
    2. For each color channel of each pixel, scale "contrast" (distance from average) by that constant factor

    Or, as hinted at in the CSS spec, simply choosing the slope and intercept of a line:

    <feFuncR type="linear" slope="[amount]" intercept="-(0.5 * [amount]) + 0.5"/>

    Note the term type='linear'; we are doing linear contrast adjustment in RGB color space, as opposed to a quadratic scaling function, luminence-based adjustment, or histogram matching.

    If you recall from geometry class, the formula for a line is y=mx+b. y is the final value we are after, the slope m is the contrast (or factor), x is the initial pixel value, and b is the intercept of the y-axis (x=0), which shifts the line vertically. Recall also that since the y-intercept is not at the origin (0,0), the formula can also be represented as y=m(x-a)+b, where a is the x-offset shifting the line horizontally.

    Formula for slope of a line

    For our purposes, this graph represents the input value (x-axis) and the result (y-axis). We already know that b, the y-intercept (for m=0, no contrast) must be 128 (which we can check against the 0.5 from the spec - 0.5 * the full range of 256 = 128). x is our original value, so all we need is to figure out the slope m and x-offset a.

    First, the slope m is "rise over run", or (y2-y1)/(x2-x1) - so we need 2 points known to be on the desired line. Finding these points requires bringing a few things together:

    Taking all these together, we can deduce that regardless of the contrast (slope) applied, our resulting line will be centered at (and pivot around) 128,128. Since our y-intercept is non-zero, the x-intercept is also non-zero; we know the x-range is 256 wide and is centered in the middle, so it must be offset by half of the possible range: 256 / 2 = 128.

    Contrast function slopes

    So now for y=m(x-a)+b, we know everything except m. Recall two more important points from geometry class:

    To simplify the slope discussion, let's move the coordinate origin to the x-intercept (-128) and ignore a and b for a moment. Our original line will now pivot through (0,0), and we know a second point on the line lies away the full range of both x (input) and y (output) at (255,255).

    We'll let the new line pivot at (0,0), so we can use that as one of the points on the new line that will follow our final contrast slope m. The second point can be determined by moving the current end at (255,255) by some amount; since we are limited to a single input (contrast) and using a linear function, this second point will be moved equally in the x and y directions on our graph.

    Adjusting the contrast slope

    The (x,y) coordinates of the 4 possible new points will be 255 +/- contrast. Since increasing or decreasing both x and y would keep us on the original 1:1 line, let's just look at +x, -y and -x, +y as shown.

    The steeper line (-x, +y) is associated with a positive contrast adjustment; it's (x,y) coordinates are (255 - contrast,255 + contrast). The coordinates of the shallower line (negative contrast) are found the same way. Notice that the biggest meaningful value of contrast will be 255 - the most that the initial point of (255,255) can be translated before resulting in a vertical line (full contrast, all black or white) or a horizontal line (no contrast, all gray).

    So now we have the coordinates of two points on our new line - (0,0) and (255 - contrast,255 + contrast). We plug this into the slope equation, and then plug that into the full line equation, using all the parts from before:

    y = m(x-a) + b

    m = (y2-y1)/(x2-x1) =>
    ((255 + contrast) - 0)/((255 - contrast) - 0) =>
    (255 + contrast)/(255 - contrast)

    a = 128
    b = 128

    y = (255 + contrast)/(255 - contrast) * (x - 128) + 128 QED

    The math-minded will notice that the resulting m or factor is a scalar (unitless) value; you can use any range you want for contrast as long as it matches the constant (255) in the factor calculation. For example, a contrast range of +/-100 and factor = (100 + contrast)/(100.01 - contrast), which is was I really use to eliminate the step of scaling to 255; I just left 255 in the code at the top to simplify the explanation.


    Note about the "magic" 259

    The source article uses a "magic" 259, although the author admits he doesn't remember why:

    "I can’t remember if I had calculated this myself or if I’ve read it in a book or online.".

    259 should really be 255 or perhaps 256 - the number of possible non-zero values for each channel of each pixel. Note that in the original factor calculation, 259/255 cancels out - technically 1.01, but final values are whole integers so 1 for all practical purposes. So this outer term can be discarded. Actually using 255 for the constant in the denominator, though, introduces the possibility of a Divide By Zero error in the formula; adjusting to a slightly larger value (say, 259) avoids this issue without introducing significant error to the results. I chose to use 255.01 instead as the error is lower and it (hopefully) seems less "magic" to a newcomer.

    As far as I can tell though, it doesn't make much difference which you use - you get identical values except for minor, symmetric differences in a narrow band of low contrast values with a low positive contrast increase. I'd be curious to round-trip both versions repeatedly and compare to the original data, but this answer already took way too long. :)