I have the following script that takes an image in RGB and converts it to Lab color space:
import tensorflow as tf
import tensorflow_io as tfio
img = tf.io.read_file(tf.keras.utils.get_file("tf", "https://upload.wikimedia.org/wikipedia/commons/e/e5/TensorFlow_Logo_with_text.png"))
img = tf.image.decode_png(img, channels=3)
img = tf.image.resize(img, [512, 512])
lab = tfio.experimental.color.rgb_to_lab(img)
lab = lab.numpy()
lab.shape # (512, 512, 3)
lab[:, :, 0].min() # 3660.3594
lab[:, :, 0].max() # 9341.573
lab[:, :, 1].min() # -49.76082
lab[:, :, 1].max() # 4273.1514
lab[:, :, 2].min() # -1256.8489
lab[:, :, 2].max() # 6293.9043
LAB space is three-dimensional, and covers the entire range of human color perception, or gamut. It is based on the opponent color model of human vision, where red/green forms an opponent pair, and blue/yellow forms an opponent pair. The lightness value, L*, also referred to as "Lstar," defines black at 0 and white at 100. The a* axis is relative to the green–red opponent colors, with negative values toward green and positive values toward red. The b* axis represents the blue–yellow opponents, with negative numbers toward blue and positive toward yellow.
The a* and b* axes are unbounded, and depending on the reference white they can easily exceed ±150 to cover the human gamut. Nevertheless, software implementations often clamp these values for practical reasons. For instance, if integer math is being used it is common to clamp a* and b* in the range of -128 to 127.
Why isn't 0 <= lab[:, :, 0].min() <= lab[:, :, 0].max() <= 100
true?
The function tfio.experimental.color.rgb_to_lab
expects its input to be a float normalized between 0 and 1.
You can call tf.image.convert_image_dtype
to normalize you image (if your input is a integer and your target output is a float, the function will normalize it between 0 and 1 automatically).
import tensorflow as tf
import tensorflow_io as tfio
img = tf.io.read_file(tf.keras.utils.get_file("tf", "https://upload.wikimedia.org/wikipedia/commons/e/e5/TensorFlow_Logo_with_text.png"))
img = tf.image.decode_png(img, channels=3)
img = tf.image.convert_image_dtype(img, dtype=tf.float32)
img = tf.image.resize(img, [512, 512])
lab = tfio.experimental.color.rgb_to_lab(img)
lab = lab.numpy()
And checking the L
dimension :
>>> lab[:,:,0].min()
33.678085
>>> lab[:,:,0].max()
100.0