Wednesday, April 23, 2025

Unit 2 - Image Quantization and Image Transforms: Theory-Based Answers

 

1. Sampling Theorem

▶ Theory:

The Sampling Theorem, also known as the Nyquist-Shannon Sampling Theorem, is the foundation of digital signal and image processing. It states that a band-limited analog signal can be perfectly reconstructed from its samples if it is sampled at a frequency greater than or equal to twice the maximum frequency present in the signal.

▶ In image processing:

Images are sampled in both horizontal and vertical directions. Insufficient sampling leads to loss of detail and aliasing.
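To see the theorem in action, here is a minimal sketch (assuming only NumPy): a 7 Hz sinusoid sampled above and below its Nyquist rate. Undersampling makes the tone appear at a false, lower frequency — exactly the aliasing discussed next.

```python
import numpy as np

f_signal = 7.0     # Hz, highest frequency present
fs_good = 20.0     # > 2 * f_signal, satisfies the theorem
fs_bad = 10.0      # < 2 * f_signal, undersampled

for fs in (fs_good, fs_bad):
    t = np.arange(0, 2, 1 / fs)               # 2 seconds of samples
    x = np.sin(2 * np.pi * f_signal * t)
    spectrum = np.abs(np.fft.rfft(x))
    freqs = np.fft.rfftfreq(len(x), 1 / fs)
    peak = freqs[np.argmax(spectrum)]
    print(f"fs = {fs:5.1f} Hz -> dominant frequency seen: {peak:.1f} Hz")

# At fs = 20 Hz the peak sits at 7 Hz as expected;
# at fs = 10 Hz the 7 Hz tone aliases to |7 - 10| = 3 Hz.
```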


 2. Anti-Aliasing

▶ Theory:

Aliasing is the effect of different signals becoming indistinguishable when sampled, leading to visual artifacts like moiré patterns. It occurs when the sampling rate is too low.

Anti-aliasing techniques involve pre-filtering the image using a low-pass filter to remove high-frequency components before sampling. This ensures the sampled image retains important features without distortion.
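A minimal sketch of that pipeline (assuming NumPy plus SciPy's `gaussian_filter` as the low-pass step; the `sigma = factor / 2` choice is a simple heuristic, and real resamplers often use sinc-based filters instead):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def downsample(img, factor=2, antialias=True):
    """Reduce an image by `factor`, optionally low-pass filtering first."""
    if antialias:
        # Remove frequencies the smaller sampling grid cannot represent.
        img = gaussian_filter(img.astype(float), sigma=factor / 2)
    return img[::factor, ::factor]

# A fine checkerboard: the worst case for naive decimation.
img = np.indices((64, 64)).sum(axis=0) % 2 * 255.0
naive = downsample(img, 2, antialias=False)   # aliased: pattern collapses
clean = downsample(img, 2, antialias=True)    # pre-filtered: smooth gray
```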


 3. Image Quantization

▶ Theory:

Quantization is the process of mapping continuous values into a finite set of discrete values. In images, this usually refers to reducing the number of gray levels or color values.

  • Spatial Quantization → Reduces resolution.

  • Intensity Quantization → Reduces the number of brightness levels.

▶ Example:

For an 8-bit image, intensity values range from 0–255. Reducing it to 4 bits maps all values into 16 levels.

Quantization introduces error (quantization noise), but with well-designed quantizers the loss in visual quality can be kept small.
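For example, uniform requantization of an 8-bit image to fewer bits takes only a few lines of NumPy (a simple illustration; practical codecs use smarter, often non-uniform quantizers):

```python
import numpy as np

def quantize(img, bits):
    """Uniformly requantize an 8-bit grayscale image to `bits` bits."""
    levels = 2 ** bits
    step = 256 // levels                      # width of each quantization bin
    q = (img // step) * step + step // 2      # map every pixel to its bin center
    return q.astype(np.uint8)

img = np.random.randint(0, 256, (4, 4), dtype=np.uint8)
img4 = quantize(img, 4)                       # 16 gray levels instead of 256
noise = img.astype(int) - img4.astype(int)    # per-pixel quantization error
```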


4. Orthogonal and Unitary Transforms

▶ Orthogonal Transforms:

  • Orthogonal transforms use basis vectors that are mutually perpendicular.

  • They preserve energy and allow lossless transformations.

  • Examples: DCT, DFT, Haar, Hadamard.

▶ Unitary Transforms:

  • A unitary matrix is the complex counterpart of an orthogonal matrix.

  • It satisfies U^H U = I: the conjugate transpose of U multiplied by U gives the identity matrix (a quick numerical check follows this list).

  • Useful in transforms involving complex values, e.g., DFT.
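Here is the check promised above (a NumPy sketch): the normalized DFT matrix satisfies U^H U = I and preserves energy.

```python
import numpy as np

N = 8
n, k = np.meshgrid(np.arange(N), np.arange(N))
U = np.exp(-2j * np.pi * k * n / N) / np.sqrt(N)   # normalized DFT matrix

# U^H U should be the identity (up to floating-point error).
print(np.allclose(U.conj().T @ U, np.eye(N)))      # True

# Energy preservation (Parseval): ||U x|| == ||x||.
x = np.random.randn(N)
print(np.isclose(np.linalg.norm(U @ x), np.linalg.norm(x)))  # True
```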


 5. Discrete Fourier Transform (DFT)

▶ Theory:

The DFT transforms a signal or image from the spatial domain to the frequency domain. It represents the image in terms of its frequency components, where low frequencies describe smooth areas, and high frequencies describe edges and noise.

▶ Applications:

  • Image filtering

  • Image compression

  • Frequency analysis
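In practice the 2-D DFT is computed with the FFT. A short NumPy sketch of moving an image into the frequency domain and back:

```python
import numpy as np

img = np.random.rand(64, 64)              # stand-in for a grayscale image
F = np.fft.fft2(img)                      # spatial -> frequency domain
F_shifted = np.fft.fftshift(F)            # move DC (zero frequency) to center
magnitude = np.log1p(np.abs(F_shifted))   # log scale for visualization

# Round trip: the inverse transform recovers the image.
recovered = np.fft.ifft2(F).real
assert np.allclose(recovered, img)
```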


 6. Discrete Cosine Transform (DCT)

▶ Theory:

The DCT expresses an image as a sum of cosine functions oscillating at different frequencies. It’s similar to the DFT but uses only cosine components, making it real-valued and more efficient.

▶ Advantage:

  • DCT is highly efficient in energy compaction, making it ideal for image compression, e.g., JPEG.
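A small demonstration of energy compaction (assuming SciPy's `dctn`/`idctn` are available): keep only the 3×3 low-frequency corner of the DCT of a smooth 8×8 block and measure how much energy survives.

```python
import numpy as np
from scipy.fft import dctn, idctn

# A smooth 8x8 block (a gentle gradient) -- the kind JPEG handles well.
x, y = np.meshgrid(np.arange(8), np.arange(8))
block = (100 + 5 * x + 3 * y).astype(float)

C = dctn(block, norm='ortho')          # 2-D DCT (orthonormal)
kept = C.copy()
kept[3:, :] = 0                        # zero out high-frequency rows...
kept[:, 3:] = 0                        # ...and columns: keep 9 of 64 coefficients

print((kept**2).sum() / (C**2).sum())  # fraction of energy retained (~0.999)
approx = idctn(kept, norm='ortho')     # reconstruction stays very close
```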


 7. Hadamard Transform

▶ Theory:

The Hadamard transform uses a matrix with only +1 and -1 values and operates on image data using simple addition and subtraction. It is orthogonal and fast to compute.

▶ Use:

  • Image compression

  • Pattern recognition


 8. Haar Transform

▶ Theory:

The Haar transform is the earliest wavelet transform. It represents data as a set of averages and differences, making it ideal for multi-resolution analysis (processing the image at multiple scales).

▶ Properties:

  • Simple and fast

  • Good for edge detection

  • Used in image compression and analysis


9. Karhunen-Loeve Transform (KLT) / PCA

▶ Theory:

KLT is a statistical method that transforms data into a set of uncorrelated variables using eigenvalue decomposition. It is data-dependent and optimal for decorrelation and energy compaction.

▶ Steps:

  1. Compute the covariance matrix.

  2. Calculate eigenvectors and eigenvalues.

  3. Project the image onto these eigenvectors (see the sketch below).
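A compact NumPy sketch of the three steps (illustrative only; the random matrix stands in for flattened image patches or face vectors):

```python
import numpy as np

# Each row is one observation (e.g., a flattened image patch).
X = np.random.randn(100, 16)
X = X - X.mean(axis=0)                     # center the data

cov = np.cov(X, rowvar=False)              # 1. covariance matrix (16x16)
eigvals, eigvecs = np.linalg.eigh(cov)     # 2. eigenvalues and eigenvectors

order = np.argsort(eigvals)[::-1]          # sort by decreasing variance
basis = eigvecs[:, order]

Y = X @ basis                              # 3. project onto the eigenvectors
# Components of Y are uncorrelated: its covariance is diagonal.
print(np.allclose(np.cov(Y, rowvar=False), np.diag(eigvals[order]), atol=1e-8))
```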

▶ Applications:

  • Face recognition (Eigenfaces)

  • Compression

  • Dimensionality reduction

Unit 2 - Image Quantization and Image Transforms

1. Sampling Theorem

 Definition:

The Sampling Theorem (Shannon-Nyquist) states that a continuous signal can be perfectly reconstructed from its samples if it is sampled at a rate of at least twice the maximum frequency present in the signal.

 Formula:

f_s ≥ 2·f_max

where:

  • f_s = sampling frequency

  • f_max = highest frequency component in the signal

 Application:

Used in digitizing analog images to ensure no information is lost during sampling.


2. Anti-Aliasing

Aliasing:

Occurs when sampling is done below the Nyquist rate, leading to overlapping frequency components and distortion.

Anti-Aliasing:

A process to suppress high frequencies before sampling using low-pass filters to prevent aliasing.


3. Image Quantization

 Definition:

Process of mapping a range of continuous pixel values to a finite number of levels.

 Types:

  • Scalar Quantization: Each pixel is quantized independently.

  • Vector Quantization: Blocks of pixels are quantized together.

Quantization Error:

Error = Original Pixel − Quantized Pixel

Too few levels → loss of detail, visible banding.


4. Orthogonal and Unitary Transforms

 Orthogonal Transform:

A linear transformation using an orthogonal matrix T where:

T^T T = I

  • Preserves energy.

  • Examples: DFT, DCT, Haar, Hadamard.

 Unitary Transform:

A generalization using complex numbers:

T^H T = I

where T^H is the conjugate transpose of T.


5. Discrete Fourier Transform (DFT)

Formula:

F(u,v) = Σ_x Σ_y f(x,y) · e^{−j2π(ux/M + vy/N)}

  • Converts spatial image to frequency domain.

  • Captures periodic patterns.

  • Used in filtering, compression.
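To make the formula concrete, it can be coded directly as a double sum (a study sketch only — this is far too slow for real images, so production code always uses `np.fft.fft2`):

```python
import numpy as np

def dft2_direct(f):
    """2-D DFT computed straight from the definition (slow, for study only)."""
    M, N = f.shape
    x = np.arange(M)[:, None]   # spatial row index
    y = np.arange(N)[None, :]   # spatial column index
    F = np.zeros((M, N), dtype=complex)
    for u in range(M):
        for v in range(N):
            kernel = np.exp(-2j * np.pi * (u * x / M + v * y / N))
            F[u, v] = (f * kernel).sum()
    return F

f = np.random.rand(8, 8)
assert np.allclose(dft2_direct(f), np.fft.fft2(f))   # matches the FFT
```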


6. Discrete Cosine Transform (DCT)

Formula (1D):

X_k = Σ_{n=0}^{N−1} x_n · cos[(π/N)(n + 0.5)k]

  • Like DFT, but uses only real cosine terms.

  • Energy compaction is high → used in JPEG compression.
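The 1-D formula translates line-for-line into NumPy. Note that SciPy's unnormalized DCT-II is defined with an extra factor of 2, which the check below accounts for:

```python
import numpy as np
from scipy.fft import dct

def dct1d(x):
    """DCT as written above: X_k = sum_n x_n * cos[(pi/N)(n + 0.5)k]."""
    N = len(x)
    n = np.arange(N)
    return np.array([np.sum(x * np.cos(np.pi / N * (n + 0.5) * k))
                     for k in range(N)])

x = np.random.randn(8)
X = dct1d(x)

# SciPy's unnormalized DCT-II equals exactly twice this sum.
assert np.allclose(2 * X, dct(x))
```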


7. Hadamard Transform

 Properties:

  • Uses only +1 and −1 (binary values).

  • Fast to compute (no multiplications).

  • Not based on sinusoidal functions.

 Matrix:

Hadamard matrix is recursively defined:

H_2 = | 1   1 |        H_{2^n} = | H_{2^{n−1}}    H_{2^{n−1}} |
      | 1  −1 |                  | H_{2^{n−1}}   −H_{2^{n−1}} |
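The recursion maps directly to code (a NumPy sketch; the argument is the exponent n, giving a 2^n × 2^n matrix):

```python
import numpy as np

def hadamard(order):
    """Build the 2^order x 2^order Hadamard matrix from the recursion above."""
    H = np.array([[1]])
    for _ in range(order):
        H = np.block([[H, H],
                      [H, -H]])
    return H

H8 = hadamard(3)
# Orthogonality check: H^T H = N * I (rows are orthogonal, entries are +/-1).
assert np.array_equal(H8.T @ H8, 8 * np.eye(8, dtype=int))
```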


8. Haar Transform

 Properties:

  • Simplest wavelet transform.

  • Breaks signal into approximation and detail parts.

  • Useful for multi-resolution analysis.

 Steps:

  • Divide signal into pairs.

  • Calculate average and difference.

  • Recurse on the averages for the next level (see the sketch below).
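One level of the average/difference step in NumPy (an unnormalized sketch; an orthonormal Haar transform would divide by √2 instead of 2):

```python
import numpy as np

def haar_level(x):
    """One Haar level: pairwise averages (approximation) and differences (detail)."""
    x = np.asarray(x, dtype=float)
    avg = (x[0::2] + x[1::2]) / 2     # approximation: average of each pair
    diff = (x[0::2] - x[1::2]) / 2    # detail: half the difference of each pair
    return avg, diff

signal = np.array([9, 7, 3, 5, 6, 10, 2, 6])
approx, detail = haar_level(signal)    # approx: [8, 4, 8, 4], detail: [1, -1, -2, -2]

# "Recurse on the averages": apply the same step for the next level.
approx2, detail2 = haar_level(approx)  # approx2: [6, 6], detail2: [2, 2]
```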


9. Karhunen-Loeve Transform (KLT / PCA)

 Definition:

A statistical transform that decorrelates data. Also known as Principal Component Analysis (PCA).

 Steps:

  1. Calculate covariance matrix.

  2. Compute eigenvalues and eigenvectors.

  3. Transform data using eigenvectors.

 Advantage:

  • Optimal energy compaction.

  • Basis vectors are data-dependent.

  • Used in face recognition, compression.

Unit 1 - Introduction to Image Processing

 

What is Image Processing?

Image processing is a method to perform operations on an image to enhance it or extract useful information. It is a type of signal processing where the input is an image, and the output may be either an image or characteristics/features associated with that image.

Goals of Image Processing

  • Image Enhancement: Improving visual appearance (e.g., contrast, sharpness)

  • Image Restoration: Removing noise or distortion

  • Image Compression: Reducing the amount of data required to represent an image

  • Feature Extraction: Identifying objects, edges, or patterns

  • Image Analysis: Understanding and interpreting image content

  • Object Recognition: Detecting and identifying objects in an image

What is an Image?

An image is a two-dimensional function f(x, y), where x and y are spatial coordinates, and f is the intensity (brightness or color) at that point. For digital images, x, y, and f are all finite and discrete.

Types of Image Representation

  1. Spatial Domain Representation: Direct representation using pixel intensity values in a grid.

  2. Frequency Domain Representation: Using transforms like Fourier to represent the image in terms of its frequency components.

Types of Images

  • Binary Image: Only black and white (pixel values: 0 or 1)

  • Grayscale Image: Shades of gray (pixel values: 0 to 255)

  • Color Image: Consists of multiple channels, commonly RGB (Red, Green, Blue)

  • Indexed Image: Uses a colormap or palette to store color information

Image Models

  1. Geometric Model: Describes the shape and position of image elements.

  2. Photometric Model: Describes the brightness/intensity or color of each point.

  3. Color Models:

    • RGB: Red, Green, Blue components

    • HSV: Hue, Saturation, Value

    • YCbCr: Used in video compression

    • CMYK: Used in printing

Resolution

  • Spatial Resolution: Amount of detail in an image (measured in pixels)

  • Gray-level Resolution: Number of distinct gray levels available (e.g., 8-bit = 256 levels)

Image Size

  • Described in terms of width × height × number of channels (e.g., 512 × 512 × 3 for RGB)

2D Linear System

  • A 2D linear system in image processing refers to a system where the output image is a linear transformation of the input image, usually involving operations like convolution.

  • Linearity implies two properties:

    1. Additivity: T[f1 + f2] = T[f1] + T[f2]

    2. Homogeneity (Scaling): T[a·f] = a·T[f]

  • Spatial Invariance: The system's response doesn’t change when the input is shifted.

  • Example: Applying a kernel (filter) over an image using convolution is a classic example of a 2D linear system:

    g(x, y) = Σ_m Σ_n h(m, n) · f(x − m, y − n)
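A direct implementation of this formula (a NumPy sketch with zero-padded borders; production code would use `scipy.signal.convolve2d` or an FFT):

```python
import numpy as np

def convolve2d(f, h):
    """Direct 2-D convolution, zero-padded at the borders (slow, for clarity)."""
    fh, fw = f.shape
    kh, kw = h.shape
    pad_h, pad_w = kh // 2, kw // 2
    fp = np.pad(f, ((pad_h, pad_h), (pad_w, pad_w)))
    g = np.zeros_like(f, dtype=float)
    h_flipped = h[::-1, ::-1]            # convolution flips the kernel
    for x in range(fh):
        for y in range(fw):
            g[x, y] = (fp[x:x + kh, y:y + kw] * h_flipped).sum()
    return g

img = np.random.rand(16, 16)
blur = convolve2d(img, np.ones((3, 3)) / 9)   # 3x3 box blur: a 2D linear system
```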

Luminance

  • The measured intensity of light emitted or reflected from a surface in a given direction.

  • Closely related to the perceived brightness, but it's a physical quantity.

  • Important in grayscale and color image processing.

Contrast

  • The difference in luminance or color that makes an object distinguishable from others or the background.

  • High contrast makes features pop; low contrast makes the image appear flat.

  • Often enhanced using techniques like contrast stretching or histogram equalization.

Brightness

  • A subjective visual perception of how much light an image appears to emit or reflect.

  • Can be increased by adding a constant to all pixel intensities.
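Both ideas — shifting brightness and the contrast stretching mentioned above — in a few lines of NumPy (a sketch; `np.clip` keeps results inside the valid 8-bit range):

```python
import numpy as np

def adjust_brightness(img, offset):
    """Brightness: add a constant to every pixel."""
    return np.clip(img.astype(int) + offset, 0, 255).astype(np.uint8)

def stretch_contrast(img):
    """Contrast stretching: map [min, max] of the image onto the full [0, 255]."""
    lo, hi = int(img.min()), int(img.max())
    if hi == lo:
        return img.copy()                 # flat image: nothing to stretch
    return ((img.astype(float) - lo) / (hi - lo) * 255).astype(np.uint8)

img = np.random.randint(80, 150, (8, 8), dtype=np.uint8)   # low-contrast image
brighter = adjust_brightness(img, 40)
stretched = stretch_contrast(img)
```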

Color Representation

Images can be represented using various color models, each suitable for different applications:

RGB (Red, Green, Blue)

  • Additive color model (used in screens).

  • Each color is a mix of Red, Green, and Blue components.

CMY/CMYK (Cyan, Magenta, Yellow, Key/Black)

  • Subtractive color model (used in printing).

HSV (Hue, Saturation, Value)

  • Hue: Color type (0° to 360°)

  • Saturation: Color purity

  • Value: Brightness of the color

YUV / YCbCr

  • Used in video processing.

  • Separates brightness (Y) from color information (U and V or Cb and Cr).
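As an example of separating brightness from color, the Y (luma) channel is a weighted sum of R, G, and B; the BT.601 weights used below are the classic ones from JPEG and standard-definition video:

```python
import numpy as np

def rgb_to_luma(rgb):
    """Y (luma) per BT.601: the grayscale brightness of an RGB image."""
    weights = np.array([0.299, 0.587, 0.114])   # R, G, B contributions
    return rgb.astype(float) @ weights

rgb = np.random.randint(0, 256, (4, 4, 3), dtype=np.uint8)
Y = rgb_to_luma(rgb)    # shape (4, 4): the brightness channel only
```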

Visibility Functions

  • Visibility functions describe how sensitive the human eye is to different spatial frequencies.

  • The Contrast Sensitivity Function (CSF) is a common example. It shows that humans are:

    • Most sensitive to mid-range spatial frequencies

    • Less sensitive to very low or very high frequencies

  • Important in compression algorithms and display optimization.


Monochrome and Color Vision Models

Monochrome Vision Model

  • Uses only intensity (luminance) values.

  • No color, only grayscale from black to white.

  • Basis of early vision systems and useful in medical/scientific imaging.

Color Vision Model

  • Based on how the human eye perceives color using three types of cones:

    • L (long wavelengths) → Red

    • M (medium) → Green

    • S (short) → Blue

  • Color models (like RGB, HSV) are built around this biological model.

  • Opponent Process Theory: Human vision processes color differences (Red-Green, Blue-Yellow) rather than absolute colors.


Shannon Nyquist Theorem

 What should be the ideal size of a pixel? Should it be big or small?

The answer is given by the Shannon-Nyquist theorem. As per this theorem, the sampling frequency should be greater than or equal to 2 × f_max, where f_max is the highest frequency present in the image.


SubSampling 

The key idea in image sub-sampling is to throw away every other row and column to create a half-size image. When the sampling rate gets too low, we are not able to capture the details in the image anymore.

Instead, we should have a minimum signal/image rate, called the Nyquist rate.

Using Shannon's Sampling Theorem, the minimum sampling rate should be such that f_s ≥ 2·f_max.

Simple Image Model

I(x, y, l) = σ(x, y, l) × L(l)

where σ(x, y, l) is the reflectivity of the object at point (x, y) for wavelength l, and L(l) is the intensity of the light source at that wavelength.

Mach Bands

The Mach band effect is a phenomenon of lateral inhibition among rods and cones: the visual system exaggerates sharp intensity changes, producing illusory light and dark bands near edges.


Components of Digital Camera

The essential components of a digital camera are as follows:

A subsystem of sensors to capture the image. The subsystem uses photodiodes to convert light energy into electrical signals.

A subsystem that converts analog signals to digital data.

A storage subsystem for storing the captured images. A digital camera also has an immediate feedback system to see the captured image. Digital cameras can be connected to computers through a cable to transfer images to the computer system.

Tuesday, April 22, 2025

Digital Imaging System

  A digital imaging system is a set of devices for acquiring, storing, manipulating, and transmitting digital images.

Components:

  1. Human beings perceive objects because of light. Light sources are of two types: primary and secondary.
  2. The sun and lamps are examples of primary light sources.

Wavelength: the distance between two successive wave crests (or troughs) in the direction of travel.

Amplitude: the maximum distance the oscillation travels away from its horizontal axis.

Frequency: the number of waves crossing a given point per unit time.

Modes of Imaging

Reflective Mode Imaging

 Reflective mode imaging is the simplest form of imaging and uses a sensor to acquire the digital image. All video cameras, digital cameras, and scanners use some type of sensor to capture the image.

Emissive Type Imaging 

Emissive type imaging is the second type, where images are acquired from self-luminous objects without the help of an external radiation source. The radiation emitted by the object is directly captured by the sensor to form an image. Thermal imaging is an example of emissive type imaging.

Transmissive Imaging

Transmissive imaging is the third type, where the radiation source illuminates the object and some of the radiation passes through it. How much radiation is absorbed depends on the nature of the material; the attenuated radiation that passes through is sensed to form the image.

Image Processing Environment

A radiation source illuminates the scene. Depending on the imaging mode, the sensor receives light reflected from an object (reflective imaging), emitted by a self-luminous object (emissive imaging), or transmitted through a transparent object (transmissive imaging). The sensor output is an analog signal; a digitizer converts it to digital data, which is then processed by a digital computer.

Radiation source → object / self-luminous object / transparent object → sensor → analog signal → digitizer → digital computer

Nature of Image Processing

Images are everywhere! Sources of images include paintings, photographs in magazines and journals, image galleries, digital libraries, newspapers, advertisements, billboards, television, and the Internet.

However, most of these are analog images with inherent limitations and cannot be processed by a digital computer directly.

In image processing, the term 'image' is used to denote image data that is sampled, quantized, and readily available in a form suitable for further processing by digital computers.

Friday, April 18, 2025

You have probably heard of cryptocurrency, but do you know exactly what it is and how it works?

 Cryptocurrency

Cryptocurrency is digital money that can be used to buy goods and services, using strong encryption techniques to secure online transactions. Banks, governments and even companies like Microsoft and AT&T are very aware of its importance and are jumping on the cryptocurrency bandwagon!

1. Cryptocurrency owners keep their money in encrypted, virtual ‘wallets.’ When a transaction takes place between the owners of two digital wallets, the details are recorded in a decentralized, electronic ledger or blockchain system. This means it is carried out with a degree of anonymity and is self-managed, with no interference from third parties such as central banks or government entities.

2. Approximately every ten minutes, special computers collect data about the latest cryptocurrency transactions, turning them into mathematical puzzles to maintain confidentiality.

3. These transactions are then verified through a technical and highly complex process known as ‘mining.’ This step typically involves an army of ‘miners’ working on high-end PCs to solve mathematical puzzles and authenticate transactions.


Once verified, the ledger is updated and electronically copied and disseminated worldwide to anyone belonging to the blockchain network, effectively completing a transaction.
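A toy version of such a puzzle in Python (purely illustrative — real networks differ in block structure, hash construction, and difficulty adjustment): find a nonce whose SHA-256 digest starts with a required number of zero hex digits.

```python
import hashlib

def mine(block_data: str, difficulty: int = 4) -> int:
    """Toy proof-of-work: find a nonce so the hash has `difficulty` leading zeros."""
    target = "0" * difficulty
    nonce = 0
    while True:
        digest = hashlib.sha256(f"{block_data}{nonce}".encode()).hexdigest()
        if digest.startswith(target):
            return nonce
        nonce += 1

nonce = mine("alice->bob:1.5", difficulty=4)   # brute force: ~16^4 tries on average
```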


Cryptojacking

Cryptojacking is an emerging threat that hides on a user’s computer, mobile phone, tablet, laptop or server, using that machine’s resources to ‘mine’ cryptocurrencies without the user’s consent or knowledge.

Many victims of cryptojacking didn’t even know they’d been hacked until it was too late!

Thursday, April 10, 2025

Image processing

 What is image quantization? Why is it important?

Image quantization is the process of converting the continuous range of pixel values into a limited set of discrete values. This step follows sampling and reduces the precision of the sampled values to a manageable level for digital representation.


Day three of theory of computation

 1. Non-deterministic Finite Automata (NFA)   Unlike a DFA, an NFA allows a machine to explore multiple paths simultaneously.   Definition: ...