NumPy and OpenCV Tutorial for Machine Vision

Start your machine vision coding with Python

Motivation

Human beings perceive the environment with our vision system. The human eye, brain, and limbs work together to perceive the environment and act accordingly. An intelligent system can perform tasks that would require a certain level of intelligence if performed by a human being. To perform such tasks, a machine vision system is one of the most important components for a computer. Typically, a camera and the images it captures are used to collect the information needed to get the job done. Image processing and computer vision techniques help us perform tasks similar to those performed by humans, such as image recognition and object tracking.

In computer vision, the camera works like a human eye to capture the image, and the processor works like a brain to process the captured image and generate meaningful results. But there is a basic difference between humans and computers. The human brain works automatically, and its intelligence is innate. The computer, by contrast, has no intelligence without human instruction (a program). Computer vision is the way to provide the proper instructions so that the computer can function in a way comparable to the human vision system. Its capacity, however, is limited.

In the upcoming sections, we will discuss the basic idea of how an image is formed and how it can be manipulated using Python.

How the image is formed and displayed

An image is nothing more than a combination of pixels with different color intensities. The terms ‘pixel’ and ‘color intensity’ may be unfamiliar to you. Don’t worry. They will become clear by the end of the article.

A pixel is the smallest unit/element of a digital image. Details are in the image below.

The screen is made up of pixels. In the above figure, there are 25 columns and 25 rows. Each small square is considered a pixel. The grid holds 25 x 25 = 625 pixels, so it represents a screen with 625 pixels. If we make the pixels shine with different color intensities (brightness), they form a digital image.
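As a rough sketch of the 25 x 25 grid described above, the snippet below models the screen as a NumPy array with one element per pixel. The names `rows`, `cols`, and `screen`, and the choice of which pixel to light up, are illustrative assumptions and are not taken from the figure.

```python
import numpy as np

rows, cols = 25, 25  # grid size from the figure described above

# One array element per pixel; 0 means the pixel is switched off (black).
screen = np.zeros((rows, cols), dtype=np.uint8)

# Light up a single pixel near the middle of the grid at full brightness.
screen[12, 12] = 255

print(screen.shape)  # (25, 25)
print(screen.size)   # 625
```

Changing more elements of `screen` to different values is all it takes to turn the blank grid into a picture.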

How does the computer store the image in memory?

If we look closely at the image, we can compare it to a 2D matrix. A matrix has rows and columns, and its elements can be addressed by their index. An array has a similar structure, so the computer stores the image as an array in memory.

Each element of the array contains the intensity value of a color. In general, the intensity value ranges from 0 to 255, which is the range of an 8-bit value. For demonstration purposes, I’ve included a matrix representation of an image.

Sample matrix representation of a grayscale image (Image by author)
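To make the matrix idea concrete, here is a minimal sketch that builds a small grayscale image by hand and reads individual pixels by row and column index. The 4 x 4 size and the intensity values are made up for illustration; they are not the values from the author's figure.

```python
import numpy as np

# A hypothetical 4 x 4 grayscale image; each value is an 8-bit intensity (0-255).
image = np.array(
    [[0, 50, 100, 150],
     [25, 75, 125, 175],
     [50, 100, 150, 200],
     [75, 125, 175, 255]],
    dtype=np.uint8,
)

print(image.shape)  # (4, 4): 4 rows, 4 columns
print(image[0, 0])  # 0   -> top-left pixel (darkest)
print(image[1, 2])  # 125 -> pixel at row 1, column 2
print(image[3, 3])  # 255 -> bottom-right pixel (brightest)
```

Indexing is `image[row, column]`, with both indices starting at 0.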

Color and grayscale image

A grayscale image is a black-and-white image. It is formed with a single color channel. A pixel value close to 0 represents darkness, and the pixel gets brighter with higher intensity values. The highest value is 255, which represents white. A 2D matrix is enough to contain a grayscale image, as the last figure shows.
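The relationship between intensity value and brightness can be sketched with a simple gradient that runs from 0 (black) on the left to 255 (white) on the right. The image height of 50 rows is an arbitrary choice for this example.

```python
import numpy as np

# One row containing every 8-bit intensity from 0 (black) to 255 (white).
row = np.arange(256, dtype=np.uint8)

# Repeat the row 50 times to get a 50 x 256 grayscale gradient image.
gradient = np.tile(row, (50, 1))

print(gradient.shape)    # (50, 256)
print(gradient[0, 0])    # 0   -> darkest pixel, left edge
print(gradient[0, 255])  # 255 -> brightest pixel, right edge
```

Because the whole image fits in a single 2D array, no extra dimension is needed for grayscale data.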

A color image cannot be formed with a single color channel; there can be millions of color combinations. There are three primary color channels: red (R), green (G), and blue (B). Each color channel is stored in a 2D matrix that holds its intensity values, and the final image is the combination of these three color channels.

This color model has (256 x 256 x 256) = 16,777,216 possible color combinations. You may visualize the combination here.
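The sketch below shows one way to combine three 2D channel matrices into a single color image with NumPy, and checks the count of possible colors. The tiny 2 x 2 size and the channel values (255, 128, 0) are assumptions chosen for illustration.

```python
import numpy as np

height, width = 2, 2  # a tiny image, just for demonstration

# One 2D matrix per channel, each holding 8-bit intensity values.
red = np.full((height, width), 255, dtype=np.uint8)
green = np.full((height, width), 128, dtype=np.uint8)
blue = np.zeros((height, width), dtype=np.uint8)

# Stack the three matrices along a third axis: (rows, columns, channels).
color_image = np.dstack([red, green, blue])

print(color_image.shape)            # (2, 2, 3)
print(color_image[0, 0].tolist())   # [255, 128, 0] -> R, G, B of the top-left pixel

# Each channel has 256 possible values, so the number of colors is 256 cubed.
print(256 ** 3)                     # 16777216
```

Every pixel here has the same orange color because each channel matrix is filled with a single value.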

But in computer memory, the image is stored differently.

Image stored in computer memory (author’s image)

The computer does not know about RGB channels; it only knows intensity values. In this example, the red channel is stored with high intensity values, and the green and blue channels are stored with medium and low intensity values, respectively.
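The point that only numbers are stored can be illustrated with NumPy. This is a sketch of how a NumPy array lays out its values by default (row-major order), which is an assumption about the storage scheme and may not match the exact arrangement drawn in the author's figure. The two pixel values are made up.

```python
import numpy as np

# A hypothetical 1 x 2 color image: two pixels, each with (R, G, B) values.
image = np.array([[[255, 128, 0], [10, 20, 30]]], dtype=np.uint8)

print(image.shape)   # (1, 2, 3): rows, columns, channels
print(image.nbytes)  # 6 bytes: one byte per intensity value

# Flattening exposes the order of values in NumPy's default (row-major) layout.
flat = image.ravel()
print(flat.tolist())  # [255, 128, 0, 10, 20, 30]
```

Nothing in the flattened data says "red" or "blue"; the meaning of each number comes only from its position.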

NumPy basics for working with Python

NumPy is a fundamental Python package for scientific computing. It is built primarily around an array object, but its functionality is not limited to arrays: the library also handles various numeric and logical operations on numbers [1]. You will get NumPy official documentation here.

Let’s start our journey. First things first.

Import the NumPy library.
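The original import snippet is not reproduced at this point, so the conventional form is shown here as a minimal sketch. The alias `np` is a widely used community convention rather than a requirement, and the sample array `arr` is only there to confirm the import works.

```python
import numpy as np

# Print the installed version to confirm the library is available.
print(np.__version__)

# Create a small array to check that everything works.
arr = np.array([1, 2, 3])
print(type(arr))   # <class 'numpy.ndarray'>
print(arr.shape)   # (3,)
```

If the import fails, NumPy is most likely not installed in the active Python environment.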

