Data Processing Cycle
Data Processing Cycle
• The data processing cycle consists of a series of steps where raw data
(input) is fed into a process (CPU) to produce actionable insights
(output). Each step is taken in a specific order, but the entire process
is repeated in a cyclic manner. The first data processing cycle's output
can be stored and fed as the input for the next cycle.
Step 1: Collection
• The collection of raw data is the first step of the data processing cycle.
The type of raw data collected has a huge impact on the output
produced. Hence, raw data should be gathered from defined and
accurate sources so that the subsequent findings are valid and usable.
Raw data can include monetary figures, website cookies, profit/loss
statements of a company, user behavior, etc.
Step 2: Preparation
• Data preparation or data cleaning is the process of sorting and
filtering the raw data to remove unnecessary and inaccurate data.
Raw data is checked for errors, duplication, miscalculations or missing
data, and transformed into a suitable form for further analysis and
processing. This is done to ensure that only the highest quality data is
fed into the processing unit.
Step 3: Input
• In this step, the raw data is converted into machine readable form
and fed into the processing unit. This can be in the form of data entry
through a keyboard, scanner or any other input source
Step 4: Data Processing
• In this step, the raw data is subjected to various data processing
methods using machine learning and artificial intelligence
algorithms to generate a desirable output. This step may vary slightly
from process to process depending on the source of data being
processed (data lakes, online databases, connected devices, etc.) and
the intended use of the output.
Step 5: Output
• The data is finally transmitted and displayed to the user in a readable
form like graphs, tables, vector files, audio, video, documents, etc.
This output can be stored and further processed in the next data
processing cycle.
Step 6: Storage
• The last step of the data processing cycle is storage, where data and
metadata are stored for further use. This allows for quick access and
retrieval of information whenever needed, and also allows it to be
used as input in the next data processing cycle directly