INTRODUCTION TO DATA SCIENCE
Module 3 BY Safi Kasongo
ARTIFICIAL INTELLIGENCE AND
MACHINE LEARNING
• Artificial intelligence (AI), the ability of a digital computer or
computer-controlled robot to perform tasks commonly associated with
intelligent beings.
• Artificial intelligence is the science of making machines that can think
like humans. It can do things that are considered "smart." AI
technology can process large amounts of data in ways, unlike humans.
• The goal for AI is to be able to do things such as recognize patterns,
make decisions, and judge like humans.
Machine learning (ML) is a discipline of artificial intelligence
(AI) that provides machines with the ability to automatically
learn from data and past experiences while identifying patterns
to make predictions with minimal human intervention.
Machine Learning is the field of study that gives computers the
ability to learn without being explicitly programmed.
Just like human being, ML enables computers learn how to do a
given task without being programmed to do so.
APPLICATIONS OF AI
• Agriculture
• Some of the ways that AI is being used in agriculture include:
• Crop yield improvement: Analyzing data on soil conditions, weather
patterns, and crop growth with AI models and tools could help to
develop strategies that can improve crop yields
• Cost reduction: Automating tasks with AI, such as harvesting and
irrigation, which can reduce labor costs
• Environmental protection: Monitoring and managing natural
resources, such as water and soil
• Health and Medicine
the technology is helping medical practitioners in analyzing trends or
flagging events that may help in improved patient diagnoses and
treatment. ML algorithms even allow medical experts to predict the
lifespan of a patient suffering from a fatal disease with increasing
accuracy.
• Retail
Retail websites extensively use machine learning to recommend items
based on users’ purchase history. Retailers use ML techniques to capture
data, analyze it, and deliver personalized shopping experiences to their
customers. They also implement ML for marketing campaigns,
customer insights, customer merchandise planning, and price
optimization.
• Entertainment:AI is extensively used in entertainment
platforms such as YouTube, Netflix and Amazon Prime Video to
provide personalised recommendations to users. By analysing user
preferences, browsing history, and behaviour, AI algorithms can
suggest movies, TV shows, music, books, and other content that users
are likely to enjoy.
• Fitness: Smartwatches and fitness trackers equipped with AI
algorithms can provide users with valuable insights into their health
and fitness. They can monitor heart rate, sleep patterns, and activity
levels, allowing users to make informed decisions about their lifestyle.
• Gymbuddy AI is a mobile app that uses artificial intelligence to create
personalized workout programs for users. The app aims to offer
enjoyable and effective workouts tailored to each user's fitness level
and goals.
Types of Machine Learning
There are three types of machine learning which are
• Supervised machine Learning
• Unsupervised Machine Learning
• Reinforcement Machine Learning
Supervised Machine Learning
This type of ML involves supervision, where machines are trained on
labeled datasets and enabled to predict outputs based on the provided
training. The labeled dataset specifies that some input and output
parameters are already mapped. Hence, the machine is trained with the
input and corresponding output. A device is made to predict the outcome
using the test dataset in subsequent phases.
• Supervised machine learning is further classified into two broad
categories:
• Classification
• Regression
• Classification: These refer to • Regression: Regression
algorithms that address algorithms handle regression
classification problems where the problems where input and output
output variable is categorical; for variables have a linear
example, yes or no, true or false, relationship. These are known to
male or female, etc. Real-world predict continuous output
applications of this category are variables. Examples include
evident in spam detection and weather prediction, market trend
email filtering. analysis, etc.
Unsupervised Machine Learning
• Unsupervised learning refers to a learning technique that’s devoid of
supervision. Here, the machine is trained using an unlabeled dataset
and is enabled to predict the output without any supervision. An
unsupervised learning algorithm aims to group the unsorted dataset
based on the input’s similarities, differences, and patterns.
Unsupervised machine learning is further classified into two broad
categories:
• Clustering: The clustering • Association: Association
technique refers to grouping learning refers to identifying
objects into clusters based on typical relations between the
parameters such as similarities or variables of a large dataset. It
differences between objects. For determines the dependency of
example, grouping customers by various data items and maps
the products they purchase. associated variables. Typical
applications include web usage
mining and market data analysis.
Suppose the unsupervised learning algorithm is given an input
dataset containing images of different types of cats and dogs.
Unsupervised learning algorithm will perform this task by
clustering the image dataset into the groups according to
similarities between images.
Reinforcement Machine Learning
Reinforcement Learning (RL) is the type of ML method that
enables an agent to learn in an interactive environment by trial and
error using feedback from its own actions. OR
A Machine learning based on rewarding desired behaviors and/or
punishing undesired ones.
There is no labeled data, so the agent Learns from actions not from
data
• Reinforcement learning approaches can be used to train computers to
do many tasks such as Controlling robot limbs
Machine Learning process
1. Data Collection and Preparation:
• Data gathering: This involves collecting relevant data for the task at hand. The
data can come from various sources like databases, surveys, sensor readings, or web
scraping.
• Data cleaning and pre-processing: Raw data often contains errors, inconsistencies,
or missing values. This stage involves cleaning the data, handling missing entries,
and ensuring it's in a format suitable for the machine learning algorithm.
2. Model Selection and Training:
• Model selection: Choosing the right machine learning algorithm depends on
the type of problem you're trying to solve. Common algorithms include
decision trees, linear regression, support vector machines, and neural networks.
• Training the model: The prepared data is divided into two sets: training data
and testing data. The training data is used to "train" the machine learning
model. The algorithm learns by analyzing patterns and relationships within the
training data.
3. Model Evaluation and Tuning:
• Model testing: The trained model is evaluated using the testing
data. This helps assess how well the model performs on unseen
data and identifies potential areas for improvement.
• Model tuning: Based on the evaluation results, the model can be
further tuned by adjusting its parameters or even trying different
algorithms. The goal is to optimize the model's performance and
accuracy.
4. Deployment and Monitoring:
• Deployment: Once a satisfactory level of performance is achieved,
the model can be deployed into production. This means integrating
it into an application or system where it can be used to make
predictions or classifications on new, real-world data.
• Monitoring and maintenance: The deployed model's
performance should be monitored over time. As new data becomes
available, the model might need to be retrained or updated to
maintain its effectiveness.
Training machines to recognize pattern
Thank You