Click Here To Download Ipython Notes For This Chapter Eda: The Output of This Above Program

This Python code demonstrates how to calculate a histogram with 5 bins from an input array of numbers ranging from 1 to 51. It shows how to determine the bin edges and counts for each bin. It also explains the difference between using the 'density=True' parameter, which normalizes the counts to represent a probability density function, versus just reporting the raw counts.

Uploaded by

03sri03



import numpy as np
# consider an array like this
a = [1, 2, 3, 5, 10, 11, 12, 13, 14, 15, 21, 23, 25, 26, 27, 29, 30, 31, 35, 51]
# here we have decided to group all these numbers into 5 bins
# i.e. bins = 5
# the minimum number in the array is 1
# the maximum number in the array is 51
# the width of each bin is calculated as ((max - min) / bins)
# width of each bin = (51-1)/5 = 10
# since each bin has width 10, we can choose the bin edges like this
# 1 ...... 11 ....... 21 ........ 31 ....... 41 ....... 51
# |---10---|----10----|----10-----|----10----|----10----|
# so we have now found the bin edges
# to find the counts, we calculate how many points fall into each bin
# therefore the count of a bin = number of elements a_i of a such that left_bin_edge <= a_i < right_bin_edge
# i. number of elements belonging to the 1st bin 1<=x<11 => 5 [1,2,3,5,10]
# ii. number of elements belonging to the 2nd bin 11<=x<21 => 5 [11,12,13,14,15]
# iii. number of elements belonging to the 3rd bin 21<=x<31 => 7 [21,23,25,26,27,29,30]
# iv. number of elements belonging to the 4th bin 31<=x<41 => 2 [31,35]
# v. number of elements belonging to the 5th bin 41<=x<=51 => 1 [51]

# note: from the documentation: https://docs.scipy.org/doc/numpy/reference/generated/numpy.histogram.html

# All but the last (righthand-most) bin are half-open, i.e. if the bin edges are [1,2,3,4], the bins are [1,2), [2,3), [3,4]
# [1,10) = 1,2,3,4,5,6,7,8,9 means including 1 but not 10; it is a half-open interval

print('='*30, "explaining 'bin edges and counts", '='*30)

counts,bins = np.histogram(a, bins=5)

print("bin edges :",bins)

print("counts per each bin :",counts)
# density: bool, optional
# If False, the result will contain the number of samples in each bin.
# If True, the result is the value of the probability density function at the bin, normalized such that the integral over the range is 1.
# Note that the sum of the histogram values will not be equal to 1 unless bins of unity width are chosen;
# it is not a probability mass function.
# and from the source code
#if density:
# db = np.array(np.diff(bin_edges), float)
# return n/db/n.sum(), bin_edges
# here n => number of elements in each bin
n = counts
# and db = difference between consecutive bin edges (the bin widths)
db = np.array(np.diff(bins))
# n.sum() = total number of elements

print('='*30, "explaining 'density=True' parameter", '='*30)

print("manual calculated densities for each bin",counts/db/counts.sum())
counts, bins = np.histogram(a, bins=5, density=True)

print("bin edges :",bins)

print("counts per each bin using density=True:",counts)
print('='*30, "explaining counts/sum(counts)",'='*30)
# please note that the documentation says that, with density=True,
# "the sum of the histogram values will not be equal to 1"
# a simple fix: to make the whole sum equal 1, we divide each density value by the sum of all the density values
counts, bins = np.histogram(a, bins=5, density=True)
print("bin edges :",bins)
# sum(counts) = sum of all the elements in the counts array = 0.025 + 0.025 + 0.035 + 0.01 + 0.005 = 0.1
# counts/sum(counts) = divide every element of counts by that sum = [0.025/0.1, 0.025/0.1, 0.035/0.1, 0.01/0.1, 0.005/0.1] = [0.25 0.25 0.35 0.1 0.05]
print("counts per each bin using density=True:",counts/sum(counts))

The output of the above program

============================== explaining 'bin edges and counts ==============================
bin edges : [ 1. 11. 21. 31. 41. 51.]
counts per each bin : [5 5 7 2 1]
============================== explaining 'density=True' parameter ==============================
manual calculated densities for each bin [0.025 0.025 0.035 0.01 0.005]
bin edges : [ 1. 11. 21. 31. 41. 51.]
counts per each bin using density=True: [0.025 0.025 0.035 0.01 0.005]
============================== explaining counts/sum(counts) ==============================
bin edges : [ 1. 11. 21. 31. 41. 51.]
counts per each bin using density=True: [0.25 0.25 0.35 0.1 0.05]
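
The counting rule described in the comments (every bin half-open except the last, which also includes its right edge) can be checked by hand. The sketch below is only an illustration, not how NumPy is implemented internally; the names data, n_bins, edges and manual_counts are introduced here for the example and are not part of the original program.

```python
import numpy as np

a = [1, 2, 3, 5, 10, 11, 12, 13, 14, 15, 21, 23, 25, 26, 27, 29, 30, 31, 35, 51]
data = np.asarray(a)
n_bins = 5

# Equal-width edges from the minimum to the maximum: 1, 11, 21, 31, 41, 51
edges = np.linspace(data.min(), data.max(), n_bins + 1)

manual_counts = []
for i in range(n_bins):
    left, right = edges[i], edges[i + 1]
    if i == n_bins - 1:
        # last bin is closed on both sides: left <= x <= right
        in_bin = (data >= left) & (data <= right)
    else:
        # every other bin is half-open: left <= x < right
        in_bin = (data >= left) & (data < right)
    manual_counts.append(int(in_bin.sum()))

numpy_counts, numpy_edges = np.histogram(data, bins=n_bins)

print("manual counts:", manual_counts)
print("numpy counts :", numpy_counts.tolist())
```

Both lines should report the same counts as the first section of the output, and 51 is counted only because the last bin includes its right edge.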

The original program is available here: https://ideone.com/IqCwsI
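
The documentation note about density=True can also be verified numerically: the density values sum to 0.1 here, but each value multiplied by its bin width sums to 1. The following is a minimal sketch under the same input array; the names widths, area, plain_sum and fractions are assumptions made for this example.

```python
import numpy as np

a = [1, 2, 3, 5, 10, 11, 12, 13, 14, 15, 21, 23, 25, 26, 27, 29, 30, 31, 35, 51]

density, edges = np.histogram(a, bins=5, density=True)
counts, _ = np.histogram(a, bins=5)
widths = np.diff(edges)  # every bin has width 10 for this input

# The integral of the density over the range is sum(density * width)
area = float(np.sum(density * widths))
# The plain sum of the density values is not 1 unless the bins have unit width
plain_sum = float(np.sum(density))
# Fraction of samples in each bin, recovered from the density
fractions = density * widths

print("integral of density :", area)
print("plain sum of density:", plain_sum)
print("fraction per bin    :", fractions)
print("counts / total      :", counts / counts.sum())
```

Dividing the density by its own sum, as the program does, gives the same fractions only because all bins share one width. With unequal bin widths, density * widths (or counts / counts.sum()) is the safer way to get per-bin proportions.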
