[go: up one dir, main page]

0% found this document useful (0 votes)
207 views22 pages

Set Theory: Prof. Asim Tewari IIT Bombay

The document discusses key concepts in set theory and topology in Euclidean space. It defines what a set is and provides examples. It describes common set operations like union, intersection, difference, symmetric difference, and complement. It also discusses the Cartesian product of sets. Additionally, it covers topological concepts such as open and closed sets, interior and closure of sets, convergence of sequences, and properties of compact sets in Euclidean space. Finally, it outlines common operations that can be performed on subsets of Euclidean space including addition, translation, scalar multiplication, reflection, and Minkowski operations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
207 views22 pages

Set Theory: Prof. Asim Tewari IIT Bombay

The document discusses key concepts in set theory and topology in Euclidean space. It defines what a set is and provides examples. It describes common set operations like union, intersection, difference, symmetric difference, and complement. It also discusses the Cartesian product of sets. Additionally, it covers topological concepts such as open and closed sets, interior and closure of sets, convergence of sequences, and properties of compact sets in Euclidean space. Finally, it outlines common operations that can be performed on subsets of Euclidean space including addition, translation, scalar multiplication, reflection, and Minkowski operations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

Set Theory

Prof. Asim Tewari


IIT Bombay

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Set Theory: Basics
Set is a correlation of mathematical objects taken from a suitable
domain of disclosure.

Examples:
▪ N = {0, 1, 2, 3, 4. . .}
(set of natural numbers)

▪ Z = {. . ., -3, -2, -1, 0, 1, 2, 3,. . .}


(set of integers)

▪ E = {0, 2, 4, 6. . .}
(set of even natural numbers)

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Set Operators
▪ Union of two sets A and B is the set of all elements in either set A or B.
▪ Written A  B.
▪ A  B = {x | x  A or x  B}

▪ Intersection of two sets A and B is the set of all elements in both sets
A or B.
▪ Written A  B.
▪ A  B = {x | x  A and x  B}

▪ Difference of two sets A and B is the set of all elements in set A which
are not in set B.
• Written A - B.
• A - B = {x | x  A and x  B}

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Set Operators
▪ Symmetric Difference of two sets A and B is the set of all elements
which are either in “set A and not in B” or in “set B and not in A”
▪ Written A Δ B.
▪ A Δ B = (A \ B)  (B \ A)

▪ Complement of a set is the set of all elements not in the


set.
• Written Ac
• Depends on the choice of superset/universal set S
• Ac = {x | x  S / A }

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Cartesain Product
▪ Cartesian Product: Given two sets A and B, the set of
– All ordered pairs of the form (a , b) where a is any
element of A and b any element of B, is called the
Cartesian product of A and B.
▪ Denoted as A x B
• A x B = {(a,b) | a A and b B}
• Example: Let A = {1,2,3}; B = {x,y}
– A x B = {(1,x),(1,y),(2,x),(2,y),(3,x),(3,y)}
– B x A = {(x,1),(y,1),(x,2),(y,2),(x,3),(y,3)}
– B x B = B2 = {(x,x),(x,y),(y,x),(y,y)}
▪ In general,
A1 x A2 x……x An = {(a1,a2………an) | a1 A1,……… an An}
Example:
𝑅 2 = 𝑅 𝑋 𝑅 2 − 𝐷 𝐸𝑢𝑐𝑙𝑖𝑑𝑒𝑎𝑛 𝑠𝑝𝑎𝑐𝑒
= { 𝒙𝟏 , 𝒙𝟐 : 𝒙𝟏 , 𝒙𝟐 ∈ 𝑹}
𝑅 𝑑 = 𝑅 𝑋 𝑅 … … … 𝑋 𝑅 𝑑 − 𝐷 𝐸𝑢𝑐𝑙𝑖𝑑𝑒𝑎𝑛 𝑠𝑝𝑎𝑐𝑒
= { 𝒙𝟏 , 𝒙𝟐 … … … 𝒙𝒅 : 𝒙𝟏 , 𝒙𝟐 … … … 𝒙𝒅 ∈ 𝑹}
Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Topology in Euclidean Space
▪ Euclidean Metric – A measure of distance between two points
𝑥−𝑦 = 𝑥1 − 𝑦1 2 + … … … 𝑥𝑑 − 𝑦𝑑 2

▪ Closed Ball
𝑏 𝑎, 𝑟 = {𝑥 ∈ 𝑅𝑑 : 𝑥 − 𝑎 ≤ 𝑟}

▪ Open Ball
𝑏 𝑖𝑛𝑡 𝑎, 𝑟 = {𝑥 ∈ 𝑅𝑑 : 𝑥 − 𝑎 < 𝑟}

▪ Bounded Set – Set A is bounded if there is a ball 𝑏(𝑎, 𝑟), such


that 𝐴 ∁ 𝑏(𝑎, 𝑟)

▪ A sequence 𝑥1 , 𝑥2 … … … is said to converge to x


if lim ||𝑥𝑛 − 𝑥 || = 0
𝑛→∞

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Topology in Euclidean Space
▪ Open Set - A set is said to be open if for each 𝑥 ∈ 𝐴, a
positive number 𝜖 can be found depending on x, such that
𝑏 𝑥, 𝜖 ∁ 𝐴
▪ Example: In case of d = 1, (u,v) is a open set
▪ System of open sets of 𝑅𝑑 is denoted by O.

▪ Closed Set – A set is said to be closed if its complement Ac is


open
▪ Important: For closed set we need a specific superset S to
define Ac
▪ Example: Hypercubes
𝒖𝟏 , 𝒗𝟏 𝑿 𝒖𝟐 , 𝒗𝟐 𝑿 … … … 𝒖𝒅 , 𝒗𝒅
Hyperplanes
𝒙 = { 𝒙𝟏 , … … … 𝒙𝒅 ∈ 𝑹𝒅 ∶ σ𝒅𝒊−𝟏 𝑿𝒊 𝒂𝒊 = 𝒃}
where 𝑏, 𝑎1 , … … … 𝑎𝑑 are constants with 𝑎𝑖 are not equal to 0

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Topology in Euclidean Space
▪ The Interior Aint of a general set A is the union of all the
open sets contained in A
▪ Aint is the largest open set contained in A.

▪ The Closure Acl of a general set A is the intersection of all


the closed sets containing A
▪ Acl is the smallest closed set containing A

▪ Properties of Acl and Aint


▪ Aint ⊂ A ⊂ Acl
▪ Also, Aint = ((Ac)cl)c
▪ A set A is open precisely when Aint = A
▪ A set A is closed precisely when Acl = A
▪ If A = (Aint)cl, then A is said to be regular closed
▪ The boundary of a set A, 𝜕A = Acl \ Aint
▪ A set K 𝜖 Rd is said to be Compact, if it is both closed as
well as bounded

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Operations on Subsets of Euclidean Space
▪ Addition: 𝑥 + 𝑦 = 𝑥1 + 𝑦1 , 𝑥2 + 𝑦2 , … … … 𝑥𝑑 + 𝑦𝑑

▪ Translation: 𝐴𝑥 = 𝐴 + 𝑥 = 𝑦 + 𝑥: 𝑦 ∈ 𝐴 , 𝑓𝑜𝑟 𝑥 𝑎𝑛𝑑 𝐴 ∈ 𝑅𝑑

▪ Scalar Multiplication by c ∈ 𝑅,
𝑐. 𝑥 = 𝑐𝑥 = (𝑐𝑥1 , 𝑐𝑥2 , … … … 𝑐𝑥𝑑 )

▪ Reflection: Scalar Multiplication by c = -1,


Ǎ = −𝐴 = −𝑥 ∶ 𝑥 ∈ 𝐴 𝑓𝑜𝑟 𝐴 ⊂ 𝑅𝑑

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Operations on Subsets of Euclidean Space

▪ Minkowski Addition: 𝐴 ⊕ 𝐵 = 𝑥 + 𝑦: 𝑥 ∈ 𝐴, 𝑦 ∈ 𝐵 𝑓𝑜𝑟 𝐴, 𝐵


▪ It is both Associative and Commutative
▪ 𝐴𝑥 = 𝐴 ⊕ 𝑥
▪ 𝐴 ⊕ 𝐵 = ‫𝑥𝐵 𝐴∈𝑥ڂ = 𝑦𝐴 𝐵∈𝑦ڂ‬
▪ 𝐵 ⊕ 𝐴 = 𝑥 ∶ 𝐵 ∩ Ǎ𝑥 𝑖𝑠 𝑛𝑜𝑡 𝑒𝑚𝑝𝑡𝑦
▪ 𝐴 ⊕ 𝐵1 ∪ 𝐵2 = 𝐴 ⊕ 𝐵1 ∪ 𝐴 ⊕ 𝐵2
▪ 𝐼𝑓 𝐴1 ⊂ 𝐴2 , 𝑡ℎ𝑒𝑛 𝐴1 ⊕ 𝐵 ⊂ 𝐴2 ⊕ 𝐵

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Operations on Subsets of Euclidean Space
▪ Minkowski Subtraction: 𝐴 ⊝ 𝐵 = ‫𝑦𝐴 𝐵∈𝑦ځ‬
▪ 𝑜𝑟 𝐴𝑐 ⊕ 𝐵 𝑐

▪ (A ⊝ Č) ⊕ 𝐶 ⊆ 𝐴 ⊆ (A ⊕ Č) ⊝ 𝐶

▪ Dilation: 𝑨 ⟼ 𝑨 ⊕ Č

▪ Erosion: 𝑨 ⟼ 𝑨 ⊝ Č
▪ Opening of A by C (Erosion followed by Dilation)
▪ Closing of A by C (Dilation followed by Erosion)

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Operations on Subsets of Euclidean Space
(a) The operations of erosion and opening
applied to a set. Components that overlap
are separated while small components and
roughnesses vanish or are reduced.

(b) The operations of dilation and closing


applied to a set. Gaps are closed up,
concavities vanish or are reduced, and
clusters of small particles are merged

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Monty Hall Problem Puzzle

• The host of a game show, offers the guest a choice of three doors. Behind one is a
expensive car, but behind the other two are goats.
After you have chosen one door, he reveals one of the other two doors behind which is a
goat (he wouldn't reveal a car).
Now he gives you the chance to switch to the other unrevealed door or stay at your
initial choice. You will then get what is behind that door.
You cannot hear the goats from behind the doors, or in any way know which door has
the prize.
Should you stay, or switch, or doesn't it matter?

Your first choice has a 1/3 chance of having the car, and that does not change.
The other two doors HAD a combined chance of 2/3, but now a Goat has been revealed
behind one, all the 2/3 chance is with the other door.

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Of Second Girl Child
• A family has two kids, one of them is a girl. Assume safely that the probability of each gender is 1/2.
What is the probability that the other kid is also a girl?

• ANS:
1/3

This is a famous question in understanding conditional probability, which simply means that given some information you might be
able to get a better estimate.

The following are possible combinations of two children that form a sample space in any earthly family:
Girl – Girl
Girl – Boy
Boy – Girl
Boy - Boy

Since we know one of the children is a girl, we will drop the Boy-Boy possibility from the sample space.
This leaves only three possibilities, one of which is two girls. Hence the probability is 1/3

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Hard Logic Probabilty Puzzle
• Bruna was first to arrive at a 100 seat theater.
She forgot her seat number and picks a random seat for herself.
After this, every single person who get to the theater sits on his/her seat if its
available else chooses any available seat at random.
Neymar is last to enter the theater and 99 seats were occupied.
Whats the probability what Neymar gets to sit in his own seat ?

• ANS: 1/2
one of two is the possibility
1. If any of the first 99 people sit in neymar seat, neymar will not get to sit in his
own seat.
2. If any of the first 99 people sit in Bruna's seat, neymar will get to sit in his seat.

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Space
▪ Probability space, which has three components (Ω; F; P),
respectively the sample space, event space, and probability
function

▪ Ω : sample space. Set of outcomes of an experiment

▪ In probability theory, the event space F is modelled as a σ-algebra


(or σ – field) of Ω, which is a collection of subsets of with the
following properties:
(1) Ω ∈ F
(2) If an event A ∈ F, then B ∈ F, then A – B ∈ F and
(3) If Ai ∈ F, then ‫( 𝐹 ∈ 𝑖𝐴 ڂ‬closed under countable union). A
countable sequence can be indexed using the natural integers.

▪ Probability function P assigns a number (“probability") to each


event in F. It is a function mapping F ⟶ [0,1] satisfying:
1. P(A) ≥ 0, for all A ∈ B.
2. P(Ω) = 1
3. Countable additivity: If Ai ∈ B are pairwise disjoint
(i.e., Ai ∩ Aj = 𝜙, for all i ≠ j), then P(‫∞ڂ‬ ∞
𝑖=1 𝐴𝑖 ) =σ𝑖=1 𝑃(𝐴𝑖 ).
Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Space (Sample Space and Event)
Probability space, which has three components (Ω; F; P), respectively the sample
space, event space, and probability function

Ω : sample space. Set of all possible outcomes of an experiment in the sample


space. An element of Ω is denoted by ω.

Example: Driving to work, a commuter passes through a sequence of three


intersections with traffic lights. At each light, she either stops, s, or continues, c.
The sample space is the set of all possible outcomes:
Ω = {ccc, ccs, css, csc, sss, ssc, scc, scs}
where csc, for example, denotes the outcome that the commuter continues through
the first light, stops at the second light, and continues through the third light.

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Space (Sample Space and Event)
Ω : sample space. Set of all possible outcomes of an experiment in the sample space. An
element of Ω is denoted by ω.

Example: Driving to work, a commuter passes through a sequence of three intersections


with traffic lights. At each light, she either stops, s, or continues, c. The sample space is
the set of all possible outcomes:
Ω = {ccc, ccs, css, csc, sss, ssc, scc, scs}

We are often interested in particular subsets of Ω, which in probability language are


called events. In the above example, the event that the commuter stops at the first light
is the subset of denoted by
A = {sss, ssc, scc, scs}

The algebra of set theory carries over directly into probability theory. The union of two
events, A and B, is the event C that either A occurs or B occurs or both occur:
C = A ∪ B. For example, if A is the event that the commuter stops at the first light (listed
before), and if B is the event that she stops at the third light,
B = {sss, scs, ccs, css}
then C is the event that she stops at the first light or stops at the third light and consists
of the outcomes that are in A or in B or in both:
C = {sss, ssc, scc, scs, ccs, css}

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Space (Sample Space and Event)
Similar to the union of two events, intersection of two events and complement of an
event is also defined.
The laws of set theory work for the events:

Commutative Laws:
A∪B=B∪A
A∩B=B∩A

Associative Laws:
(A ∪ B) ∪ C = A ∪ (B ∪ C)
(A ∩ B) ∩ C = A ∩ (B ∩ C)

Distributive Laws:
(A ∪ B) ∩ C = (A ∩ C) ∪ (B ∩ C)
(A ∩ B) ∪ C = (A ∪ C) ∩ (B ∪ C)

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Space (Event space F)
▪ In probability theory, the event space F is modelled as a σ-algebra (or σ – field) of
Ω, which is a collection of subsets of with the following properties:
(1) Ω ∈ F
(2) If an event A ∈ F, and B ∈ F, then A – B ∈ F and
(3) If Ai ∈ F, then ‫( 𝐹 ∈ 𝑖𝐴 ڂ‬closed under countable union). A countable
sequence can be indexed using the natural integers.

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Probability Measure
▪ Probability function P assigns a number (“probability") to each event in F. It is
a function mapping F ⟶ [0,1] satisfying:
1. P(A) ≥ 0, for all A ∈ F.
2. P(Ω) = 1
3. Countable additivity: If Ai ∈ F are pairwise disjoint (i.e., Ai ∩ Aj = 𝜙,
for all i ≠ j), then P(‫∞ڂ‬
𝑖=1 𝐴 𝑖 ) =σ∞
𝑖=1 𝑃(𝐴𝑖 ).

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining
Random Variable
▪ A random variable is a function from the sample space to the real
numbers, 𝑋 𝑟 , 𝑟 ∈ Ω, and measurable with respect to P, i.e for every
real number x, the set {𝑟: 𝑋 𝑟 < 𝑥} is an event in F

▪ CDF and PDF


➢ For a random variable X on (R, B(R), Px), we define its cumulative
distribution function (CDF) 𝐹𝑋 𝑥 ≡ 𝑃𝑥 (𝑋 ≤ 𝑥), for all x
(note that all the sets 𝑋 ≤ 𝑥 are in B(R)).

▪ Important: F(x) is a CDF if


1. lim 𝐹(𝑥) = 1 and lim 𝐹(𝑥) = 0
𝑥 →∞ 𝑥 →−∞
2. F(x) is non-decreasing
3. F(x) is right-continuous: for every x0, lim 𝐹(𝑥) = 𝐹(𝑥0 )
𝑥 →𝑥𝑜 +

Asim Tewari, IIT Bombay ME 781: Statistical Machine Learning and Data Mining

You might also like