CSE3029 – Data Mining and Analytics
Name: J. Abhi ram
Reg No: 21BCB7107
1.Use of Weka tool to apply 15 Unsupervised filters
(Attribute) and record the output
Details of Dataset
For this experiment we have taken a preloaded dataset in WEKA
named IRIS dataset, in this dataset we have mainly 5 attributes named
sepal length, sepal width, petal length, petal width, class and 150
instances.
No. of attributes = 5
No. of instances = 150
We also taken the dataset named glass which is preloaded in WEKA
software, in this dataset we have mainly 10 attributes and 214
instances.
No. of attributes = 10
No. of instances = 214
Now we apply filters under unsupervised Attributes,
FILTER 1: - Add
Description: - An instance filter that adds a new attribute to
the dataset.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 2: - Add clusters
Description: - A filter that adds a new nominal attribute
representing the cluster assigned to each instance by the specified
clustering algorithm.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 3 : - AddExpression
Description: - An instance filter that creates a new attribute
by applying a mathematical expression to existing attributes.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 4: - Add id
Description: - An instance filter that adds an ID attribute to
the dataset.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 5: -Addnoice
Description: - An instance filter that changes a percentage of
a given attribute's values.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 6: - Adduser fields
Description: - A filter that adds new attributes with user
specified type and constant value.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 7: - Addvalues
Description: - Adds the labels from the given list to an
attribute if they are missing.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 8: - CartesianProduct
Description: - A filter for performing the Cartesian product of
a set of nominal attributes.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 9: -Center
Description: - Centers all numeric attributes in the given
dataset to have zero mean (apart from the class attribute, if
set).
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 10: - ClusterMembership
Description: - A filter that uses a density-based clusterer to
generate cluster membership values; filtered instances are
composed of these values plus the class attribute (if set in the
input data).
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 11: - Discretize
Description: - An instance filter that discretizes a range of
numeric attributes in the dataset into nominal attributes.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 12 : - Interquartile Range
Description: - A filter for detecting outliers and extreme
values based on interquartile ranges.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 13: -KernalFilter
Description: - Converts the given set of data into a kernel
matrix.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 14: - NumericToNominal
Description: - A filter for turning numeric attributes into
nominal ones.
Dataset before applying the filter: -
Dataset After applying the filter: -
FILTER 15 : - RandomProjection
Description: - Reduces the dimensionality of the data by
projecting it onto a lower dimensional subspace using a
random matrix with columns of unit length.
Dataset before applying the filter: -
Dataset After applying the filter: -