Exp1ml

Uploaded by

Jui Bhanushali


Experiment No 1

Aim: a) Case study on 15 Python libraries used for machine learning

b) Case study on 10 machine learning tools

a) Case study on Python libraries used for machine learning:

Theory:

1. NumPy
Description: NumPy is the fundamental package for numerical computing in Python. It
provides support for arrays and matrices, along with a collection of mathematical functions to
operate on these data structures.
Features:
● Efficient multi-dimensional container of generic data.
● Mathematical functions for fast operations on arrays, including element-wise and
matrix operations.
● Support for large, multi-dimensional arrays and matrices.
● Broadcasting functions.
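
The element-wise, matrix, and broadcasting features listed above can be illustrated with a minimal sketch. The array values and variable names here are made up for illustration only.

```python
import numpy as np

# Hypothetical data: 3 samples (rows) with 2 features (columns).
X = np.array([[1.0, 10.0],
              [2.0, 20.0],
              [3.0, 30.0]])

# Column means have shape (2,). Broadcasting stretches them across
# all 3 rows, so no explicit loop is needed to center the data.
col_means = X.mean(axis=0)
centered = X - col_means

# Matrix operation: the 2x2 Gram matrix of the centered features.
gram = centered.T @ centered
```

After centering, every column of `centered` has mean zero, which is a common preprocessing step before many machine learning algorithms.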

2. Pandas
Description: Pandas is a powerful, flexible, and easy-to-use data analysis and data
manipulation library built on top of NumPy.
Features:
● DataFrame: Two-dimensional size-mutable, potentially heterogeneous tabular data
structure.
● Series: One-dimensional array with axis labels.
● Data alignment and handling of missing data.
● Tools for reading and writing data between in-memory data structures and different
formats (CSV, text, Excel, SQL databases).
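
As a rough sketch of these features, the example below reads a small CSV from memory, fills a missing value, and shows label-based alignment. The data and column names are invented for illustration.

```python
import io
import pandas as pd

# Hypothetical CSV content; the second row has a missing score.
csv_text = "name,score,group\nasha,81,a\nben,,b\nchen,92,a\n"
df = pd.read_csv(io.StringIO(csv_text))

# The blank score is parsed as NaN; fill it with the column mean.
df["score"] = df["score"].fillna(df["score"].mean())

# Series are aligned on index labels, not on position.
s1 = pd.Series([1, 2], index=["x", "y"])
s2 = pd.Series([10, 20], index=["y", "z"])
aligned = s1 + s2  # only "y" is in both; "x" and "z" become NaN
```

Filling with the mean is only one possible strategy for missing data; dropping rows or using a model-based imputer may suit other datasets better.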

3. SciPy
Description: SciPy is built on NumPy and provides a large number of higher-level functions
that are useful for scientific and technical computing.
Features:
● Modules for optimization, integration, interpolation, eigenvalue problems, algebraic
equations, and other tasks.
● Special functions, statistical distributions, and more.
● Integration with NumPy arrays for linear algebra, Fourier transform, and signal
processing.
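
Two of the modules mentioned above, integration and optimization, can be tried with a short sketch. The function `f` is a made-up example whose minimum is known in advance.

```python
import numpy as np
from scipy import integrate, optimize

# Numerically integrate sin(x) from 0 to pi; the exact value is 2.
area, abs_err = integrate.quad(np.sin, 0.0, np.pi)

# Minimize a simple quadratic whose minimum lies at (1, -2).
def f(p):
    x, y = p
    return (x - 1.0) ** 2 + (y + 2.0) ** 2

result = optimize.minimize(f, x0=[0.0, 0.0])
best_point = result.x
```

Both functions accept and return NumPy-compatible values, which is what the integration with NumPy arrays refers to.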

4. Scikit-Learn
Description: Scikit-Learn is a simple and efficient tool for data mining and data analysis,
built on NumPy, SciPy, and Matplotlib.
Features:
● Classification: SVM, nearest neighbors, random forest, logistic regression, etc.
● Regression: Lasso, ridge regression, SVR, etc.
● Clustering: K-means, spectral clustering, DBSCAN, etc.
● Dimensionality reduction: PCA, factor analysis, non-negative matrix factorization,
etc.
● Model selection: Grid search, cross-validation, and more.
● Preprocessing: Feature extraction and normalization.
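
A minimal sketch of how preprocessing, classification, and model selection fit together is shown below. It uses the bundled Iris dataset; the parameter grid and split sizes are arbitrary choices for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Preprocessing (scaling) and the classifier are chained in one pipeline,
# so the scaler is fitted only on the training folds during cross-validation.
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Grid search with 5-fold cross-validation over the regularization strength C.
search = GridSearchCV(pipe, {"logisticregression__C": [0.1, 1.0, 10.0]}, cv=5)
search.fit(X_train, y_train)

test_accuracy = search.score(X_test, y_test)
```

Keeping the scaler inside the pipeline avoids leaking information from the validation folds into preprocessing.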

5. TensorFlow
Description: TensorFlow is an open-source library developed by Google for deep learning
and machine learning tasks.
Features:
● Support for building and training deep learning models.
● Flexible architecture allows deployment across various platforms (CPUs, GPUs,
TPUs).
● TensorFlow Lite for mobile and embedded devices.
● TensorFlow.js for running models in the browser using JavaScript.
● TensorBoard for visualizing model training.

6. Keras
Description: Keras is a high-level neural networks API written in Python. Earlier releases
could run on top of TensorFlow, CNTK, or Theano; Keras 3 instead supports TensorFlow, JAX,
and PyTorch as backends.
Features:
● User-friendly API that makes building deep learning models easy and fast.
● Supports both convolutional networks and recurrent networks.
● Runs seamlessly on CPUs and GPUs.
● Modular and extensible, with a simple interface for building complex models.

7. PyTorch
Description: PyTorch is an open-source deep learning platform that provides a seamless path
from research prototyping to production deployment.
Features:
● Dynamic computation graph (define-by-run), allowing for flexible model building.
● Strong GPU acceleration support.
● TorchScript for transitioning between eager and graph execution modes.
● Distributed training support.
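
The define-by-run idea can be sketched in a few lines: the computation graph is recorded while ordinary Python code executes, and gradients are then obtained by backpropagation. The tensor values below are arbitrary.

```python
import torch

x = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)

# The graph is built as the code runs, so plain Python control flow
# (here an if/else) can decide which operations are recorded.
if x.sum() > 0:
    y = (x ** 2).sum()
else:
    y = x.sum()

# Backpropagate; for y = sum(x^2) the gradient is 2 * x.
y.backward()
grad = x.grad
```

This example runs on the CPU; moving tensors to a GPU is outside the scope of this sketch.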

8. Matplotlib
Description: Matplotlib is a plotting library for creating static, interactive, and animated
visualizations in Python.
Features:
● Comprehensive library for creating a wide variety of plots and charts.
● Integration with IPython/Jupyter notebooks for interactive plots.
● Extensive customization options for plot appearance.
● Support for embedding plots in applications using GUIs like Tkinter, wxPython, etc.
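
A small sketch of the customization options is given below. It assumes a non-interactive "Agg" backend so that it also runs on machines without a display; the data is invented.

```python
import io
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, no window is opened
import matplotlib.pyplot as plt

xs = [0, 1, 2, 3, 4]
ys = [x ** 2 for x in xs]

fig, ax = plt.subplots(figsize=(4, 3))
# Marker, line style, and label are a few of the appearance options.
ax.plot(xs, ys, marker="o", linestyle="--", label="y = x^2")
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.set_title("Quadratic growth")
ax.legend()

# Render the figure to an in-memory PNG instead of a file on disk.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
plt.close(fig)
png_bytes = buffer.getvalue()
```

In a Jupyter notebook the backend line can be dropped, and the figure is displayed inline instead of being written to a buffer.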

9. Seaborn
Description: Seaborn is a statistical data visualization library based on Matplotlib, providing a
high-level interface for drawing attractive and informative statistical graphics.
Features:
● Built-in themes to improve the aesthetic appeal of plots.
● Tools for visualizing univariate and bivariate distributions.
● Functions to visualize linear regression models.
● Integration with Pandas data structures.

10. Statsmodels
Description: Statsmodels is a library for estimating and testing statistical models, including
linear regression, generalized linear models, and more.
Features:
● Comprehensive collection of tools for statistical data analysis.
● Models for linear and nonlinear regression, time-series analysis, and more.
● Functions for hypothesis testing and statistical inference.
● Integration with Pandas for handling data.

11. XGBoost
Description: XGBoost is an optimized gradient boosting library designed to be highly
efficient, flexible, and portable.
Features:
● Highly efficient and scalable implementation of gradient boosting.
● Support for various objective functions, including regression, classification, and
ranking.
● Built-in cross-validation and early stopping.
● Parallel processing and GPU support for faster training.

12. LightGBM
Description: LightGBM is a gradient boosting framework that uses tree-based learning
algorithms, designed for performance and efficiency.
Features:
● Fast training and high efficiency through histogram-based split finding.
● Lower memory usage compared to other gradient boosting libraries.
● Support for large-scale data and parallel learning.
● Accurate and scalable, suitable for many machine learning tasks.

13. CatBoost
Description: CatBoost is a gradient boosting library with categorical features support, which
provides fast and scalable models.
Features:
● Support for categorical features without the need for extensive preprocessing.
● High performance and fast training speed.
● Robust against overfitting with built-in regularization techniques.
● Easy-to-use API compatible with other popular machine learning libraries.

14. NLTK (Natural Language Toolkit)
Description: NLTK is a suite of libraries and programs for symbolic and statistical natural
language processing (NLP) for English.
Features:
● Tokenization, stemming, tagging, parsing, and other NLP tasks.
● Text classification, language modeling, and more.
● Large collection of text corpora and lexical resources.
● Easy-to-use interfaces and comprehensive documentation.
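
Tokenization and stemming can be sketched as below. The example deliberately uses a regex-based tokenizer and the Porter stemmer, which should not require downloading any NLTK corpora; the sentence is made up.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

text = "The runners were running quickly."

# Split the text into word and punctuation tokens.
tokens = wordpunct_tokenize(text)

# Reduce each token to its stem (for example, "running" becomes "run").
stemmer = PorterStemmer()
stems = [stemmer.stem(token) for token in tokens]
```

Tasks such as tagging and parsing usually need additional resources fetched with `nltk.download`, which this sketch avoids.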

15. Gensim
Description: Gensim is a library for topic modeling and document similarity analysis, useful
in natural language processing and information retrieval tasks.
Features:
● Efficient implementations of popular topic modeling algorithms like LDA (Latent
Dirichlet Allocation).
● Tools for building document similarity models.
● Scalable and efficient, capable of handling large text corpora.
● Integration with other NLP libraries for preprocessing and analysis.

These libraries provide a solid foundation for a wide range of machine learning tasks,
from data preprocessing and visualization to building and deploying complex models.

b) Case study on 10 machine learning tools:

Theory:

1. Jupyter Notebook
Jupyter Notebook is an open-source web application that allows you to create and share
documents that contain live code, equations, visualizations, and narrative text.
Features:
● Supports over 40 programming languages, including Python, R, and Julia.
● Interactive data visualization and easy sharing of results.
● Integration with big data tools like Apache Spark.

2. Google Colab
Google Colab is a free cloud service that supports Python coding and provides free access to
GPU and TPU, facilitating machine learning model training.
Features:
● No setup required; runs in the cloud.
● Integration with Google Drive for easy file storage and access.
● Collaboration with multiple users in real-time.

3. Anaconda
Anaconda is a distribution of Python and R for scientific computing and data science. It
simplifies package management and deployment.
Features:
● Includes Conda, a package and environment manager.
● Comes pre-installed with popular data science libraries like NumPy, Pandas, and
SciPy.
● Anaconda Navigator, a graphical interface to manage environments and packages.

4. MLflow
MLflow is an open-source platform for managing the end-to-end machine learning lifecycle.
It tackles four primary functions: tracking experiments, packaging code into reproducible
runs, managing and deploying models, and providing a central model registry.
Features:
● Supports any machine learning library and programming language.
● MLflow Projects to package data science code.
● MLflow Models to deploy models to various platforms.

5. Weka
Weka is a collection of machine learning algorithms for data mining tasks. It contains tools
for data preparation, classification, regression, clustering, association rules mining, and
visualization.
Features:
● GUI support for easy model building and data analysis.
● Extensive collection of pre-implemented algorithms.
● Scripting and command-line support.

6. KNIME
KNIME is open-source software for creating data science applications and services. It
integrates various components for machine learning and data mining through its modular data
pipelining concept.
Features:
● Drag-and-drop interface for creating workflows.
● Supports integration with various data sources like databases and cloud services.
● Extensions for advanced analytics and big data processing.

7. RapidMiner
RapidMiner is a data science platform for teams that unites data prep, machine learning, and
model deployment. It features a drag-and-drop visual interface for building analytic
workflows.
Features:
● Automated machine learning for building and optimizing models.
● Real-time scoring and model deployment.
● Collaboration features for team-based data science projects.

8. H2O.ai
H2O.ai provides an open-source machine learning platform that makes it easy to build smart
applications.
Features:
● Supports distributed in-memory processing for speed and scale.
● Wide range of machine learning algorithms including deep learning.
● AutoML capabilities for automatic model selection and tuning.

9. Apache Spark
Apache Spark is an open-source unified analytics engine for large-scale data processing.
Features:
● In-memory computing for high-speed processing.
● Rich APIs in Java, Scala, Python, and R.
● Supports SQL, streaming data, machine learning, and graph processing.

10. Microsoft Azure ML Studio
Azure Machine Learning Studio is a collaborative, drag-and-drop tool you can use to build,
test, and deploy predictive analytics solutions on your data.
Features:
● Integration with Azure cloud services for scalable and secure deployments.
● Automated machine learning for building high-quality models quickly.
● Supports a wide range of data sources and integration with other Azure services like
Azure Databricks.

Conclusion: Thus, we conclude that Python libraries and machine learning tools are
essential for efficient, scalable, and collaborative model development and deployment,
driving innovation and effectiveness in various applications.
