
1. Identifying & Choosing the Right LLM Models (1 Hour)


Explanation:
When selecting a Large Language Model (LLM), it’s essential to consider various factors, including
architecture, size, performance metrics, and use cases.
• Use Case Analysis:
• Definition: Understanding the specific problem you aim to solve is crucial for
selecting the right model.
• Example: For text generation tasks, GPT-3 is preferred, while BERT is typically
used for tasks requiring text understanding.
• Performance Metrics:
• Common Metrics: GLUE, SuperGLUE, and F1 scores are essential for evaluation.
• Calculation Example: To compute the F1 score, use the formula:
F1 = 2 × (Precision × Recall) / (Precision + Recall)
• Model Size:
• Definition: Refers to the number of parameters in the model. Larger models often
capture language nuances better.
• Calculation Example: If comparing two models with parameter counts of 175
billion (GPT-3) and 110 million (BERT base), the relative size difference can be
calculated as: Size Ratio = 175 billion / 110 million ≈ 1,591
• Community Support:
• Definition: Strong community backing indicates better resources for troubleshooting
and examples.

Internal Working:
• Architecture of LLMs: Most LLMs, such as Transformers, utilize attention mechanisms.
The attention mechanism calculates weights based on input representations, allowing models
to focus on relevant parts of the text.
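
As a rough illustration of that mechanism, the scaled dot-product attention at the heart of Transformer models can be sketched in a few lines of NumPy. This is a simplified sketch (a single head, no masking, no learned projection matrices), not the exact implementation inside any particular LLM:
python

import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    """Compute attention weights from queries and keys, then mix the values."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # similarity between token representations
    weights = softmax(scores, axis=-1)  # how much each token attends to the others
    return weights @ V, weights

# Toy example: 4 tokens with embedding size 8
rng = np.random.default_rng(0)
Q = K = V = rng.normal(size=(4, 8))
output, weights = scaled_dot_product_attention(Q, K, V)
print(weights.round(2))  # each row sums to 1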

Practical Examples:
1. Listing Available Models:
python

from huggingface_hub import HfApi

# Initialize Hugging Face API
api = HfApi()

# List models available for text classification
models = api.list_models(filter="text-classification")
for model in models:
    print(f"Model Name: {model.modelId}, Downloads: {model.downloads}")


• Comparing Models:
python

from transformers import AutoModel, AutoTokenizer

model_names = ['gpt2', 'bert-base-uncased', 'distilbert-base-uncased']  # Note: 'gpt-2' is typically 'gpt2'

for model_name in model_names:
    try:
        tokenizer = AutoTokenizer.from_pretrained(model_name)
        model = AutoModel.from_pretrained(model_name)
        print(f"Model: {model_name}, Parameters: {model.num_parameters()}")
    except Exception as e:
        print(f"Error loading model {model_name}: {e}")


• Use Case Implementation:
python

• from transformers import pipeline

# Use a sentiment analysis model that is fine-tuned for this task


sentiment_model = pipeline("sentiment-analysis", model="distilbert-base-uncased-finetuned-sst-2-english")

# Example input
result = sentiment_model("I love machine learning!")
print(result) # Displays the sentiment analysis result


• Performance Comparison Program:
python

4. import numpy as np
model_accuracies = {
'gpt-2': 0.87,
'bert-base-uncased': 0.91,
'distilbert-base-uncased': 0.89,
}
# Calculate the average accuracy
average_accuracy = np.mean(list(model_accuracies.values()))
print(f"Average Model Accuracy: {average_accuracy:.2f}")


2. Understanding Model Metrics, Accuracy, and Other Benchmarks (1 Hour)


Explanation:
Understanding model performance is crucial for evaluating its effectiveness. Here are key metrics
used for this purpose:
• Accuracy:
• Calculation: Accuracy = (True Positives + True Negatives) / Total Samples
• Precision:
• Calculation: Precision = True Positives / (True Positives + False Positives)
• Recall:
• Calculation: Recall = True Positives / (True Positives + False Negatives)
• F1 Score:
• Calculation: F1 = 2 × (Precision × Recall) / (Precision + Recall)
• ROC-AUC:
• Calculation: AUC represents the area under the ROC curve, which is plotted with
the true positive rate against the false positive rate.

Internal Working:
• Statistical Foundations: Metrics like precision and recall are vital in applications where
class imbalance exists. Understanding how to calculate these metrics accurately is essential.
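
A small worked example (with made-up numbers) of why accuracy alone can mislead when classes are imbalanced, while precision and recall expose the problem:
python

from sklearn.metrics import accuracy_score, precision_score, recall_score

# Imbalanced toy data: 9 negative samples, 1 positive sample
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 0, 1]
y_pred = [0] * 10  # a model that always predicts the negative class

print(f"Accuracy:  {accuracy_score(y_true, y_pred):.2f}")                    # 0.90 looks good
print(f"Precision: {precision_score(y_true, y_pred, zero_division=0):.2f}")  # 0.00 reveals the problem
print(f"Recall:    {recall_score(y_true, y_pred):.2f}")                      # 0.00 reveals the problem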

Practical Examples:
1. Calculating Metrics:
python

• from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0] # True labels


y_pred = [1, 0, 1, 0, 0] # Predicted labels

accuracy = accuracy_score(y_true, y_pred)


precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)
f1 = f1_score(y_true, y_pred)

print(f"Accuracy: {accuracy:.2f}, Precision: {precision:.2f}, Recall: {recall:.2f}, F1 Score: {f1:.2f}")


• ROC-AUC Visualization:
python

• from sklearn.metrics import roc_auc_score, roc_curve


import matplotlib.pyplot as plt

# True labels
y_true = [1, 0, 1, 1, 0] # Example true labels
# Predicted probabilities
y_prob = [0.9, 0.1, 0.8, 0.7, 0.4] # Example predicted probabilities

# Calculate ROC AUC


roc_auc = roc_auc_score(y_true, y_prob)

# Compute ROC curve


fpr, tpr, thresholds = roc_curve(y_true, y_prob)

# Plotting
plt.figure() # Create a new figure
plt.plot(fpr, tpr, label=f'ROC curve (area = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], 'k--') # Diagonal line
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.0])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver Operating Characteristic')
plt.legend(loc="lower right")

# Save the plot as an image file


plt.savefig('roc_curve.png') # Save as PNG file
plt.close() # Close the plot

print("ROC curve saved as 'roc_curve.png'")


Understanding the ROC Curve

ROC Curve:
Axes:
The x-axis represents the False Positive Rate (FPR), which is the proportion of negative cases that
are incorrectly classified as positive. It ranges from 0 to 1.
The y-axis represents the True Positive Rate (TPR), also known as sensitivity or recall, which is the
proportion of actual positive cases that are correctly identified. It also ranges from 0 to 1.

Curve Shape:
The curve plots TPR against FPR at various threshold settings.
A model that perfectly classifies all positives and negatives would have a point at (0,1), meaning 0
false positives and 100% true positives. This would look like a curve that rises sharply to the top
left corner.

Diagonal Line:
The dashed line (y = x) represents a random classifier. If your model performs no better than
random guessing, it will fall along this line.
The goal is for your model's ROC curve to be above this line.

Area Under the Curve (AUC):


The area under the ROC curve (AUC) is a single scalar value that summarizes the performance of
the model across all thresholds.
An AUC of 0.5 suggests no discrimination (random guessing), while an AUC of 1.0 indicates
perfect discrimination.
A higher AUC value (closer to 1) means better model performance.

Interpretation of Your Graph


Look at the Shape: If your ROC curve is significantly above the diagonal line, it indicates that your
model is doing a good job distinguishing between the positive and negative classes.

AUC Value: The value you calculated (shown in the legend) will help you understand overall model
performance. For instance:
AUC ≈ 0.9: Very good model.
AUC ≈ 0.7 - 0.8: Acceptable model.
AUC < 0.5: The model is performing worse than random guessing.

Summary

The ROC curve and AUC are powerful tools for evaluating classification models, particularly when
dealing with imbalanced datasets. By visualizing and quantifying the performance, you can better
understand how well your model will perform in practice.

If you have any specific questions about certain parts of the graph or would like to know more
about interpreting the results, feel free to ask!


• Cross-Validation:
python

3. from sklearn.model_selection import cross_val_score


from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import load_iris
# Load dataset
X, y = load_iris(return_X_y=True)
model = RandomForestClassifier()
scores = cross_val_score(model, X, y, cv=5) # 5-fold cross-validation
print(f"Cross-Validation Scores: {scores}")
print(f"Mean Score: {scores.mean():.2f}")

Cross-Validation Scores:
The output shows an array of scores: [0.96666667, 0.96666667, 0.93333333, 0.96666667, 1.].
These scores represent the accuracy of the model on each of the 5 folds during cross-validation.
Specifically:
Fold 1: 96.67% accuracy
Fold 2: 96.67% accuracy
Fold 3: 93.33% accuracy
Fold 4: 96.67% accuracy
Fold 5: 100% accuracy
Each score indicates how well the model performed on a subset of the data that it hasn't seen
during training.

Mean Score:
The mean score is calculated as the average of the individual fold scores: scores.mean().
In this case, the mean score is approximately 0.97, which translates to a 97% accuracy across all
folds.
This metric gives you a single value that summarizes the model's performance across different
splits of the data.
Interpretation

High Accuracy: The scores indicate that the Random Forest classifier is performing very well on
the Iris dataset, achieving accuracies close to 97% in most folds and even 100% in one fold.

Stability of Performance: The scores are relatively consistent (most scores are around 96-97%),
suggesting that the model's performance is stable across different subsets of the data. The drop
in the third fold (93.33%) might indicate that this particular split had some more challenging
samples or that the training set for that fold didn’t capture the full variety of the data.

Mean Score Significance: The mean score of 0.97 is particularly promising, indicating that you can
expect the model to generalize well to unseen data, as it performed consistently across the cross-
validation folds.

3. Hugging Face Datasets (1 Hour)


Explanation:
The Hugging Face Datasets library provides various datasets for NLP tasks. Understanding how to
use these datasets is crucial for model training.
• Loading Datasets:
• Example: Load datasets from multiple sources (CSV, JSON, etc.).
• Data Preprocessing Steps:
• Tokenization: Convert text into tokens.
• Padding: Ensure uniform input lengths.
• Normalization: Standardize text formats.
• Dataset Splits:
• Training, Validation, and Test Sets: Used for different phases of model
development.

Internal Working:
• Dataset Loading Mechanism: Datasets can be loaded from URLs, local files, or Hugging
Face’s dataset repository.
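
For example, the same load_dataset call also accepts local files or URLs; a minimal sketch, where reviews.csv is a hypothetical local file with text and label columns:
python

from datasets import load_dataset

# Load from a local CSV file (hypothetical file name)
local_dataset = load_dataset("csv", data_files="reviews.csv")

# Loading from a URL works the same way:
# remote_dataset = load_dataset("csv", data_files="https://example.com/reviews.csv")

print(local_dataset)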

Practical Examples:
1. Loading a Dataset:
python

from datasets import load_dataset

# Load the IMDb dataset


dataset = load_dataset('imdb')
print(dataset) # View dataset splits
Output Breakdown
DatasetDict:
This indicates that the dataset is organized as a dictionary containing multiple splits. In this case,
it has three splits: train, test, and unsupervised.

Train Split:
Key: train
Content: Dataset({ features: ['text', 'label'], num_rows: 25000 })
This indicates that the training dataset contains 25,000 samples. Each sample has two features:
text: The actual movie review.
label: The sentiment label (typically 0 for negative and 1 for positive).

Test Split:
Key: test
Content: Dataset({ features: ['text', 'label'], num_rows: 25000 })
This split is similar to the training dataset and also contains 25,000 samples. It's used for
evaluating the performance of your model after training.

Unsupervised Split:
Key: unsupervised
Content: Dataset({ features: ['text', 'label'], num_rows: 50000 })
This split contains 50,000 samples and is typically used for tasks where labels are not needed
(e.g., unsupervised learning tasks). However, it still contains the label feature, which might be
used for certain analyses or evaluations.


• Exploring the Dataset:
python

from datasets import load_dataset

# Load the IMDb dataset (cached after the first download)
dataset = load_dataset('imdb')

# Inspect the first training example: the review text and its label
print(dataset['train'][0])

# Inspect the dataset schema and the size of each split
print(dataset['train'].features)
print({split: dataset[split].num_rows for split in dataset})


• Tokenizing Text Data:
python

• from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('distilbert-base-uncased')
encoded = tokenizer(dataset['train'][0]['text'], padding='max_length',
truncation=True)
print(encoded) # Display encoded tokens

• Splitting Dataset:
python

4. from sklearn.model_selection import train_test_split

# Convert dataset to pandas DataFrame for splitting


import pandas as pd

df = pd.DataFrame(dataset['train'])
train_df, val_df = train_test_split(df, test_size=0.2, random_state=42)

print(f"Training set size: {len(train_df)}, Validation set size:


{len(val_df)}")

4. Code Documentation in Hugging Face (1 Hour)


Explanation:
Proper code documentation is essential for collaboration and maintenance. Here’s how to document
code effectively using Hugging Face libraries.
• Docstrings:
• Definition: Use docstrings to describe classes, methods, and functions. They should
include parameters, return types, and usage examples.
• Inline Comments:
• Definition: Use comments to explain specific lines of code for clarity.
• Function Documentation: Follow a consistent style for documenting functions.
Internal Working:
• Documentation Standards: Adhere to standards like PEP 257 for Python, ensuring
consistent documentation style.

Practical Examples:
1. Using Docstrings:
python

• from transformers import pipeline

class SentimentModel:
"""
A class to perform sentiment analysis using a pre-trained model.

Attributes:
model_name (str): The name of the pre-trained model.
model (Pipeline): The Hugging Face pipeline for sentiment analysis.
"""

def __init__(self, model_name):


"""
Initializes the SentimentModel with a specified model name.

Parameters:
model_name (str): The name of the pre-trained model to load.
"""
self.model_name = model_name
self.model = self.load_model()

def load_model(self):
"""Loads the pre-trained model and returns it."""
# Load the sentiment analysis model from Hugging Face
sentiment_model = pipeline("sentiment-analysis", model=self.model_name)
return sentiment_model

def analyze_sentiment(self, text):


"""
Analyzes the sentiment of the provided text.

Parameters:
text (str): The text to analyze.

Returns:
dict: The sentiment analysis result.
"""
result = self.model(text)
return result

# Example usage:
if __name__ == "__main__":
model_name = "distilbert-base-uncased-finetuned-sst-2-english" # Specify your model name
sentiment_model = SentimentModel(model_name)
# Analyze sentiment of a sample text
text = "I love machine learning!"
result = sentiment_model.analyze_sentiment(text)
print(result) # Display the result


• Inline Comments:
python

• # Load the sentiment analysis model


sentiment_model = pipeline("sentiment-analysis")

• Function Documentation:
python

• def analyze_sentiment(text):
"""
Analyzes the sentiment of the provided text.

Parameters:
text (str): The text to analyze.

Returns:
dict: A dictionary with sentiment labels and scores.
"""
result = sentiment_model(text) # Call the model for sentiment analysis
return result

• Usage Example with Documentation:


python

4. # Example usage of the SentimentModel


model = SentimentModel(model_name="distilbert-base-uncased")
analysis = model.analyze_sentiment("I love using Hugging Face models!")
print(analysis) # Outputs the sentiment result

5. Loading & Running a Hugging Face Model in Google Colab (1 Hour)


Explanation:
Google Colab provides an accessible platform for running Hugging Face models in a cloud
environment. It offers free GPU access, making it suitable for deep learning tasks.
• Setting Up Environment:
• Installing Packages: Ensure you have the necessary libraries.
• Resource Allocation: Colab allows you to allocate GPUs or TPUs for your models.
• Running a Model:
• Pipeline Creation: Hugging Face's pipeline API simplifies the process of loading
and using models.
• Input/Output Management: Handle input text and retrieve model predictions
effectively.

Internal Working:
• Resource Management: Colab dynamically allocates resources based on demand.
Understanding how to efficiently manage these resources can optimize performance.
• Session Management: Be aware of session timeouts and save progress frequently to avoid
data loss.
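
A short sketch covering both points: checking which accelerator the session actually received, and saving work to Google Drive so a session timeout does not lose it (the save path is only an example):
python

import torch

# Check whether Colab allocated a GPU to this session
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Running on: {device}")
if device == "cuda":
    print(torch.cuda.get_device_name(0))

# Mount Google Drive so checkpoints survive session timeouts
from google.colab import drive
drive.mount('/content/drive')
# model.save_pretrained('/content/drive/MyDrive/my_model')  # example save location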

Practical Examples:
1. Setting Up Colab Environment:
python

• # Install the transformers library


!pip install transformers

• Loading a Model for Sentiment Analysis:


python

• !pip install transformers

from transformers import pipeline

# Load the sentiment analysis pipeline


sentiment_analysis = pipeline("sentiment-analysis")

# Analyze sentiment
result = sentiment_analysis("I love using Hugging Face models!")
print(result) # Displays sentiment analysis result

• Using a Model for Text Generation:
python

• from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load pre-trained model and tokenizer


model_name = "gpt2"
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

# Encode and generate text


input_text = "Once upon a time"
input_ids = tokenizer.encode(input_text, return_tensors='pt')
output = model.generate(input_ids, max_length=50)

# Decode and print the output


generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
print(generated_text)

• GPU Utilization Example:


python

# Check available devices and use a GPU if one is available


import torch
device = "cuda" if torch.cuda.is_available() else "cpu"
model = GPT2LMHeadModel.from_pretrained("gpt2").to(device)
# Inference with multi-GPU setup
input_ids = tokenizer.encode("The future of AI is",
return_tensors='pt').to(device)
output = model.generate(input_ids, max_length=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))

6. How to Industrialize Models and Enable ModelOps (1 Hour)


Explanation:
ModelOps refers to the operational practices required to deploy, manage, and scale machine
learning models in production environments. Effective ModelOps ensures that models remain
relevant and perform well over time.
• Continuous Integration/Continuous Deployment (CI/CD):
• Definition: CI/CD refers to automated processes that help integrate code changes
and deploy models.
• Importance: This approach reduces the risk of errors during updates and allows for
rapid iteration.
• Monitoring and Logging:
• Definition: Continuous tracking of model performance metrics and logging of input-
output pairs.
• Importance: Monitoring helps identify issues like data drift and performance
degradation in real-time.
• Version Control:
• Definition: Using systems like Git to manage changes in code and model files.
• Importance: Facilitates collaboration and allows teams to roll back to previous
versions if necessary.

Internal Working:
• Automation in Model Deployment: Understanding how to set up automated workflows
that trigger model deployment can save significant time and resources.
• Performance Metrics Tracking: Familiarize yourself with tools like Prometheus or
Grafana for monitoring model performance over time.
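
As an illustration of the monitoring idea, the prometheus_client package can expose model metrics over HTTP for a Prometheus server to scrape and Grafana to visualize. This is a minimal sketch under those assumptions, not a production setup:
python

import random
import time
from prometheus_client import Counter, Gauge, start_http_server

# Metrics exposed for Prometheus to scrape
prediction_counter = Counter("model_predictions", "Number of predictions served")
accuracy_gauge = Gauge("model_accuracy", "Latest measured model accuracy")

# Serve metrics at http://localhost:8000/metrics
start_http_server(8000)

while True:
    prediction_counter.inc()
    accuracy_gauge.set(random.uniform(0.85, 0.95))  # placeholder for a real accuracy measurement
    time.sleep(5)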

Practical Examples:
1. Setting Up a Simple CI/CD Pipeline:
bash

• # Example of a simple CI/CD command using Git


git add .
git commit -m "Update model"
git push origin main # Deploy changes to production

• Logging Model Predictions:


python

• import logging

# Set up logging
logging.basicConfig(level=logging.INFO, force=True)

def log_prediction(input_text, prediction):


"""
Logs the input text and its corresponding prediction.
"""
logging.info(f"Input: {input_text}, Prediction: {prediction}")

# Example usage
log_prediction("I love this product!", "Positive")


• Version Control with Git:
bash

• # Basic commands for version control


git init # Initialize a new Git repository
git add model.py # Stage changes
git commit -m "Initial model implementation" # Commit changes

• Model Monitoring Example:


python

import logging

# Set up logging
logging.basicConfig(level=logging.INFO, force=True)

def monitor_accuracy(y_true, y_pred, threshold=0.80):
    """Log batch accuracy and warn when it drops below the threshold."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    accuracy = correct / len(y_true)
    logging.info(f"Batch accuracy: {accuracy:.2f}")
    if accuracy < threshold:
        logging.warning("Accuracy below threshold -- investigate data drift or consider retraining.")

# Example usage with illustrative labels
monitor_accuracy([1, 0, 1, 1, 0], [1, 0, 0, 1, 0])

7. When and How to Recalibrate, Re-train, Re-build Models (1 Hour)
Explanation:
Model recalibration, retraining, or rebuilding is essential to ensure ongoing performance as data
changes over time.
• When to Re-train:
• Data Drift: A change in the statistical properties of the target variable over time.
• Performance Drop: Significant declines in performance metrics indicate that the
model may no longer be reliable.
• How to Re-train:
• Incremental Learning: Training a model on new data while keeping its existing
knowledge.
• Transfer Learning: Leveraging pre-trained models and fine-tuning them on new
datasets.

Internal Working:
• Understanding Data Drift: Know how to detect data drift through statistical tests or
visualizations, which can inform the need for model updates.
• Retraining Strategies: Familiarize yourself with various retraining techniques, such as
batch retraining vs. online learning.

Practical Example:
1. Detecting Data Drift:
python

• import numpy as np
from scipy.stats import ks_2samp

def check_data_drift(old_data, new_data):


"""
Check for data drift by performing a Kolmogorov-Smirnov test.
"""
stat, p_value = ks_2samp(old_data, new_data)
return p_value < 0.05 # Data drift detected if p-value < 0.05

# Example usage
old_dataset = np.random.normal(0, 1, 1000) # Old data
new_dataset = np.random.normal(1, 1, 1000) # New data with drift

if check_data_drift(old_dataset, new_dataset):
print("Data drift detected, consider retraining the model.")
else:
print("No data drift detected.")


• Incremental Learning:
python

• !pip install scikit-learn


from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
import numpy as np

# Generate some example data
X, y = np.random.rand(100, 10), np.random.randint(0, 2, 100)

# Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Create the model and fit it on the initial batch of data
model = SGDClassifier()
model.partial_fit(X_train, y_train, classes=np.array([0, 1]))

# Later, incrementally update the model on newly arriving data
# without retraining from scratch
X_new, y_new = np.random.rand(20, 10), np.random.randint(0, 2, 20)
model.partial_fit(X_new, y_new)

# Make predictions on the test set
y_pred = model.predict(X_test)

# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy: {accuracy:.2f}")

# Print some predictions
print("\nSome predictions:")
for i in range(5):
    print(f"Input: {X_test[i]}, Predicted: {y_pred[i]}, Actual: {y_test[i]}")

• Transfer Learning Example:
python

from transformers import AutoModelForSequenceClassification, Trainer, TrainingArguments
from datasets import Dataset
from transformers import DistilBertTokenizer

# Load the tokenizer for the DistilBERT model


tokenizer = DistilBertTokenizer.from_pretrained('distilbert-base-uncased')

# Sample data: a small dataset with text and corresponding labels (1 for positive, 0 for negative)
data = {
'text': [
"I love this product!",
"This is the worst experience I've ever had.",
"Amazing quality!",
"Not worth the money."
],
'label': [1, 0, 1, 0] # Labels for each text entry
}

# Create a Hugging Face Dataset from the sample data


new_train_dataset = Dataset.from_dict(data)

# Function to tokenize the dataset


def tokenize_function(examples):
    # Tokenize the text, adding padding and truncating as necessary
    return tokenizer(examples['text'], padding="max_length", truncation=True)

# Apply the tokenization function to the dataset


tokenized_datasets = new_train_dataset.map(tokenize_function, batched=True)

# Load the pre-trained DistilBERT model for sequence classification


model = AutoModelForSequenceClassification.from_pretrained('distilbert-base-uncased', num_labels=2)

# Define training arguments for the Trainer


training_args = TrainingArguments(
output_dir='./results', # Directory to save model outputs
num_train_epochs=3, # Number of epochs to train
per_device_train_batch_size=2, # Batch size per device
)

# Initialize the Trainer with the model, training arguments, and tokenized dataset
trainer = Trainer(
model=model,
args=training_args,
train_dataset=tokenized_datasets,
)

# Fine-tune the model on the new dataset


trainer.train()

Breakdown of the Output:


1. global_step=6: This indicates that the model has completed 6 training steps. Each step
corresponds to the processing of a batch of data.
2. training_loss=0.5975738366444906: This value represents the loss during
training. Loss is a measure of how well the model is performing; lower values indicate better
performance. A loss of around 0.6 suggests that the model is learning, but there may still be
room for improvement.
3. metrics: This dictionary contains additional information about the training process:

• train_runtime: 52.3633: The total time taken to complete the training, in seconds.
• train_samples_per_second: 0.229: The number of training samples processed per second.
• train_steps_per_second: 0.115: The number of training steps processed per second.
• total_flos: 1589608783872.0: Total floating-point operations performed during training, which is a measure of the computational complexity of the training process.
• train_loss: 0.5975738366444906: This is repeated from above, confirming the final training loss after the training run.
• epoch: 3.0: Indicates that training has completed for 3 epochs. An epoch is one complete pass through the entire training dataset.

8. When Should I Re-train the Model (30 Minutes)


Explanation:
Identifying the right timing for model retraining is crucial for maintaining accuracy.
• Signs to Re-train:
• Performance Monitoring: If performance metrics drop below a defined threshold.
• Change in Input Data: When the data distribution shifts, necessitating adjustments
to the model.

Internal Working:
• Monitoring Strategies: Implement effective monitoring strategies that alert you to
performance degradation.

Practical Example:
1. Performance Monitoring:
python

from sklearn.metrics import accuracy_score

# Sample data (you would normally use a separate validation set)
true_labels = [1, 0, 1, 0]  # Example true labels for your validation set
predictions = [1, 0, 0, 0]  # Example predictions from your model

def monitor_performance(y_true, y_pred):
    """
    Monitor model performance and check if retraining is necessary.
    """
    accuracy = accuracy_score(y_true, y_pred)
    print(f"Accuracy: {accuracy:.2f}")  # Print accuracy for reference
    if accuracy < 0.80:  # Example threshold
        print("Performance drop detected, consider retraining the model.")

# Call the performance monitoring function
monitor_performance(true_labels, predictions)


9. Model Retraining Options like PEFT, LoRA (30 Minutes)


Explanation:
Retraining strategies like PEFT (Parameter-Efficient Fine-Tuning) and LoRA (Low-Rank
Adaptation) optimize the training process by adjusting fewer parameters.
• PEFT:
• Definition: Focuses on updating a small number of parameters, making it
computationally efficient.
• LoRA:
• Definition: Decomposes the weight updates into low-rank matrices, allowing
effective training with fewer updates.

Internal Working:
• Optimization Techniques: Understanding how these methods reduce computational load
while maintaining performance.

Practical Example:
1. Using PEFT:
python

from peft import LoraConfig, TaskType, get_peft_model

# Define a parameter-efficient fine-tuning configuration (LoRA is one PEFT method)
peft_config = LoraConfig(task_type=TaskType.SEQ_CLS, r=8, lora_alpha=16, lora_dropout=0.1)

# Wrap a pre-trained model (e.g., a sequence classification model) with PEFT
model = get_peft_model(pretrained_model, peft_config)
model.print_trainable_parameters()  # Shows how few parameters are actually trainable

• Using LoRA:
python

from peft import LoraConfig, get_peft_model

# LoRA decomposes the weight updates into low-rank matrices; r sets their rank
lora_config = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.1)

# Wrap the base model with LoRA adapters
lora_model = get_peft_model(base_model, lora_config)

10. Perform Fine-tuning of an LLM Using PEFT (30 Minutes)


Explanation:
Fine-tuning a model using PEFT allows for effective adaptation of pre-trained models on specific
tasks without extensive resource usage.
• Training Steps:
• Data Preparation: Ensure data is cleaned and formatted.
• Model Configuration: Set hyperparameters for fine-tuning.
Internal Working:
• Training Dynamics: Understand how PEFT updates only a subset of parameters, leading to
faster convergence.

Practical Example:
1. Fine-tuning Example:
python

1. # Step 1: Install necessary libraries


!pip install datasets transformers

# Step 2: Import libraries


from transformers import Trainer, TrainingArguments, AutoModelForSequenceClassification, AutoTokenizer
from datasets import load_dataset

# Step 3: Load a pre-trained model and tokenizer


model_name = 'distilbert-base-uncased'
model = AutoModelForSequenceClassification.from_pretrained(model_name,
num_labels=2)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Step 4: Load a dataset


dataset = load_dataset("imdb")

# Step 5: Tokenize the dataset


def tokenize_function(examples):
    return tokenizer(examples['text'], padding='max_length', truncation=True)

# Apply the tokenization function to the dataset


tokenized_datasets = dataset.map(tokenize_function, batched=True)

# Select the training dataset


train_dataset = tokenized_datasets["train"]

# Step 6: Define training arguments


training_args = TrainingArguments(
output_dir='./results',
num_train_epochs=3,
per_device_train_batch_size=16,
logging_dir='./logs', # Directory for storing logs
)

# Step 7: Initialize the Trainer


trainer = Trainer(
model=model,
args=training_args,
train_dataset=train_dataset,
)
# Step 8: Train the model
trainer.train()
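
The walkthrough above fine-tunes all of the model's parameters. To make it parameter-efficient, as the section title suggests, the model can be wrapped with a PEFT configuration before it is handed to the Trainer; a minimal sketch, assuming the peft library is installed:
python

from peft import LoraConfig, TaskType, get_peft_model

# Wrap the classification model with LoRA adapters so that only the
# small adapter weights are updated during training
peft_config = LoraConfig(task_type=TaskType.SEQ_CLS, r=8, lora_alpha=16, lora_dropout=0.1)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()

# The wrapped model is then passed to the Trainer exactly as in Step 7 above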


11. Small Language Models (30 Minutes)


Explanation:
Small Language Models (SLMs) offer efficient deployment options due to their reduced size while
retaining reasonable performance.
• Advantages:
• Speed: Faster inference times suitable for real-time applications.
• Resource Efficiency: Lower memory footprint allows deployment on constrained
devices.

Internal Working:
• Architecture Optimization: Techniques like quantization and pruning contribute to smaller
models without significant performance loss.

Practical Example:
1. Loading a Small Model:
python

• from transformers import AutoModelForSequenceClassification

# Load a smaller model


model = AutoModelForSequenceClassification.from_pretrained('distilbert-base-uncased')

• Inference with Small Models:


python

• # Example input text


text = "This is a test."

# Perform inference
result = sentiment_analysis(text)
print(result) # Output sentiment prediction

• Quantization Example:
python

import torch
from torch.quantization import quantize_dynamic

# Quantize the model to reduce size and improve speed
quantized_model = quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)
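
Pruning, the other technique mentioned above, can be illustrated with PyTorch's built-in pruning utilities. A minimal sketch, assuming model is the transformer loaded earlier and using an illustrative 30% pruning amount:
python

import torch
import torch.nn.utils.prune as prune

# Apply L1-norm unstructured pruning to every linear layer in the model
for module in model.modules():
    if isinstance(module, torch.nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # make the pruning permanent

print("Pruned 30% of the weights in each linear layer.")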

The sections below revisit the same topics, expanding each calculation with a complete worked example and adding a more detailed breakdown of the Git and CI/CD workflow, including its inner steps.

1. Identifying & Choosing the Right LLM Models (1 Hour)


Explanation:
When selecting a Large Language Model (LLM), it’s essential to consider various factors, including
architecture, size, performance metrics, and use cases.
• Use Case Analysis:
• Definition: Understanding the specific problem you aim to solve is crucial for
selecting the right model.
• Example: For text generation tasks, GPT-3 is preferred, while BERT is typically
used for tasks requiring text understanding.
• Performance Metrics:
• Common Metrics: GLUE, SuperGLUE, and F1 scores are essential for evaluation.
• Calculation Example: To compute the F1 score, use the formula:
F1 = 2 × (Precision × Recall) / (Precision + Recall)
Complete Example:
python

• # Sample values for true positives, false positives, and false negatives
true_positives = 70
false_positives = 30
false_negatives = 20

# Calculate Precision and Recall


precision = true_positives / (true_positives + false_positives)
recall = true_positives / (true_positives + false_negatives)

# Calculate F1 Score
f1_score = 2 * (precision * recall) / (precision + recall)

print(f"Precision: {precision:.2f}, Recall: {recall:.2f}, F1 Score:


{f1_score:.2f}")

• Model Size:
• Definition: Refers to the number of parameters in the model. Larger models often capture
language nuances better.
• Calculation Example: If comparing two models with parameter counts of 175 billion
(GPT-3) and 110 million (BERT base), the relative size difference can be calculated as:
Size Ratio = 175 billion / 110 million ≈ 1,591
Complete Example:
python

gpt3_params = 175_000_000_000  # 175 billion
bert_params = 110_000_000      # 110 million

size_ratio = gpt3_params / bert_params

print(f"GPT-3 is approximately {size_ratio:.0f} times larger than BERT.")

• Community Support:
• Definition: Strong community backing indicates better resources for troubleshooting
and examples.

Internal Working:
• Architecture of LLMs: Most LLMs, such as Transformers, utilize attention mechanisms.
The attention mechanism calculates weights based on input representations, allowing models
to focus on relevant parts of the text.

Practical Examples:
1. Listing Available Models:
python

from huggingface_hub import HfApi

# Initialize Hugging Face API


api = HfApi()
# List models available for text classification
models = api.list_models(filter="text-classification")
for model in models:
print(f"Model Name: {model.modelId}, Downloads: {model.downloads}")

• Comparing Models:
python

• from transformers import AutoModel, AutoTokenizer

model_names = ['gpt2', 'bert-base-uncased', 'distilbert-base-uncased']

for model_name in model_names:
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    print(f"Model: {model_name}, Parameters: {model.num_parameters()}")

• Use Case Implementation:


python

• from transformers import pipeline


# Use a sentiment analysis model that is fine-tuned for this task
sentiment_model = pipeline("sentiment-analysis", model="distilbert-base-uncased-finetuned-sst-2-english")
result = sentiment_model("I love machine learning!")
print(result) # Displays the sentiment analysis result

• Performance Comparison Program:


python

4. import numpy as np

model_accuracies = {
'gpt-2': 0.87,
'bert-base-uncased': 0.91,
'distilbert-base-uncased': 0.89,
}

# Calculate the average accuracy


average_accuracy = np.mean(list(model_accuracies.values()))
print(f"Average Model Accuracy: {average_accuracy:.2f}")

2. Understanding Model Metrics, Accuracy, and Other Benchmarks (1 Hour)


Explanation:
Understanding model performance is crucial for evaluating its effectiveness. Here are key metrics
used for this purpose:
• Accuracy:
• Calculation: Accuracy = (True Positives + True Negatives) / Total Samples
Complete Example:
python

• # Sample data
true_positives = 80
true_negatives = 50
false_positives = 10
false_negatives = 5
total_samples = true_positives + true_negatives + false_positives + false_negatives

# Calculate accuracy
accuracy = (true_positives + true_negatives) / total_samples
print(f"Accuracy: {accuracy:.2f}")

• Precision:
• Calculation: Precision = True Positives / (True Positives + False Positives)
Complete Example:
python

• # Sample values
true_positives = 70
false_positives = 30

# Calculate Precision
precision = true_positives / (true_positives + false_positives)
print(f"Precision: {precision:.2f}")

• Recall:
• Calculation: Recall = True Positives / (True Positives + False Negatives)
Complete Example:
python

• # Sample values
true_positives = 70
false_negatives = 20

# Calculate Recall
recall = true_positives / (true_positives + false_negatives)
print(f"Recall: {recall:.2f}")

• F1 Score:
• Calculation: F1 = 2 × (Precision × Recall) / (Precision + Recall)
Complete Example:
python

• # Using previously calculated Precision and Recall


f1_score = 2 * (precision * recall) / (precision + recall)
print(f"F1 Score: {f1_score:.2f}")

• ROC-AUC:
• Calculation: AUC represents the area under the ROC curve, which is plotted with the true
positive rate against the false positive rate.
Complete Example:
python

• from sklearn.metrics import roc_auc_score, roc_curve


import matplotlib.pyplot as plt

# Sample true labels and predicted probabilities


y_true = [1, 0, 1, 1, 0]
y_prob = [0.9, 0.1, 0.8, 0.7, 0.4]

# Calculate ROC AUC


roc_auc = roc_auc_score(y_true, y_prob)

# Calculate ROC curve


fpr, tpr, thresholds = roc_curve(y_true, y_prob)

# Plot ROC curve


plt.plot(fpr, tpr, label=f'ROC curve (area = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], 'k--') # Diagonal line
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.0])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver Operating Characteristic')
plt.legend(loc="lower right")
plt.show()

Internal Working:
• Statistical Foundations: Metrics like precision and recall are vital in applications where
class imbalance exists. Understanding how to calculate these metrics accurately is essential.

Practical Examples:
1. Calculating Metrics:
python

from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0] # True labels


y_pred = [1, 0, 1, 0, 0] # Predicted labels

accuracy = accuracy_score(y_true, y_pred)


precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)
f1 = f1_score(y_true, y_pred)

print(f"Accuracy: {accuracy:.2f}, Precision: {precision:.2f}, Recall:


{recall:.2f}, F1 Score: {f1:.2f}")

• Visualizing Metrics:
python

• import matplotlib.pyplot as plt

metrics = [accuracy, precision, recall, f1]


labels = ['Accuracy', 'Precision', 'Recall', 'F1 Score']

plt.bar(labels, metrics)
plt.ylabel('Score')
plt.title('Model Performance Metrics')
plt.show()

• Evaluating Model Performance:


python

3. from sklearn.model_selection import train_test_split


from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

# Load data
iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(iris.data,
iris.target, test_size=0.3, random_state=42)

# Train a model
model = RandomForestClassifier()
model.fit(X_train, y_train)
# Predict
y_pred = model.predict(X_test)

# Calculate metrics
accuracy = accuracy_score(y_test, y_pred)
print(f"Model Accuracy: {accuracy:.2f}")

3. Hugging Face Datasets (1 Hour)


Explanation:
Hugging Face Datasets provide a comprehensive library of datasets for NLP tasks, enabling easy
loading, processing, and management.
• Dataset Loading:
• Use the datasets library: This library allows you to load and preprocess datasets
easily.
• Dataset Structure:
• Features and Labels: Understanding how datasets are structured, including features
(input data) and labels (output data).
• Dataset Manipulation:
• Splitting Datasets: Techniques to split datasets into training, validation, and test sets.

Internal Working:
• Efficient Data Handling: The library utilizes Apache Arrow for efficient memory mapping,
which speeds up loading times and reduces memory usage.
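
A small illustration of that point: loaded datasets are backed by memory-mapped Arrow files on disk rather than objects held fully in RAM, and very large datasets can also be streamed; a brief sketch:
python

from datasets import load_dataset

# Standard loading: the data is backed by Apache Arrow files on disk and memory-mapped
dataset = load_dataset("imdb")
print(dataset["train"].cache_files)  # shows the Arrow files backing the split

# Streaming mode iterates over examples without downloading the full dataset up front
streamed = load_dataset("imdb", split="train", streaming=True)
print(next(iter(streamed)))  # fetches just the first example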

Practical Examples:
1. Loading a Dataset:
python

• from datasets import load_dataset

# Load the IMDb dataset


dataset = load_dataset("imdb")
print(dataset)

• Exploring Dataset Features:


python

• # Inspect the dataset


print(dataset['train'][0]) # Show the first example in the training set

• Splitting Dataset:
python

• # Split dataset into train and test


train_test = dataset['train'].train_test_split(test_size=0.2)
train_dataset = train_test['train']
test_dataset = train_test['test']

• Dataset Processing:
python

4. # Tokenize the dataset


from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('distilbert-base-uncased')

def tokenize_function(examples):
    return tokenizer(examples['text'], padding='max_length', truncation=True)

tokenized_datasets = dataset['train'].map(tokenize_function, batched=True)

4. Code Documentation in Hugging Face (30 Minutes)


Explanation:
Proper documentation in code is essential for clarity and maintainability, especially when working
with complex libraries like Hugging Face.
• Docstrings:
• Definition: Use docstrings to describe classes, methods, and functions. They should
include parameters, return types, and usage examples.
• Inline Comments:
• Definition: Use comments to explain specific lines of code for clarity.

Internal Working:
• Documentation Standards: Adhere to standards like PEP 257 for Python, ensuring
consistent documentation style.

Practical Examples:
1. Using Docstrings:
python

• class SentimentModel:
"""
A class to perform sentiment analysis using a pre-trained model.

Attributes:
model_name (str): The name of the pre-trained model.
model (Pipeline): The Hugging Face pipeline for sentiment analysis.
"""

def __init__(self, model_name):


"""
Initializes the SentimentModel with a specified model name.

Parameters:
model_name (str): The name of the pre-trained model to load.
"""
self.model_name = model_name
self.model = self.load_model()

def load_model(self):
"""Loads the pre-trained model and returns it."""
sentiment_model = pipeline("sentiment-analysis")
return sentiment_model

• Inline Comments:
python

• # Load the sentiment analysis model


sentiment_model = pipeline("sentiment-analysis")

• Function Documentation:
python

3. def analyze_sentiment(text):
"""
Analyzes the sentiment of the provided text.

Parameters:
text (str): The text to analyze.

Returns:
dict: A dictionary with sentiment labels and scores.
"""
result = sentiment_model(text)  # Call the model for sentiment analysis
return result

5. Loading & Running a Hugging Face Model in Google Colab (1 Hour)


Explanation:
Google Colab provides an accessible platform for running Hugging Face models in a cloud
environment.
• Setting Up Environment:
• Installing Packages: Ensure you have the necessary libraries.
• Resource Allocation: Colab allows you to allocate GPUs or TPUs for your models.

Internal Working:
• Resource Management: Colab dynamically allocates resources based on demand.
Understanding how to efficiently manage these resources can optimize performance.

Practical Examples:
1. Setting Up Colab Environment:
python
• # Install the transformers library
!pip install transformers

• Loading a Model for Sentiment Analysis:


python

• from transformers import pipeline

# Load the sentiment analysis pipeline


sentiment_analysis = pipeline("sentiment-analysis")

# Analyze sentiment
result = sentiment_analysis("I love using Hugging Face models!")
print(result) # Displays sentiment analysis result

• Using a Model for Text Generation:


python

• from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load pre-trained model and tokenizer


model_name = "gpt2"
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

# Encode and generate text


input_text = "Once upon a time"
input_ids = tokenizer.encode(input_text, return_tensors='pt')
output = model.generate(input_ids, max_length=50)

# Decode and print the output


generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
print(generated_text)

• GPU Utilization Example:


python

import torch

# Check available devices and use a GPU if one is available


device = "cuda" if torch.cuda.is_available() else "cpu"
model = GPT2LMHeadModel.from_pretrained("gpt2").to(device)

# Inference with multi-GPU setup


input_ids = tokenizer.encode("The future of AI is",
return_tensors='pt').to(device)
output = model.generate(input_ids, max_length=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))

6. How to Industrialize Models and Enable ModelOps (1 Hour)


Explanation:
ModelOps refers to the operational practices required to deploy, manage, and scale machine
learning models in production environments.
• Continuous Integration/Continuous Deployment (CI/CD):
• Definition: CI/CD refers to automated processes that help integrate code changes
and deploy models.
• Importance: This approach reduces the risk of errors during updates and allows for
rapid iteration.

Internal Working:
• CI/CD Workflow Steps:
1. Code Repository: Store code in a version control system (e.g., Git).
2. Build Process: Automatically build and test code upon changes.
3. Deployment: Deploy changes to production after passing tests.

Practical Examples:
1. Setting Up a Simple CI/CD Pipeline:
bash

• # Example of a simple CI/CD command using Git


git add .
git commit -m "Update model"
git push origin main # Pushing to main triggers the pipeline defined below

• Building and Testing Process:


yaml

• # Example CI/CD configuration file (e.g., .github/workflows/main.yml)


name: CI/CD Pipeline

on:
  push:
    branches:
      - main

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - name: Checkout code
        uses: actions/checkout@v2

      - name: Set up Python
        uses: actions/setup-python@v2
        with:
          python-version: '3.x'

      - name: Install dependencies
        run: |
          pip install -r requirements.txt

      - name: Run tests
        run: |
          pytest tests/

• Logging Model Predictions:


python
• import logging

# Set up logging
logging.basicConfig(level=logging.INFO)

def log_prediction(input_text, prediction):


"""
Logs the input text and its corresponding prediction.
"""
logging.info(f"Input: {input_text}, Prediction: {prediction}")

# Example usage
log_prediction("I love this product!", "Positive")

• Version Control with Git:


bash

4. # Basic commands for version control


git init # Initialize a new Git repository
git add model.py # Stage changes
git commit -m "Initial commit" # Commit changes
git push origin main # Push to remote repository

7. When and How to Recalibrate, Re-train, Re-build Models (30 Minutes)


Explanation:
Recalibrating, re-training, or re-building models is essential when performance degrades or when
there is a shift in data distribution.
• When to Re-train:
• Performance Drop: Monitor model performance metrics to identify when retraining
is necessary.
• Data Drift: Changes in input data distribution can lead to decreased accuracy.

Internal Working:
• Monitoring Framework: Implement monitoring solutions that automatically trigger
retraining based on defined thresholds.

Practical Examples:
1. Monitoring Performance:
python

• from sklearn.metrics import accuracy_score

def check_retrain(y_true, y_pred):


"""
Check if the model needs retraining based on accuracy.
"""
accuracy = accuracy_score(y_true, y_pred)
if accuracy < 0.80: # Threshold
print("Consider retraining the model due to performance drop.")
• Triggering Retraining:
python

# Example function to trigger retraining
def retrain_model():
    # load_new_data() is a placeholder for your own data-loading routine
    new_data = load_new_data()
    # Retrain the model on the fresh data
    model.fit(new_data.features, new_data.labels)
    print("Model has been retrained.")

8. When Should I Re-train the Model (30 Minutes)


Explanation:
Identifying the right timing for model retraining is crucial for maintaining accuracy.
• Signs to Re-train:
• Performance Monitoring: If performance metrics drop below a defined threshold.
• Change in Input Data: When the data distribution shifts, necessitating adjustments
to the model.

Internal Working:
• Monitoring Strategies: Implement effective monitoring strategies that alert you to
performance degradation.

Practical Example:
1. Performance Monitoring:
python

1. from sklearn.metrics import accuracy_score

def monitor_performance(y_true, y_pred):


"""
Monitor model performance and check if retraining is necessary.
"""
accuracy = accuracy_score(y_true, y_pred)
if accuracy < 0.80: # Example threshold
print("Performance drop detected, consider retraining the model.")

9. Model Retraining Options like PEFT, LoRA (30 Minutes)


Explanation:
Retraining strategies like PEFT (Parameter-Efficient Fine-Tuning) and LoRA (Low-Rank
Adaptation) optimize the training process by adjusting fewer parameters.
• PEFT:
• Definition: Focuses on updating a small number of parameters, making it
computationally efficient.
• LoRA:
• Definition: Decomposes the weight updates into low-rank matrices, allowing
effective training with fewer updates.

Internal Working:
• Optimization Techniques: Understanding how these methods reduce computational load
while maintaining performance.

Practical Example:
1. Using PEFT:
python

from peft import LoraConfig, TaskType, get_peft_model

# Define a parameter-efficient fine-tuning configuration (LoRA is one PEFT method)
peft_config = LoraConfig(task_type=TaskType.SEQ_CLS, r=8, lora_alpha=16, lora_dropout=0.1)

# Wrap a pre-trained model with PEFT so that only the adapter weights are trained
model = get_peft_model(pretrained_model, peft_config)

• Using LoRA:
python

from peft import LoraConfig, get_peft_model

# LoRA decomposes the weight updates into low-rank matrices; r sets their rank
lora_config = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.1)

# Wrap the base model with LoRA adapters
lora_model = get_peft_model(base_model, lora_config)

10. Perform Fine-tuning of an LLM Using PEFT (30 Minutes)


Explanation:
Fine-tuning a model using PEFT allows for effective adaptation of pre-trained models on specific
tasks without extensive resource usage.
• Training Steps:
• Data Preparation: Ensure data is cleaned and formatted.
• Model Configuration: Set hyperparameters for fine-tuning.

Internal Working:
• Training Dynamics: Understand how PEFT updates only a subset of parameters, leading to
faster convergence.

Practical Example:
1. Fine-tuning Example:
python

1. from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
output_dir='./results',
num_train_epochs=3,
per_device_train_batch_size=16,
)

trainer = Trainer(
model=model,
args=training_args,
train_dataset=train_dataset,
)

trainer.train() # Fine-tune the model

11. Small Language Models (30 Minutes)


Explanation:
Small Language Models (SLMs) offer efficient deployment options due to their reduced size while
retaining reasonable performance.
• Advantages:
• Speed: Faster inference times suitable for real-time applications.
• Resource Efficiency: Lower memory footprint allows deployment on constrained
devices.

Internal Working:
• Architecture Optimization: Techniques like quantization and pruning contribute to smaller
models without significant performance loss.

Practical Example:
1. Loading a Small Model:
python

• from transformers import AutoModelForSequenceClassification

# Load a smaller model


model = AutoModelForSequenceClassification.from_pretrained('distilbert-base-uncased')

• Inference with Small Models:


python

• # Example input text


text = "This is a test."

# Perform inference
result = sentiment_analysis(text)
print(result) # Output sentiment prediction

• Quantization Example:
python

import torch
from torch.quantization import quantize_dynamic

# Quantize the model to reduce size and improve speed
quantized_model = quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)



