0% found this document useful (0 votes)

30 views79 pages

P5L1. GPU NPU Architecture and Object Detection

CPU GPU NPU

Uploaded by

tobby10120

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views79 pages

P5L1. GPU NPU Architecture and Object Detection

CPU GPU NPU

Uploaded by

tobby10120

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 79

Edge AIoT & Microelectronics (EdgeAIoTM) Engineering Skills for Gifted Students

P5L1.1 Getting Familiar with GPU/NPU Architectures,

Edge Platforms and Developer Tools
& Object Detection
Today’s Lesson

❖ Overview of GPU/NPU architecture, applications and types

❖ Getting familiar with GPU/NPU Hardware tools and platforms
❖ Interfacing and testing GPU/NPU Hardware with camera
❖ Overview of YOLO object detection

❖ Lab session:
○ Object Detection with AI PC and Jetson AGX Orin GPU Kit

2
Intended Learning Outcomes
At the end of this lesson, students are expected to:
1. At the end of this lesson, students are expected to:
2. Understand the architecture, applications, and types of
○ GPU (Graphics Processing Unit) and NPU (Neural Processing Unit)
3. Recognise the role of GPU/NPU in accelerating AI and edge computing tasks
4. Get familiar with GPU/NPU hardware tools and platforms, including Jetson
AGX Orin and Ryzen AI NPU
5. Interface and test GPU/NPU hardware with a camera for practical tasks
6. Understand the basics of YOLO object detection for real-time vision
applications
7. Evaluate the suitability of GPU/NPU platforms for different AI workloads and
deployment scenarios

3
Program Structure

First lesson of
Phase 5

4
Further Reading/Recommended Books
[1] Multicore and GPU Programming, An Integrated Approach. Second Edition
[2] GPU Programming with C++ and CUDA, by Paulo Motta
[3] Efficient Processing of Deep Neural Networks,. by Vivienne Sze, Yu Hsin Chen
[4] https://developer.nvidia.com/embedded/jetpack
[5] https://developer.nvidia.com/embedded/learn/tutorials

2
1 3
5
AMD Ryzen AI
● AMD Ryzen AI features an AMD Ryzen processor core, AMD Radeon integrated
graphics engine, and a dedicated Neural Processing Unit (NPU) based on the AMD
XDNA architecture.
● The NPU is designed to efficiently handle AI tasks, enhancing performance and
enabling new features.

6
What is an NPU?
● A Neural Processing Unit (NPU) is a
specialized processor designed
specifically for AI and Machine Learning
tasks.
● It is optimized to perform AI
computations with outstanding energy
efficiency, making it ideal for
applications such as image recognition,
natural language processing, and other
AI-driven functions that run locally on
your laptop.
● With local processing, you benefit from
next-level privacy.

7
Ryzen AI processor

8
NPU / CPU / GPU → All in one

9
Cores and Memory

10
Architectural

11
Floating point vs Integer

FP64

12
Block FP NPU

13
Versal AI

14
YOLO (You Only Look Once)

15
Object Detection

16
Remember Neural Network?

17
Problem setting

18
YOLO Overview

19
Model Training

20
Model Architecture

21
Model Prediction

22
Suppression and Object Function

23
Remarkable Achievements in AI
• AI complexity dramatically increases and requires more powerful machines

24
AI at the Frontier of Autonomous Machines

25
How Much Autonomy achieved?

26
Nvidia: The AI Computing Company

GPU Computing Visual Computing Artificial Intelligence

27
Nvidia GPU: More than Graphics
• Huge Capital

28
Nvidia GPU: Ecosystem

Amaz Baid eBa Facebo

on u y ok

Flic Goog iQI JD.co

kr le YI m

Microso Netfl Perisco Pintere

ft ix pe st

Qihoo Shaza Skyp Sogo

360 m e u

Tence Twitt Yand Yel

nt er ex p

AI-powered AI-as-a-Service AI for Enterprise AI for Auto >1,500 AI

Consumer Services Startups

29
Various GPU Platforms

30
Why Edge AI?

31
Why Edge AI?: Huge demand

32
Edge AI Smart Industries

33
Edge AI Redefines Robotics

34
JETSON : AI AT THE EDGE
Serge Palaric, VP Sales & Marketing EMEA - Embedded
Nvidia Jetson Family: Edge GPU for AI

• Jetson is a powerful GPU series for edge AI

• They achieve high performance and low power
• Use same architecture
• Varying performance and memory

36
37
Jetson Ecosystem

38
Open Framework Support

39
Nvidia Jetson Family Edge GPU
• Available in a wide range of
performance, power-efficiency, and
form factors.

40
Jetson family Boards

41
Jetson family Comparison

42
Jetson Orin Boards Specifications

43
Example Use Cases

44
Example Use Cases

45
Example Use Cases

46
AI Redefines Robotics

47
Jetson AGX Orin

48
https://www.youtube.com/watch?v=eFgsOeHMAW4&t=2s
Jetson AGX Orin
• Server Class AI Performance at edge
• 275/200 TOPS (INT8)
• Many Peripherals: Wifi, USB, PCIe, DP port
• Provides a giant leap forward for Robotics and
Edge AI.

Customers can now deploy large and complex models to solve problems such as
natural language understanding, 3D perception and multi-sensor fusion.

https://www.youtube.com/watch?v=eFgsOeHMAW4&t=2s 49
Jetson AGX Orin Layout/Parts
Mark. Name Note

0 White LED
1 Power button
2 Force Recovery button
3 Reset button
4 USB Type-C port DFP only
5 DC power jack
6 Ethernet port
7 USB Type-A ports 2x USB 3.2 Gen 2
8 DisplayPort output This is the only
display interface on Jetson AGX Orin Developer Kit
9 USB micro-B port For debug
10 USB Type-C port Also for flashing
(UFP and DFP)
11 40-pin connector
12 USB Type-A ports 2x USB 3.2 Gen 1

https://developer.nvidia.com/embedded/learn/jetson-agx-orin-devkit-user-guide/developer_kit_layout.html 50
Carrier Board Layout

51
Jetson AGX Orin Series System-On-Module

52
Jetson AGX Orin Series System-On-Module

53
Jetson AGX Orin Series System-On-Module

54
Jetson AGX Orin
• Jetson AGX Orin delivers 8X the AI performance of Jetson AGX Xavier AI.

55
https://www.youtube.com/watch?v=eFgsOeHMAW4&t=2s
Jetson AGX Orin Quick Setup

• Connect the display cable (DP cable)

• Connect the Mouth and Keyboards
• Connect the power cable
• Power on the board.
• Follow the instructions to configure the Ubuntu
• This setup only installs Ubuntu, not yet enough for AI

56
https://www.youtube.com/watch?v=eFgsOeHMAW4&t=2s
Jetson SDK for Edge AI

57
Jetson SDK for Edge AI

58
Jetpack 59

NVIDIA SDK accelerates every major framework

59
Jetpack 4.1 Components

60
Jetson Jetpack Supports

61
Compute Demands
It is necessary to accelerate every major framework

62
Accelerate inferencing on the GPU using NVIDIA TensorRT

✔ NVIDIA® TensorRT is an SDK for

high-performance deep learning
inference.

✔ It includes a deep learning inference

optimizer and runtime that delivers low
latency and high throughput for deep
learning inference applications.

✔ With TensorRT, developers can focus

on creating novel AI-powered
applications rather than performance
tuning for inference deployment.

63
Tensor RT
Achieves better performance than onnx and CPU

64
Tensor RT
Achieves better performance than onnx and CPU

65
Tensorrt Compatible Hardware

66
66
Jetson Community: Comprehensive Developer Site

67
Additional learning Opportunities

68
Additional learning Opportunities

69
69
Setting the Jetson environment for AI
✔ Jetson Nano, Orin Nano, and Xavier NX: Use the SD card image installation method
✔ Jetson TX1/TX2, AGX Xavier, and AGX Orin: SDK Manager installation method is
recommended. Default setup flow could also be used.

https://developer.nvidia.com/embedded/learn/jetson-agx-orin-devkit-user-guide/two_ways_to_set_up_software.html
Install Jetson Software with SDK Manager
❖ NVIDIA SDK Manager is an all-in-one tool that bundles developer software
❖ Provides an end-to-end development environment setup solution for NVIDIA
SDKs.
❖ Allows you to flash and setup Jetson from a host PC

1. Create an Nvidia Developer Account

https://developer.nvidia.com/login
2. Download SDK Manager
https://developer.nvidia.com/sdk-manager
3. Run the setup
https://docs.nvidia.com/sdk-manager/install-with-sdkm-jetson/index.html

71
Install Jetson Software with SDK Manager
❖ Need to put the Jetson on Recovery mode
(Press the Recovery button, then reset while holding the recovery button,
release reset, release Recovery button)
❖ Use the lsusb command in the terminal to confirm

72
Install Jetson Software with SDK Manager 73

Follow the guide to complete:

https://docs.nvidia.com/sdk-manager/install-with-sdkm-jetson/index.html
Last week Lab Exercise

Have you Enjoyed?

PYNQ. Adafruit .
74
Lab Exercise
❖ Object detection using AMD AI PC and the Jetson AGX Orin GPU Kit

Jetson
AGX Orin

75
Control your home
• Some can control your home appliance remotely, centralized and
automated

76
Lab Exercise:

Board Username: eeuser

Board Password: eeuser

77
Other Resources
1. https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html
2. https://developer.nvidia.com/embedded-computing
3. https://developer.nvidia.com/embedded/jetpack
4. https://docs.ultralytics.com/guides/nvidia-jetson/
5. https://developer.nvidia.com/embedded/develop/software
6. https://developer.nvidia.com/embedded/learn/tutorials
7. https://docs.ultralytics.com/guides/object-counting/
8. https://dipankarmedh1.medium.com/real-time-object-detection-with-yolo-and-
webcam-enhancing-your-computer-vision-skills-861b97c78993
9. https://docs.nvidia.com/sdk-manager/index.html

78
Happy learning, please ask questions !!!

Purple Modern Futuristic Technology Presentation
No ratings yet
Purple Modern Futuristic Technology Presentation
6 pages
Unveiling The Powerhouses of AI A Comprehensive ST
No ratings yet
Unveiling The Powerhouses of AI A Comprehensive ST
9 pages
Notes
No ratings yet
Notes
29 pages
Transforming Edge Ai With Npus in Microcontrollers
No ratings yet
Transforming Edge Ai With Npus in Microcontrollers
12 pages
Understanding AI Part 2 Inference, Revised
No ratings yet
Understanding AI Part 2 Inference, Revised
4 pages
Nividia and The Gpu Revolution
No ratings yet
Nividia and The Gpu Revolution
14 pages
1.2 The New Disruptive Force in High-End IoT Markets - RSB-3810 (Mediatek Genio 1200) - MediaTek
No ratings yet
1.2 The New Disruptive Force in High-End IoT Markets - RSB-3810 (Mediatek Genio 1200) - MediaTek
17 pages
Lecture 09 Advances in Microcontroller Based System Design
No ratings yet
Lecture 09 Advances in Microcontroller Based System Design
19 pages
Jetson AGX Orin 64GB Datasheet
No ratings yet
Jetson AGX Orin 64GB Datasheet
2 pages
Hardware Accelerators Fueling AI
No ratings yet
Hardware Accelerators Fueling AI
3 pages
The Role of Field-Programmable Gate Arrays in The Acceleration of Modern High - Performance Computing Workloads
No ratings yet
The Role of Field-Programmable Gate Arrays in The Acceleration of Modern High - Performance Computing Workloads
11 pages
Jetson Orin Datasheet Nano Developer Kit 3575392 r24
No ratings yet
Jetson Orin Datasheet Nano Developer Kit 3575392 r24
2 pages
HPC Day 12 ppt-2
No ratings yet
HPC Day 12 ppt-2
139 pages
NVIDIA GPU Evolution: Gaming to AI
100% (1)
NVIDIA GPU Evolution: Gaming to AI
91 pages
Iot Assignment Module 1: Name: Rohit Yadav Roll No: CS19206702 1) Explain SOC / Short Note On SOC Solution
No ratings yet
Iot Assignment Module 1: Name: Rohit Yadav Roll No: CS19206702 1) Explain SOC / Short Note On SOC Solution
10 pages
NVIDIA's AI Stack
No ratings yet
NVIDIA's AI Stack
14 pages
2025 Geniatech Company Profile
No ratings yet
2025 Geniatech Company Profile
33 pages
MAM Unit 1 Notes 3
No ratings yet
MAM Unit 1 Notes 3
20 pages
CPU, GPU, FPGA, ASIC, CUDA Overview
No ratings yet
CPU, GPU, FPGA, ASIC, CUDA Overview
10 pages
48423B Fusion Whitepaper WEB
No ratings yet
48423B Fusion Whitepaper WEB
8 pages
NVIDIA Jetson Notes
No ratings yet
NVIDIA Jetson Notes
3 pages
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
No ratings yet
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
15 pages
Aeroespacial 2020
0% (1)
Aeroespacial 2020
28 pages
EAI - Lecture 2
No ratings yet
EAI - Lecture 2
21 pages
Unlocking The Secrets of The Embedded Systems
No ratings yet
Unlocking The Secrets of The Embedded Systems
9 pages
Onur Stanford SystemXSeminar FutureComputingPlatforms 8 February 2024
No ratings yet
Onur Stanford SystemXSeminar FutureComputingPlatforms 8 February 2024
423 pages
Seatwork For Chapter 1 Module
No ratings yet
Seatwork For Chapter 1 Module
4 pages
Jetson Agx Orindatasheet Update Module Series 2379600 v2
No ratings yet
Jetson Agx Orindatasheet Update Module Series 2379600 v2
2 pages
2019-20 Hanback Electronics Eng Compressed
No ratings yet
2019-20 Hanback Electronics Eng Compressed
308 pages
Jetson Nano Module Datasheet Us 1031771 r3 Web
No ratings yet
Jetson Nano Module Datasheet Us 1031771 r3 Web
2 pages
AI Accelerator
No ratings yet
AI Accelerator
5 pages
Jetson Orin Nano Developer Kit Datasheet
No ratings yet
Jetson Orin Nano Developer Kit Datasheet
2 pages
Report On Gpu
No ratings yet
Report On Gpu
39 pages
Jetson Nano for AI Developers
No ratings yet
Jetson Nano for AI Developers
40 pages
Jetson Orin Datasheet Nano Developer Kit 2659382 r3
No ratings yet
Jetson Orin Datasheet Nano Developer Kit 2659382 r3
2 pages
Chapter 01
No ratings yet
Chapter 01
56 pages
Ecejun 14
No ratings yet
Ecejun 14
49 pages
CUDA
No ratings yet
CUDA
46 pages
A Survey of FPGA-Based Robotic Computing
No ratings yet
A Survey of FPGA-Based Robotic Computing
27 pages
IoT PyqAns - HTML
No ratings yet
IoT PyqAns - HTML
66 pages
Aug 27 2020 Andes RISC V CON Webinar
No ratings yet
Aug 27 2020 Andes RISC V CON Webinar
29 pages
Short Overview For Ai With Model Based Design Week1
No ratings yet
Short Overview For Ai With Model Based Design Week1
11 pages
Nvidia Jetson Agx Orin Technical Brief
No ratings yet
Nvidia Jetson Agx Orin Technical Brief
21 pages
Dev Board Mini Datasheet: Features
No ratings yet
Dev Board Mini Datasheet: Features
14 pages
04 AMD Edge AI TechDay - Singapore - 2024 - FrankWang
No ratings yet
04 AMD Edge AI TechDay - Singapore - 2024 - FrankWang
29 pages
Getting Started With The AMD Robotics Hardware Portfolio - Final v2
No ratings yet
Getting Started With The AMD Robotics Hardware Portfolio - Final v2
38 pages
Nvidia Story
No ratings yet
Nvidia Story
30 pages
MCHP-UK-MEL3272-AI Trends-190889 Final
No ratings yet
MCHP-UK-MEL3272-AI Trends-190889 Final
10 pages
Technological Evolution and AI Revolution
No ratings yet
Technological Evolution and AI Revolution
2 pages
2 - Hardware
No ratings yet
2 - Hardware
29 pages
AI PC Market Analysis 2023
No ratings yet
AI PC Market Analysis 2023
8 pages
Presented by Ragasudha.B Pavitha.P
No ratings yet
Presented by Ragasudha.B Pavitha.P
13 pages
Introduction To GP-GPU and CUDA: High Performance Computing Center Hanoi University of Science & Technology
No ratings yet
Introduction To GP-GPU and CUDA: High Performance Computing Center Hanoi University of Science & Technology
43 pages
Systolic Array Architecture For Educational Use
No ratings yet
Systolic Array Architecture For Educational Use
6 pages
5 Introduction To Huawei AI Platforms v3.5
No ratings yet
5 Introduction To Huawei AI Platforms v3.5
113 pages
20200409riscv Con Online ACE Eng Secured
No ratings yet
20200409riscv Con Online ACE Eng Secured
26 pages
Nvidia Gears Up For Robotic Revolution, Unveils Powerful Ai Chip
No ratings yet
Nvidia Gears Up For Robotic Revolution, Unveils Powerful Ai Chip
4 pages
Nvitu 230307121950 c3b682cc
No ratings yet
Nvitu 230307121950 c3b682cc
24 pages
Lab P4L1.1 Intro To IoT and Edge Computing
No ratings yet
Lab P4L1.1 Intro To IoT and Edge Computing
18 pages
Lab P3L4.1 Convolutional Neural Network
No ratings yet
Lab P3L4.1 Convolutional Neural Network
8 pages
Passion Speaks
No ratings yet
Passion Speaks
1 page
P3L4.1 Convolutional Neural Networks (CNN)
No ratings yet
P3L4.1 Convolutional Neural Networks (CNN)
45 pages
P4L1.1 Intro To AIoT and Edge Computing
No ratings yet
P4L1.1 Intro To AIoT and Edge Computing
56 pages
Lab - P5L1 - Object Detection With Jetson AGX Orin Kit and AMD AI PC - 26 Sep - v3
No ratings yet
Lab - P5L1 - Object Detection With Jetson AGX Orin Kit and AMD AI PC - 26 Sep - v3
31 pages
Sample Interview Questions For Core Java
No ratings yet
Sample Interview Questions For Core Java
15 pages
Detailed Lesson Plan Grade 6 Ict 1 2 1
No ratings yet
Detailed Lesson Plan Grade 6 Ict 1 2 1
13 pages
IO Link
100% (1)
IO Link
16 pages
Ost2 LP Syssec
No ratings yet
Ost2 LP Syssec
1 page
30 Types of Hackers by Sree Charan C
100% (2)
30 Types of Hackers by Sree Charan C
9 pages
Spaghetti Diagram for Efficiency
No ratings yet
Spaghetti Diagram for Efficiency
7 pages
Design Concepts for UX Students
No ratings yet
Design Concepts for UX Students
90 pages
The Object-Oriented Approach To Requirements
No ratings yet
The Object-Oriented Approach To Requirements
51 pages
Edel-EMES60 Datasheet PDF
No ratings yet
Edel-EMES60 Datasheet PDF
6 pages
Cyber Crime and Environmental Laws and Protection
No ratings yet
Cyber Crime and Environmental Laws and Protection
31 pages
Profile HMMs for Bioinformatics
No ratings yet
Profile HMMs for Bioinformatics
36 pages
DBCC Secrets
No ratings yet
DBCC Secrets
56 pages
B.Tech CSBS Year 3 Syllabus
No ratings yet
B.Tech CSBS Year 3 Syllabus
69 pages
2021 AWS Glue Developer Guide
100% (1)
2021 AWS Glue Developer Guide
1,005 pages
LibreOffice Writer & Calc Guide
No ratings yet
LibreOffice Writer & Calc Guide
14 pages
OS Services & Components Guide
No ratings yet
OS Services & Components Guide
37 pages
Application of Disruptive Technologies in Business: Mba (M&S) Section-B Mba (RM) Section-A
No ratings yet
Application of Disruptive Technologies in Business: Mba (M&S) Section-B Mba (RM) Section-A
18 pages
Multi-Core Processor Technology:: Maximizing CPU Performance in A Power-Constrained World
No ratings yet
Multi-Core Processor Technology:: Maximizing CPU Performance in A Power-Constrained World
23 pages
SELinux Reference Policy Guide
No ratings yet
SELinux Reference Policy Guide
5 pages
New Examiner Form
No ratings yet
New Examiner Form
16 pages
Building Data Pipeline With Pentaho Lab Guide
No ratings yet
Building Data Pipeline With Pentaho Lab Guide
18 pages
Digital Technology Scheme of Work Jss2
No ratings yet
Digital Technology Scheme of Work Jss2
2 pages
Export PDF Field Names
100% (1)
Export PDF Field Names
2 pages
B ME User Guide 82
No ratings yet
B ME User Guide 82
82 pages
Malkharoda Nominated
No ratings yet
Malkharoda Nominated
4 pages
Kia7042ap PDF
No ratings yet
Kia7042ap PDF
4 pages
Solutions 131
No ratings yet
Solutions 131
1 page
Alphacam Router 13032012
0% (2)
Alphacam Router 13032012
4 pages
Berthing Aid Systems PDF
No ratings yet
Berthing Aid Systems PDF
11 pages
It Security Dissertation Topics
100% (2)
It Security Dissertation Topics
7 pages