Alpha 364 Architecture and HT Protocol

The Alpha 21364, code-named "Marvel", also known as EV7, is a microprocessor developed by Digital Equipment Corporation (DEC) and later by Compaq.

Alpha 364 Architecture

Introduction:
The Alpha 21364, code-named "Marvel", also known as Alpha 364
and EV7, is a microprocessor developed by Digital Equipment
Corporation (DEC), later Compaq Computer Corporation, that
implemented the Alpha instruction set architecture (ISA).
The Alpha 21364 processor provides a high-performance, highly
scalable, and highly reliable network architecture. The router runs at
1.2 GHz and routes packets at a peak bandwidth of 22.4 GB/s. The
network architecture scales up to a 128-processor configuration,
which can support up to four terabytes of distributed Rambus
memory and hundreds of terabytes of disk storage. The distributed
Rambus memory is kept coherent via a scalable, directory-based,
cache coherence scheme.
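As a quick sanity check on these figures, the per-node share of the maximum configuration can be derived from the totals above (the per-node number is computed here, not stated in the source):

```python
# Derived per-node figure for a maximal 21364 system, from the
# totals quoted above (128 processors, 4 TB of distributed Rambus memory).
PROCESSORS = 128
TOTAL_MEMORY_TB = 4

# 4 TB spread evenly across 128 nodes -> 32 GB of local Rambus per node.
memory_per_node_gb = TOTAL_MEMORY_TB * 1024 // PROCESSORS

print(memory_per_node_gb)  # 32
```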
The network also provides a variety of reliability features, such as
per-flit ECC. These features make the 21364 network architecture
well-suited to support communication-intensive server applications.
The main goal of the EV7 was to achieve high memory bandwidth and
low memory latency by incorporating two on-chip RDRAM memory
controllers and a very large 1.5 MB L2 cache.
A second key goal for the processor was scalability.
The EV7's memory bandwidth scales with the addition of more
processors.
Alpha Roadmap
(Figure: lower cost vs. higher performance, 1995-2001, in CMOS
processes from 0.5 µm down to 0.13 µm.)
EV5/333 (21164), EV56/600 (21164), PCA56/533 (21164PC),
PCA57/600 (21164PC), EV6/575 (21264), EV67/750 (21264),
EV68/1000 (21264), EV7/1000 (21364), EV8.
Each generation brought higher integration, higher MHz, or a new core.
(Estimated time for TPC-C.)

21364 Chip Block Diagram

(Figure: a 21264 core with 64K Icache and 64K Dcache; 16 L1 miss
buffers and 16 L1 victim buffers; an on-chip L2 cache with 16 L2
victim buffers; an integrated RAMBUS memory controller with
address-in and address-out paths; and a network interface with
N, S, E, W, and I/O ports.)
We will start with the 21264 core. The number of outstanding
cache block fills will be increased from 8 to 16.
Misses to the L1 caches will first access the L2 cache. Data
will be returned on a 128-byte-wide bus.
References that miss the L2 cache will access the local
memory and return data to the core. Memory locations not
located in the local memory will access the network.
The integrated network interface will route the request to
the appropriate node in the network using one of the 4
ports (N, S, E, and W).
The 21264 core's 8-entry victim buffer is currently used for
both L1 and L2 victims. The new design will increase the
size of the victim buffer to 16 x 64-byte blocks for L1-to-L2
victims. A new 16 x 64-byte victim buffer will be used to
hold victims leaving the L2 cache for the local memory or
the network.
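The lookup cascade described above (L1, then L2, then local memory, then the network) can be sketched as follows. This is a minimal illustrative model with hypothetical names, not the actual hardware logic; caches are modeled as plain dictionaries and the network as an object with a `fetch` method:

```python
# Sketch of the 21364 miss path: L1 miss -> L2 -> local Rambus
# memory -> network. Caches are dicts mapping address -> block;
# `network` is any object with a fetch(addr) method (hypothetical).
def service_load(addr, l1, l2, local_memory, network):
    """Return (data, level_serviced) for a load to addr."""
    if addr in l1:                      # L1 hit: done
        return l1[addr], "L1"
    if addr in l2:                      # L1 miss: filled from L2
        l1[addr] = l2[addr]             # (over the 128-byte-wide bus)
        return l2[addr], "L2"
    if addr in local_memory:            # L2 miss: integrated memory controller
        l2[addr] = local_memory[addr]
        l1[addr] = local_memory[addr]
        return local_memory[addr], "local-memory"
    # Not in local memory: route over one of the N/S/E/W network ports.
    data = network.fetch(addr)
    l2[addr] = data
    l1[addr] = data
    return data, "network"
```

The victim buffers would sit on the eviction paths (L1 to L2, and L2 to memory or network), which this sketch omits for brevity.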

Here is the block diagram of a 12-processor system using
the 2D torus topology.
Each processor may have its own local memory and may
have its own local I/O connection.
It is possible for a processor to operate in the system
without memory or I/O if that is attractive.
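The defining property of the 2D torus is that every node has exactly four neighbors (N, S, E, W), with links wrapping around at the edges. A small sketch of the addressing (the 4x3 grid dimensions are an assumption for the 12-processor example; the source does not give them):

```python
# Neighbor addressing in a 2D torus of cols x rows nodes.
# Edge links wrap around, so every node has all four neighbors.
def torus_neighbors(x, y, cols, rows):
    return {
        "N": (x, (y - 1) % rows),
        "S": (x, (y + 1) % rows),
        "E": ((x + 1) % cols, y),
        "W": ((x - 1) % cols, y),
    }

# Corner node (0, 0) of a 4x3 torus wraps west to column 3 and
# north to row 2.
print(torus_neighbors(0, 0, 4, 3))
```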

Integrated Memory Controller

The chip contains an integrated Direct Rambus memory controller. Direct Rambus provides high data
capacity per pin along with outstanding bandwidth and latency. The pin-to-pin delay for a page hit in the
RDRAM is 30 ns.
The memory controller will provide 6 GB/sec of read or write bandwidth to the core. At 2 GFLOPs, the chip
provides 3 bytes/FLOP of usable memory bandwidth, a significant improvement over current systems.
To reduce memory latency, the memory controller will track 100s of open pages in the RDRAM array.
A directory-based cache coherence protocol is an integral part of the memory controller.
The memory is protected by a single error correct, double error detect ECC code.
The EV7 contains two integrated Direct Rambus (RDRAM) memory controllers. Direct Rambus provides the
highest data rate per pin, with outstanding bandwidth and good access latency.
Direct Rambus:
High data capacity per pin
800 MHz operation
30 ns CAS latency pin to pin
6 GB/sec read or write bandwidth
100s of open pages
Directory-based cache coherence
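The bytes-per-FLOP claim above follows directly from the two quoted figures:

```python
# 6 GB/s of memory controller bandwidth against a 2 GFLOP/s peak
# gives 3 bytes of memory bandwidth per FLOP, as stated above.
read_write_bw_gb_s = 6.0   # memory controller read or write bandwidth
peak_gflops = 2.0          # chip peak floating-point rate

bytes_per_flop = read_write_bw_gb_s / peak_gflops
print(bytes_per_flop)  # 3.0
```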

Integrated L2 Cache

The 1.5 MB, 6-way set-associative L2 cache can read or write 16
bytes/cycle at 1 GHz, resulting in 16 GB/second of read or write
bandwidth. The L2 cache has a 12-cycle load-to-use latency. This
latency is set by the existing control in the core and is used to
significantly reduce the power consumption of the L2 array. The
array is protected by a single error correct, double error detect
ECC code. Errors are corrected on the fly in hardware.
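The L2 figures are mutually consistent, as a quick check shows:

```python
# A 16-byte port clocked at 1 GHz moves 16 GB/s, and a 12-cycle
# load-to-use latency at 1 GHz is 12 ns.
bytes_per_cycle = 16
clock_ghz = 1.0

l2_bandwidth_gb_s = bytes_per_cycle * clock_ghz   # bytes/cycle * Gcycles/s = GB/s
load_to_use_ns = 12 / clock_ghz                   # cycles / (Gcycles/s) = ns

print(l2_bandwidth_gb_s, load_to_use_ns)  # 16.0 12.0
```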

Integrated Network Interface

The integrated network interface allows multiprocessor
systems to be built using a 2D torus topology. Each node is
capable of moving 10 GB/second. Each hop in the network
will take an average of 15 ns.
The network moves data and control packets from the
source to the destination. It does not guarantee ordering.
Adaptive routing of packets allows the network to detect and
avoid hot spots.
Asynchronous clocking between processors removes the
need to distribute a low-skew clock within a large system.
A fifth port provides up to 3 GB/sec of bandwidth to
industry-standard buses: PCI, PCI-X, AGP, and ServerNet, to
name a few.
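The 15 ns per-hop figure lets us estimate average network traversal time for a large system. A rough sketch (the 16x8 arrangement of a 128-processor system is an assumption, and the n/4 average-distance formula holds for even-sided torus rings):

```python
# Average hop count in a cols x rows torus: in a ring of even size n,
# the mean distance to a uniformly random destination is n/4, and the
# two dimensions are independent.
def avg_hops_torus(cols, rows):
    return cols / 4 + rows / 4

HOP_NS = 15  # average per-hop latency quoted above

# A 128-processor system arranged as a 16x8 torus (assumed dimensions):
hops = avg_hops_torus(16, 8)       # 6.0 hops on average
print(hops * HOP_NS)               # average traversal in ns
```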

Alpha 21364 Technology

Design Specifications
0.18 µm CMOS
1000+ MHz
100 watts @ 1.5 volts
3.5 cm2
6-layer metal
100 million transistors
8 million logic
92 million RAM

HyperTransport Protocol

Introduction
HyperTransport (HT) is a technology for
interconnection of computer processors. It is a
bidirectional serial/parallel high-bandwidth,
low-latency point-to-point link that was
introduced on April 2, 2001.
The HyperTransport Consortium is in charge of
promoting and developing HyperTransport
technology.
HyperTransport is best known as the system
bus architecture of modern AMD central
processing units (CPUs) and the associated
Nvidia nForce motherboard chipsets.
HyperTransport has also been used by IBM
and Apple for the Power Mac G5 machines, as
well as a number of modern MIPS systems.

Links and rates

HyperTransport comes in four versions (1.x, 2.0, 3.0, and 3.1)
which run from 200 MHz to 3.2 GHz. It is also a DDR or "double
data rate" connection, meaning it sends data on both the rising
and falling edges of the clock signal. This allows for a maximum
data rate of 6400 MT/s when running at 3.2 GHz. The operating
frequency is auto-negotiated with the motherboard chipset (North
Bridge) in current computing.
HyperTransport supports an auto-negotiated bit width, ranging
from 2 to 32 bits per link; there are two unidirectional links per
HyperTransport bus. With the advent of version 3.1, using full
32-bit links at the full HyperTransport 3.1 specification's
operating frequency, the theoretical transfer rate is 25.6 GB/s
(3.2 GHz x 2 transfers per clock cycle x 32 bits per link) per
direction, or 51.2 GB/s aggregated throughput, making it faster
than most existing bus standards for PC workstations and servers,
as well as most bus standards for high-performance computing
and networking.
Links of various widths can be mixed together in a single system
configuration, as in one 16-bit link to another CPU and one 8-bit
link to a peripheral device. This allows for a wider interconnect
between CPUs and a lower-bandwidth interconnect to peripherals
as appropriate. It also supports link splitting, where a single
16-bit link can be divided into two 8-bit links.

Packet-Orientation
HyperTransport is packet-based, where each packet consists
of a set of 32-bit words, regardless of the physical width of
the link. The first word in a packet always contains a
command field. Many packets contain a 40-bit address. An
additional 32-bit control packet is prepended when 64-bit
addressing is required. The data payload is sent after the
control packet. Transfers are always padded to a multiple of
32 bits, regardless of their actual length.
HyperTransport packets enter the interconnect in segments
known as bit times. The number of bit times required depends
on the link width. HyperTransport also supports system
management messaging, signaling interrupts, issuing probes
to adjacent devices or processors, I/O transactions, and
general data transactions.
There are two kinds of write commands supported: posted
and non-posted. Posted writes do not require a response from
the target. This is usually used for high-bandwidth devices
such as uniform memory access traffic or direct memory
access transfers. Non-posted writes require a response from
the receiver in the form of a "target done" response. Reads
also require a response, containing the read data.
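The padding and bit-time rules described above can be sketched with two small helpers (illustrative names; the exact framing details are in the HyperTransport specification):

```python
# Payloads are padded to a multiple of 32 bits; the number of "bit
# times" to move a packet depends on the physical link width, since
# each bit time transfers link_width bits across the link.
def padded_bits(payload_bits):
    # Round up to the next multiple of 32 bits.
    return (payload_bits + 31) // 32 * 32

def bit_times(packet_bits, link_width_bits):
    return packet_bits // link_width_bits

# Example: a 40-bit field pads to 64 bits, which takes 8 bit times
# on an 8-bit link but only 2 bit times on a 32-bit link.
print(padded_bits(40), bit_times(padded_bits(40), 8))  # 64 8
```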

Frequency Specifications

Implementations
AMD AMD64 and Direct Connect Architecture based CPUs
SiByte MIPS CPUs from Broadcom
PMC-Sierra RM9000X2 MIPS CPU
Raza Thread Processors
Loongson-3 MIPS processor
ht_tunnel from OpenCores project (MPL licence)
ATI Radeon Xpress 200 for AMD Processor
Nvidia nForce chipsets
nForce Professional MCPs (Media and Communication Processor)
nForce 4 series
nForce 500 series
nForce 600 series
nForce 700 series
ServerWorks (now Broadcom) HyperTransport SystemI/O Controllers
HT-2000
HT-2100
The IBM CPC925 and CPC945 PowerPC 970 northbridges, as co-designed and used by Apple in the Power Mac G5[6]
Several open source cores from the HyperTransport Center of
Excellence
