Presentation Outline
Introduction to SDRAM
Basic SDRAM operation
Memory efficiency
SDRAM controller architecture
Conclusions
Static RAM (SRAM)
SRAM is on-chip memory
Found in the higher levels of the memory hierarchy
Commonly used for caches and scratchpads
Either local to processor or centralized
Local memory has very short access time
Centralized shared memories have intermediate access time
An SRAM cell consists of six transistors
Limits capacity to a few megabytes or less
Dynamic RAM (DRAM)
DRAM was patented in 1968 by Robert Dennard at IBM
Significantly cheaper than SRAM
A DRAM cell has 1 transistor and 1 capacitor vs. 6 transistors for SRAM
A bit is represented by a high or low charge on the capacitor
The charge dissipates due to leakage, hence the term dynamic RAM
Capacity of up to a gigabyte per chip
DRAM is (shared) off-chip memory
Long access time compared to SRAM
Off-chip pins are expensive in terms of area and power
SDRAM bandwidth is scarce and must be efficiently utilized
Found in lower levels of memory hierarchy
Used as remote high-volume storage
The DRAM evolution
Evolution of the DRAM design in the past 15 years
A clock signal was added, making the design synchronous (SDRAM)
The data bus transfers data on both rising and falling edges of the clock (DDR SDRAM)
Second and third generations of DDR memory (DDR2/DDR3) scale to higher clock frequencies (up to 800 MHz)
DDR4 is currently being standardized by JEDEC
Special branches of DDR memories exist for graphics cards (GDDR) and for low-power systems (LPDDR)
SDRAM Architecture
The SDRAM architecture is organized in banks, rows and columns
A row buffer stores a currently active (open) row
The memory interface has a command bus, address bus, and a data bus
Buses are shared between all banks to reduce the number of off-chip pins
A bank is essentially an independent memory, but with shared I/O
Typical values DDR2/DDR3:
4 or 8 banks
8K-65K rows / bank
1K-2K columns / row
4, 8, or 16 bits / column
200-800 MHz
32 MB-1 GB density
Example memory: 16-bit DDR2-400B 64 MB
4 banks
8K rows / bank
1024 columns / row
16 bits / column
800 MB/s peak bandwidth
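As a sanity check on the peak-bandwidth figure, a minimal C sketch using the device parameters from the example above (a DDR interface transfers two words per clock cycle):

```c
#include <stdio.h>

int main(void) {
    /* Example device from the slide: 16-bit DDR2-400 (200 MHz clock). */
    const double clock_hz            = 200e6; /* I/O clock frequency            */
    const int    transfers_per_clock = 2;     /* DDR: data on both clock edges  */
    const int    data_bus_bytes      = 2;     /* 16-bit data bus                */

    double peak_bytes_per_s = clock_hz * transfers_per_clock * data_bus_bytes;

    /* 200e6 * 2 * 2 = 800e6 bytes/s = 800 MB/s peak bandwidth. */
    printf("Peak bandwidth: %.0f MB/s\n", peak_bytes_per_s / 1e6);
    return 0;
}
```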
Presentation Outline
Introduction to SDRAM
Basic SDRAM operation
Memory efficiency
SDRAM controller architecture
Conclusions
Basic SDRAM Operation
Requested row is activated and copied into the row buffer of the bank
Read bursts and/or write bursts are issued to the active row
Programmed burst length (BL) of 4 or 8 words
Row is precharged and stored back into the memory array
Command        Abbr  Description
Activate       ACT   Activate a row in a particular bank
Read           RD    Initiate a read burst to an active row
Write          WR    Initiate a write burst to an active row
Precharge      PRE   Close a row in a particular bank
Refresh        REF   Start a refresh operation
No operation   NOP   Ignores all inputs
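A minimal sketch of this basic operation in C, using the command set from the table; the issue() helper and the printed trace are illustrative stand-ins, not a real controller interface:

```c
#include <stdio.h>

/* Illustrative sketch only: the command set from the table above and the
 * ACT -> RD/WR -> PRE sequence used to service one request.              */
typedef enum { ACT, RD, WR, PRE, REF, NOP } sdram_cmd;
static const char *cmd_name[] = { "ACT", "RD", "WR", "PRE", "REF", "NOP" };

/* Stand-in for driving the command/address buses for one cycle. */
static void issue(sdram_cmd cmd, int bank, int row, int column) {
    printf("%-3s bank=%d row=%d col=%d\n", cmd_name[cmd], bank, row, column);
}

int main(void) {
    /* One read request: activate the row, read a burst, precharge. */
    issue(ACT, 0, 100, 0);   /* copy row 100 into the row buffer of bank 0 */
    issue(RD,  0, 100, 8);   /* read burst of BL words starting at col 8   */
    issue(PRE, 0, 100, 0);   /* write the row back and close it            */
    return 0;
}
```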
Timing Constraints
Timing constraints determine which commands can be scheduled
More than 20 constraints, some of which are inter-dependent
They limit the efficiency of memory accesses
Data appears on the bus only after the precharge, activate, and read/write commands have been issued and their delays have elapsed
Timing constraints get increasingly severe for faster memories
The physical design of the memory core has not changed much
The constraints in nanoseconds stay roughly constant, but the clock period gets shorter
Parameter                  Abbr.  Cycles
ACT to RD/WR               tRCD   3
ACT to ACT (diff. banks)   tRRD   2
ACT to ACT (same bank)     tRAS   12
Read latency               tRL    3
RD to RD                   -      BL/2
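A hedged sketch of how a controller might track one of these constraints (tRCD from the table); a real back-end checks all 20+ constraints, this only shows the bookkeeping pattern:

```c
#include <stdbool.h>
#include <stdio.h>

#define NUM_BANKS 4
#define tRCD 3   /* ACT to RD/WR, in cycles (value from the table above) */

/* Cycle in which the last ACT was issued to each bank (-tRCD = "long ago"). */
static long last_act[NUM_BANKS] = { -tRCD, -tRCD, -tRCD, -tRCD };

/* A RD or WR to a bank is only valid tRCD cycles after its ACT. */
static bool rd_wr_allowed(int bank, long now) {
    return now >= last_act[bank] + tRCD;
}

int main(void) {
    long now = 100;
    last_act[1] = now;                 /* ACT issued to bank 1 at cycle 100 */
    printf("RD at cycle 102 allowed: %d\n", rd_wr_allowed(1, 102)); /* 0 */
    printf("RD at cycle 103 allowed: %d\n", rd_wr_allowed(1, 103)); /* 1 */
    return 0;
}
```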
Pipelined SDRAM access
Multiple banks provide parallelism
SDRAM has separate data and command buses
Commands to different banks are pipelined
Activate, precharge, and data transfer happen in parallel (bank preparation)
This increases efficiency
Figure shows pipelined memory accesses with burst length 8
Presentation Outline
Introduction to SDRAM
Basic SDRAM operation
Memory efficiency
SDRAM controller architecture
Conclusions
Memory Efficiency
Memory efficiency is the fraction of clock cycles with data transfer
Defines the exchange rate between peak bandwidth and net bandwidth
Net bandwidth is the actual useful bandwidth after considering overhead
Five categories of memory efficiency for SDRAM:
Refresh efficiency
Read/write efficiency
Bank efficiency
Command efficiency
Data efficiency
Memory efficiency is the product of these five categories
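A minimal sketch of this exchange rate in C; the five efficiency values are invented example numbers, only the peak bandwidth and the product formula come from the slides:

```c
#include <stdio.h>

int main(void) {
    double peak_mb_s = 800.0;  /* 16-bit DDR2-400 example from earlier      */

    /* Example values only; the real numbers depend on the traffic.         */
    double e_refresh = 0.98;
    double e_rw      = 0.85;
    double e_bank    = 0.80;
    double e_cmd     = 0.97;
    double e_data    = 0.90;

    /* Memory efficiency is the product of the five categories.             */
    double e_total = e_refresh * e_rw * e_bank * e_cmd * e_data;
    printf("Memory efficiency: %.1f%%\n", 100.0 * e_total);
    printf("Net bandwidth:     %.0f MB/s of %.0f MB/s peak\n",
           e_total * peak_mb_s, peak_mb_s);
    return 0;
}
```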
Refresh Efficiency
SDRAM needs to be refreshed regularly to retain data
The DRAM cell contains a leaking capacitor
A refresh command must be issued every 7.8 µs for DDR2/DDR3 SDRAM
All banks must be precharged before the refresh
Data cannot be transferred during refresh
Refresh efficiency is largely independent of traffic
Depends on the density of the memory device (generally 95-99%)
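As a rough, hedged illustration, refresh efficiency can be approximated as 1 - tRFC/tREFI; the refresh cycle time below (127.5 ns, typical for a 1 Gb DDR2 device) is an assumed value for the example:

```c
#include <stdio.h>

int main(void) {
    double tREFI_ns = 7800.0;   /* refresh interval: 7.8 us for DDR2/DDR3   */
    double tRFC_ns  = 127.5;    /* assumed refresh cycle time (1 Gb DDR2)   */

    /* Fraction of time not spent refreshing. */
    double e_refresh = 1.0 - tRFC_ns / tREFI_ns;
    printf("Refresh efficiency: %.1f%%\n", 100.0 * e_refresh); /* ~98.4% */
    return 0;
}
```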
Read / Write Efficiency
Cycles are lost when switching direction of the data bus
Extra NOPs must be inserted between read and write commands
Read/write efficiency depends on traffic
Determined by frequency of read/write switches
Switching too often has a significant impact on memory efficiency
Switching after every burst of 8 words gives 57% r/w efficiency with DDR2-400
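One hedged way to arrive at the 57% figure (a reconstruction, not taken from the slides): with BL = 8 a burst occupies BL/2 = 4 data-bus cycles, so if a direction switch costs about 3 idle cycles per burst on average, the efficiency is 4 / (4 + 3) ≈ 57%:

```c
#include <stdio.h>

int main(void) {
    int    burst_cycles      = 8 / 2; /* BL = 8 words, 2 words/cycle on DDR bus */
    double avg_switch_cycles = 3.0;   /* assumed average turnaround penalty     */

    double e_rw = burst_cycles / (burst_cycles + avg_switch_cycles);
    printf("Read/write efficiency: %.0f%%\n", 100.0 * e_rw); /* ~57% */
    return 0;
}
```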
Bank Efficiency
A bank conflict occurs when a read or write targets an inactive row (row miss)
Significantly impacts memory efficiency
Requires a precharge followed by an activate
Less than 40% bank efficiency if every access is a row miss in the same bank
Bank efficiency depends on traffic
Determined by address of request and memory map
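A hedged back-of-the-envelope check of the "less than 40%" figure, using the ACT-to-ACT (same bank) value of 12 cycles from the timing table: each row miss then delivers only BL/2 = 4 cycles of data per 12-cycle row cycle, roughly 4/12 ≈ 33%:

```c
#include <stdio.h>

int main(void) {
    int data_cycles       = 4;   /* BL = 8 words on a DDR data bus            */
    int act_to_act_cycles = 12;  /* ACT to ACT, same bank (from timing table) */

    /* Every access misses in the same bank: one full row cycle per burst.    */
    double e_bank = (double)data_cycles / act_to_act_cycles;
    printf("Bank efficiency: %.0f%%\n", 100.0 * e_bank); /* ~33% */
    return 0;
}
```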
Command Efficiency
Command bus uses single data rate
Congested if a precharge and an activate are required simultaneously
One command has to be delayed, which may delay data on the bus
Command efficiency depends on traffic
Small bursts reduce command efficiency
Potentially more activate and precharge commands issued
Generally quite high (95-100%)
Data Efficiency
A memory burst accesses the memory in aligned segments of the programmed burst size
Minimum access granularity
Burst length 8 words is 16 B with 16-bit memory and 64 B with 64-bit memory
If data is poorly aligned, an extra segment has to be transferred
Cycles are lost when transferring unrequested data
Data efficiency depends on the application
Smaller requests and bigger burst length reduce data efficiency
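A small illustration with the 16-bit memory from before (16 B minimum access granularity with BL = 8): a 16 B request that straddles a segment boundary forces two full segments onto the bus:

```c
#include <stdio.h>

int main(void) {
    int granularity = 16;   /* bytes per burst: BL = 8 words x 2 B/word     */
    int request     = 16;   /* requested bytes                              */
    int offset      = 8;    /* request starts 8 B into a segment            */

    /* Segments touched: from the first byte to the last byte of the request. */
    int first = offset / granularity;
    int last  = (offset + request - 1) / granularity;
    int transferred = (last - first + 1) * granularity;

    printf("Data efficiency: %.0f%%\n",
           100.0 * request / transferred);  /* 16/32 = 50% */
    return 0;
}
```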
Conclusions on Memory Efficiency
Memory efficiency is highly dependent on traffic
Worst-case efficiency is very low
Every burst targets a different row in the same bank
A read/write switch occurs after every burst
This results in:
Less than 40% efficiency for all DDR2 memories
Efficiency drops as memories become faster (DDR3)
Conclusion
Worst-case efficiency must be prevented!
Presentation Outline
Introduction to SDRAM
Basic SDRAM operation
Memory efficiency
SDRAM controller architecture
Conclusions
A general memory controller architecture
A general controller architecture consists of two parts
The front-end
buffers requests and responses per requestor
schedules one (or more) requests for memory access
is independent of the memory type
The back-end
translates scheduled request(s) into an SDRAM command sequence
is dependent on the memory type
Front-end arbitration
The front-end provides buffering and arbitration
The arbiter can schedule requests in many different ways
Priorities are commonly used to give low-latency access to critical requestors
E.g. a stalling processor waiting for a cache line
Important to prevent starvation of low-priority requestors
Common to schedule fairly in the case of multiple processors
The next request may be scheduled before the previous one has finished
Gives more options to the command generator in the back-end
Scheduled requests are sent to the back-end for memory access
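A minimal front-end arbiter sketch in C, assuming static priorities per requestor plus a simple age threshold to prevent starvation; the structure, field names, and the MAX_WAIT value are illustrative assumptions, not a specific controller design:

```c
#include <stdbool.h>
#include <stdio.h>

#define NUM_REQUESTORS 4
#define MAX_WAIT 64   /* age (in scheduling slots) after which a requestor
                         is boosted to avoid starvation                     */

typedef struct {
    bool pending;   /* does this requestor have a request at its head?    */
    int  priority;  /* lower value = more critical (e.g. a stalling CPU)  */
    int  waited;    /* slots the head request has already waited          */
} requestor;

/* Pick the requestor to schedule next, or -1 if none is pending. */
static int arbitrate(requestor r[NUM_REQUESTORS]) {
    int best = -1;
    for (int i = 0; i < NUM_REQUESTORS; i++) {
        if (!r[i].pending) continue;
        /* Starved requestors jump ahead regardless of priority. */
        int eff_prio  = (r[i].waited >= MAX_WAIT) ? -1 : r[i].priority;
        int best_prio = (best < 0) ? 1 << 30 :
            ((r[best].waited >= MAX_WAIT) ? -1 : r[best].priority);
        if (eff_prio < best_prio) best = i;
    }
    return best;
}

int main(void) {
    requestor r[NUM_REQUESTORS] = {
        { true,  2,  3 },   /* DMA engine                                 */
        { true,  0,  0 },   /* CPU waiting for a cache line               */
        { false, 1,  0 },
        { true,  3, 70 },   /* low priority, but starved past MAX_WAIT    */
    };
    printf("scheduled requestor: %d\n", arbitrate(r)); /* 3 (starved)     */
    return 0;
}
```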
Back-end
The back-end contains a memory map and a command generator
The memory map decodes a logical address into a physical address
The physical address is a (bank, row, column) triple
The decoding can be done in different ways, and the choice affects efficiency
Example: logical addr. 0x10FF00 → memory map → physical addr. (2, 510, 128)
Command generator schedules commands for the target memory
Customized for a particular memory generation
Programmable to handle different timing constraints
Continuous memory map
The memory map decodes a memory address into (bank, row, column)
Decoding is done by slicing the bits in the logical address
Continuous memory map
Map sequential addresses to columns in a row
Switch bank when all columns in the row have been visited
Switch row when all banks have been visited
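A hedged sketch of this bit slicing for the 64 MB example device (4 banks, 8K rows/bank, 1024 columns of 16 bits); the exact field positions vary between controllers, this only shows the row-above-bank-above-column ordering of a continuous map:

```c
#include <stdint.h>
#include <stdio.h>

/* Example geometry: 16-bit DDR2, 4 banks, 8K rows/bank, 1024 columns/row. */
#define BYTE_BITS   1   /* 2 bytes per 16-bit column */
#define COL_BITS   10   /* 1024 columns              */
#define BANK_BITS   2   /* 4 banks                   */
#define ROW_BITS   13   /* 8192 rows (documentation) */

/* Continuous map: sequential addresses walk the columns of one row first,
 * then the same row in the next bank, and only then the next row.         */
static void decode(uint32_t addr, int *bank, int *row, int *col) {
    *col  = (addr >> BYTE_BITS) & ((1 << COL_BITS) - 1);
    *bank = (addr >> (BYTE_BITS + COL_BITS)) & ((1 << BANK_BITS) - 1);
    *row  =  addr >> (BYTE_BITS + COL_BITS + BANK_BITS);
}

int main(void) {
    int bank, row, col;
    /* Last column of row 0 in bank 0 ... */
    decode(2046, &bank, &row, &col);
    printf("addr 2046 -> bank=%d row=%d col=%d\n", bank, row, col);
    /* ... the next sequential access moves to the same row in bank 1. */
    decode(2048, &bank, &row, &col);
    printf("addr 2048 -> bank=%d row=%d col=%d\n", bank, row, col);
    return 0;
}
```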
Continuous memory map
The continuous memory map is very sensitive to locality
Advantage:
Very efficient in the best case
No bank conflicts when reading sequential addresses
10 cycles to issue four read commands with a burst length of 4 words
Disadvantage:
Very inefficient if requesting different rows in the same bank
37 cycles to issue the four read commands
Interleaving memory map
Interleaving memory map
Maps bursts to different banks in an interleaving fashion
The active row in a bank is not changed until all its columns have been visited
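A hedged sketch of an interleaved decoding for the same example device with burst length 8; the field widths are assumptions, the point is that consecutive 16 B bursts land in consecutive banks of the same row:

```c
#include <stdint.h>
#include <stdio.h>

/* Same example geometry, burst length 8: one burst covers 8 columns (16 B).
 * Interleaved map: consecutive bursts go to consecutive banks, so the bank
 * bits sit directly above the burst offset.                                */
#define BYTE_BITS       1
#define BURST_COL_BITS  3   /* 8 columns per burst   */
#define BANK_BITS       2   /* 4 banks               */
#define HI_COL_BITS     7   /* remaining column bits */

static void decode(uint32_t addr, int *bank, int *row, int *col) {
    int lo_col = (addr >> BYTE_BITS) & ((1 << BURST_COL_BITS) - 1);
    *bank      = (addr >> (BYTE_BITS + BURST_COL_BITS)) & ((1 << BANK_BITS) - 1);
    int hi_col = (addr >> (BYTE_BITS + BURST_COL_BITS + BANK_BITS))
                 & ((1 << HI_COL_BITS) - 1);
    *row       =  addr >> (BYTE_BITS + BURST_COL_BITS + BANK_BITS + HI_COL_BITS);
    *col       = (hi_col << BURST_COL_BITS) | lo_col;
}

int main(void) {
    int bank, row, col;
    for (uint32_t addr = 0; addr < 64; addr += 16) {  /* 4 consecutive bursts */
        decode(addr, &bank, &row, &col);
        printf("addr %2u -> bank=%d row=%d col=%d\n",
               (unsigned)addr, bank, row, col);       /* banks 0,1,2,3, row 0 */
    }
    return 0;
}
```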
Interleaving memory map
The interleaving memory map is largely insensitive to locality
Advantage:
Makes extensive use of bank parallelism to hide overhead
Average case and worst case are almost the same
Takes 10 or 11 cycles for the four read commands, depending on locality
Compare to 10 or 37 cycles for the continuous memory map
Disadvantages:
Requires bursts to all banks to be efficient
Solved if requests are large, such as 64 B
Issues many activates and precharges, which increases power consumption
Command generator
Generates and schedules commands for scheduled requests
May work with both requests and commands
Many ways to determine which request to process
Increase bank efficiency
Prefer requests targeting open rows
Increase read/write efficiency
Prefer read after read and write after write
Reduce stall cycles of processor
Always prefer reads, since reads are blocking while writes are often posted
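A hedged sketch of such a preference order (open row first, then same transfer direction, then reads); the scoring weights are illustrative, not the policy of any particular controller:

```c
#include <stdbool.h>
#include <stdio.h>

typedef struct {
    int  bank;
    int  row;
    bool is_read;
} request;

/* Score a candidate request against the current memory state: higher is
 * preferred. The weights are illustrative.                                */
static int score(const request *r, const int open_row[], bool last_was_read) {
    int s = 0;
    if (open_row[r->bank] == r->row)  s += 4; /* row hit: no ACT/PRE needed */
    if (r->is_read == last_was_read)  s += 2; /* no data-bus turnaround     */
    if (r->is_read)                   s += 1; /* reads block the processor  */
    return s;
}

int main(void) {
    int  open_row[4]   = { 510, 7, -1, -1 };  /* rows currently open        */
    bool last_was_read = true;

    request a = { 0, 510, false };  /* row hit, but a write                 */
    request b = { 1, 200, true  };  /* row miss, but a read                 */

    printf("a scores %d, b scores %d\n",
           score(&a, open_row, last_was_read),
           score(&b, open_row, last_was_read));  /* a: 4, b: 3 -> pick a    */
    return 0;
}
```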
Command generator
Generate SDRAM commands without violating timing constraints
Often uses a bank controller to determine the valid commands for each bank
Many possible policies to determine which command to schedule
Precharge policies
Close rows as soon as possible to activate a new one faster
Keep rows open as long as possible to benefit from locality
Command priorities
Read and write commands have high priority, as they put data on the bus
Precharge and activate commands have lower priorities
Algorithms often try to put data on the bus as soon as possible
Microsoft proposes a self-learning memory controller that uses reinforcement learning to do long-term planning
Presentation Outline
Introduction to SDRAM
Basic SDRAM operation
Memory efficiency
SDRAM controller architecture
Conclusions
Conclusions
SDRAM is used as shared off-chip high-volume storage
Cheaper but slower than SRAM
The efficiency of SDRAM is highly variable and depends on
Refresh efficiency, bank efficiency, read/write efficiency, command efficiency, and data efficiency
The controller tries to minimize latency and maximize efficiency
Low latency for critical requestors using priorities
Fairness among multiple processors
High efficiency by reordering requests to fit the memory state
Memory map impacts efficiency
A continuous memory map is good with small requests and good locality
An interleaving memory map is good with large requests and poor locality
k.b.akesson@tue.nl