Null-Terminated String - Wikipedia

A null-terminated string is a character array ending with a null character, primarily used in C programming, where the string length is determined by searching for the first NUL. This method has historical roots in early assembly languages and was chosen for its memory efficiency, though it presents limitations such as security vulnerabilities and performance issues. Modern alternatives and improvements have been developed to address these shortcomings, including safer string handling functions and data structures that store string lengths explicitly.

Uploaded by

David Haoyu Sun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views3 pages

Null-Terminated String - Wikipedia

Uploaded by

David Haoyu Sun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Null-terminated string

In computer programming, a null-terminated string is a character string stored as an array

containing the characters and terminated with a null character (a character with an internal value of
zero, called "NUL" in this article, not same as the glyph zero). Alternative names are C string, which
refers to the C programming language and ASCIIZ[1] (although C can use encodings other than
ASCII).

The length of a string is found by searching for the (first) NUL. This can be slow as it takes O(n)
(linear time) with respect to the string length. It also means that a string cannot contain a NUL (there
is a NUL in memory, but it is after the last character, not in the string).

History
Null-terminated strings were produced by the .ASCIZ directive of the PDP-11 assembly languages
and the ASCIZ directive of the MACRO-10 macro assembly language for the PDP-10. These predate
the development of the C programming language, but other forms of strings were often used.

At the time C (and the languages that it was derived from) was developed, memory was extremely
limited, so using only one byte of overhead to store the length of a string was attractive. The only
popular alternative at that time, usually called a "Pascal string" (a more modern term is "length-
prefixed"), used a leading byte to store the length of the string. This allows the string to contain NUL
and made finding the length need only one memory access (O(1) (constant) time), but limited string
length to 255 characters. C designer Dennis Ritchie chose to follow the convention of null-termination
to avoid the limitation on the length of a string and because maintaining the count seemed, in his
experience, less convenient than using a terminator.[2][3]

This had some influence on CPU instruction set design. Some CPUs in the 1970s and 1980s, such as
the Zilog Z80 and the DEC VAX, had dedicated instructions for handling length-prefixed strings.
However, as the null-terminated string gained traction, CPU designers began to take it into account,
as seen for example in IBM's decision to add the "Logical String Assist" instructions to the ES/9000
520 in 1992 and the vector string instructions to the IBM z13 in 2015.[4]

FreeBSD developer Poul-Henning Kamp, writing in ACM Queue, referred to the victory of null-
terminated strings over a 2-byte (not one-byte) length as "the most expensive one-byte mistake"
ever.[5]

Limitations
While simple to implement, this representation has been prone to errors and performance problems.
Null-termination has historically created security problems.[6] A NUL inserted into the middle of a
string will truncate it unexpectedly.[7] A common bug was to not allocate the additional space for the
NUL, so it was written over adjacent memory. Another was to not write the NUL at all, which was
often not detected during testing because the block of memory already contained zeros. Due to the
expense of finding the length, many programs did not bother before copying a string to a fixed-size
buffer, causing a buffer overflow if it was too long.

The inability to store a zero requires that text and binary data be kept distinct and handled by
different functions (with the latter requiring the length of the data to also be supplied). This can lead
to code redundancy and errors when the wrong function is used.

The speed problems with finding the length can usually be mitigated by combining it with another
operation that is O(n) anyway, such as in strlcpy. However, this does not always result in an
intuitive API.

Character encodings
Null-terminated strings require that the encoding does not use a zero byte (0x00) anywhere; therefore
it is not possible to store every possible ASCII or UTF-8 string.[8][9][10] However, it is common to
store the subset of ASCII or UTF-8 – every character except NUL – in null-terminated strings. Some
systems use "modified UTF-8" which encodes NUL as two non-zero bytes (0xC0, 0x80) and thus
allow all possible strings to be stored. This is not allowed by the UTF-8 standard, because it is an
overlong encoding, and it is seen as a security risk. Some other byte may be used as end of string
instead, like 0xFE or 0xFF, which are not used in UTF-8.

UTF-16 uses 2-byte integers and as either byte may be zero (and in fact every other byte is, when
representing ASCII text), cannot be stored in a null-terminated byte string. However, some languages
implement a string of 16-bit UTF-16 characters, terminated by a 16-bit NUL (0x0000).

Improvements
Many attempts to make C string handling less error prone have been made. One strategy is to add
safer functions such as strdup and strlcpy, whilst deprecating the use of unsafe functions such as
gets. Another is to add an object-oriented wrapper around C strings so that only safe calls can be
done. However, it is possible to call the unsafe functions anyway.

Most modern libraries replace C strings with a structure containing a 32-bit or larger length value (far
more than were ever considered for length-prefixed strings), and often add another pointer, a
reference count, and even a NUL to speed up conversion back to a C string. Memory is far larger now,
such that if the addition of 3 (or 16, or more) bytes to each string is a real problem the software will
have to be dealing with so many small strings that some other storage method will save even more
memory (for instance there may be so many duplicates that a hash table will use less memory).
Examples include the C++ Standard Template Library std::string, the Qt QString, the MFC
CString, and the C-based implementation CFString from Core Foundation as well as its Objective-
C sibling NSString from Foundation, both by Apple. More complex structures may also be used to
store strings such as the rope.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Null-terminated_string&oldid=1184745718"

Strings
No ratings yet
Strings
10 pages
String and Character Treatment
No ratings yet
String and Character Treatment
6 pages
Intro to Strings in C/C++
No ratings yet
Intro to Strings in C/C++
16 pages
C String and Data Structures Guide
No ratings yet
C String and Data Structures Guide
8 pages
C String Handling: char*, char[], strcat()
No ratings yet
C String Handling: char*, char[], strcat()
32 pages
String Data Type
No ratings yet
String Data Type
8 pages
String Data Type
No ratings yet
String Data Type
2 pages
Strings Material (2 Classes) PDF
No ratings yet
Strings Material (2 Classes) PDF
14 pages
C String Manipulation Guide
No ratings yet
C String Manipulation Guide
19 pages
01 Strings Fdsfds
No ratings yet
01 Strings Fdsfds
73 pages
Sas14 Bes043
No ratings yet
Sas14 Bes043
5 pages
Strings
No ratings yet
Strings
12 pages
Chapter 3 4
100% (1)
Chapter 3 4
52 pages
C Strings for Safety Programming
No ratings yet
C Strings for Safety Programming
12 pages
Chapter 6 Strings-1
No ratings yet
Chapter 6 Strings-1
63 pages
Intro to Strings & ASCII Codes
No ratings yet
Intro to Strings & ASCII Codes
20 pages
Unit 7 Strings: Structure
No ratings yet
Unit 7 Strings: Structure
15 pages
DS Necessary
No ratings yet
DS Necessary
14 pages
Strings 1
No ratings yet
Strings 1
9 pages
C and C++ String Handling Guide
No ratings yet
C and C++ String Handling Guide
12 pages
Programming in C #3: Characters and Strings
No ratings yet
Programming in C #3: Characters and Strings
24 pages
Strings
No ratings yet
Strings
35 pages
C++ Strings & Pointers Lab Guide
No ratings yet
C++ Strings & Pointers Lab Guide
6 pages
CS107 Exam C Reference Sheet
No ratings yet
CS107 Exam C Reference Sheet
3 pages
Computer Programming Basics
No ratings yet
Computer Programming Basics
25 pages
STRINGS
No ratings yet
STRINGS
12 pages
C Technical Questions and Answers
No ratings yet
C Technical Questions and Answers
17 pages
String PPTs
No ratings yet
String PPTs
14 pages
04 Strings
No ratings yet
04 Strings
31 pages
Ds Unit 2
No ratings yet
Ds Unit 2
35 pages
Strings
No ratings yet
Strings
15 pages
Strings
No ratings yet
Strings
32 pages
String (Computer Science) - Wikipedia
No ratings yet
String (Computer Science) - Wikipedia
16 pages
Turbo C/C++ For Windows 7/8/8.1 and 10 32/64 Bit : Codeplex
100% (1)
Turbo C/C++ For Windows 7/8/8.1 and 10 32/64 Bit : Codeplex
4 pages
Unit 4
No ratings yet
Unit 4
49 pages
Strings
No ratings yet
Strings
31 pages
Absolute C 6th Edition Savitch Solutions Manual
No ratings yet
Absolute C 6th Edition Savitch Solutions Manual
30 pages
C Strings for Engineering Students
No ratings yet
C Strings for Engineering Students
8 pages
2 Strings PDF
No ratings yet
2 Strings PDF
8 pages
C Programming String
No ratings yet
C Programming String
28 pages
C Strings: A Beginner's Guide
100% (1)
C Strings: A Beginner's Guide
11 pages
C Programming: String Basics
No ratings yet
C Programming: String Basics
32 pages
Unit 9 - V1
No ratings yet
Unit 9 - V1
16 pages
Advanced Programming With LCC - Win32
100% (1)
Advanced Programming With LCC - Win32
62 pages
Strings
No ratings yet
Strings
8 pages
Better C Strings, Simply - Hackaday
No ratings yet
Better C Strings, Simply - Hackaday
18 pages
String Handling for Beginners
No ratings yet
String Handling for Beginners
32 pages
C++ Strings and Functions Lecture
No ratings yet
C++ Strings and Functions Lecture
37 pages
CSCI 240 Lecture Notes
No ratings yet
CSCI 240 Lecture Notes
14 pages
Character Arrays: One Dimensional Arrays Strings
No ratings yet
Character Arrays: One Dimensional Arrays Strings
35 pages
Strings in C Language
No ratings yet
Strings in C Language
28 pages
String
No ratings yet
String
16 pages
String
No ratings yet
String
16 pages
x86-64 (AMD64 & Intel64) - Wikipedia
No ratings yet
x86-64 (AMD64 & Intel64) - Wikipedia
31 pages
NTFS Volume Mount Point NTFS卷挂载点 - Wikipedia
No ratings yet
NTFS Volume Mount Point NTFS卷挂载点 - Wikipedia
2 pages
Akhil Pranay Discussion Week12
No ratings yet
Akhil Pranay Discussion Week12
34 pages
1 Colloquium Onyisi Aug 26 2024
No ratings yet
1 Colloquium Onyisi Aug 26 2024
2 pages
Akhil Pranay Week13Discussion
No ratings yet
Akhil Pranay Week13Discussion
30 pages
Week 13 - Kavya's and Prithvi's Annotated Version
No ratings yet
Week 13 - Kavya's and Prithvi's Annotated Version
27 pages
Exam 1 Discussion
No ratings yet
Exam 1 Discussion
11 pages
Week 12 - Kavya's and Prithvi's Annotated Version
No ratings yet
Week 12 - Kavya's and Prithvi's Annotated Version
29 pages
Fa24 Week 2
No ratings yet
Fa24 Week 2
36 pages
Akhil Pranay Fa24 Week1
No ratings yet
Akhil Pranay Fa24 Week1
27 pages
Ruihan Zhao - Google Scholar
No ratings yet
Ruihan Zhao - Google Scholar
2 pages
Crusade - Transcript (Stargate SG-1) GateWorld
No ratings yet
Crusade - Transcript (Stargate SG-1) GateWorld
30 pages
Fa24 Week 9
No ratings yet
Fa24 Week 9
38 pages
"American Billions" Puzzle: Find A 10-Digit Number (No Repeats), Where The First Digits Give A Number Divisible By, For All
No ratings yet
"American Billions" Puzzle: Find A 10-Digit Number (No Repeats), Where The First Digits Give A Number Divisible By, For All
3 pages
Oguzhan Akcin - Google Scholar
No ratings yet
Oguzhan Akcin - Google Scholar
1 page
(Ehrenberg, Siday) The Refractive Index in Electron Optics (1949)
No ratings yet
(Ehrenberg, Siday) The Refractive Index in Electron Optics (1949)
15 pages
(Ruiyuan Chen) Structurable equivalence relations and Lω1ω interpretations
No ratings yet
(Ruiyuan Chen) Structurable equivalence relations and Lω1ω interpretations
55 pages
Assignment Seven
No ratings yet
Assignment Seven
10 pages
Assignment 4 Dig Tech Harith Aqasha 3AVM2
No ratings yet
Assignment 4 Dig Tech Harith Aqasha 3AVM2
3 pages
TIP OR Course Homework 3 Problem 1:: 40 Tons Produced $2,000/ton 30 Tons Max 100 Tons Needed
No ratings yet
TIP OR Course Homework 3 Problem 1:: 40 Tons Produced $2,000/ton 30 Tons Max 100 Tons Needed
1 page
IMAGING Life Cycle and Quality
No ratings yet
IMAGING Life Cycle and Quality
58 pages
HKIMO 2018 G6 - Primary 6
100% (1)
HKIMO 2018 G6 - Primary 6
6 pages
Asm 4
No ratings yet
Asm 4
2 pages
Nanomaterials: Properties & Types
No ratings yet
Nanomaterials: Properties & Types
7 pages
Offshore Structure Repair Prioritization
No ratings yet
Offshore Structure Repair Prioritization
12 pages
Class7 Atmosphere Objective MCQs Final
No ratings yet
Class7 Atmosphere Objective MCQs Final
5 pages
Mathematical Modeling in Industrial Systems
No ratings yet
Mathematical Modeling in Industrial Systems
28 pages
WRM Y6 Autumn b3 Fractions Assessment B
No ratings yet
WRM Y6 Autumn b3 Fractions Assessment B
2 pages
ch03 - Part2 - RealVector v2 (42) - DONE
No ratings yet
ch03 - Part2 - RealVector v2 (42) - DONE
42 pages
Bircher Reglomat Switching Units PDF
100% (2)
Bircher Reglomat Switching Units PDF
18 pages
Bridge Equipment
0% (1)
Bridge Equipment
8 pages
Lampiran 2 LKPD - Id.en
No ratings yet
Lampiran 2 LKPD - Id.en
12 pages
Technical Parts List for MN2-2045
No ratings yet
Technical Parts List for MN2-2045
48 pages
Mammalian Tissues Post-Lab Guide
100% (1)
Mammalian Tissues Post-Lab Guide
5 pages
Horticultural Production Occupational Standard Level 6
No ratings yet
Horticultural Production Occupational Standard Level 6
148 pages
Datex-Ohmeda Tec6+ - Service Manual
No ratings yet
Datex-Ohmeda Tec6+ - Service Manual
90 pages
Unified Thread Standard
No ratings yet
Unified Thread Standard
3 pages
MRAC - 1st Year Bengali
No ratings yet
MRAC - 1st Year Bengali
125 pages
Haemostasis Training Basics
No ratings yet
Haemostasis Training Basics
11 pages
Galil DMC-4000
No ratings yet
Galil DMC-4000
9 pages
PG New Tansche
No ratings yet
PG New Tansche
7 pages
Maths Class X Mock Paper Test 03 For Board Exam 2025 Answers
No ratings yet
Maths Class X Mock Paper Test 03 For Board Exam 2025 Answers
14 pages
IEEE 13 Bus Power System
No ratings yet
IEEE 13 Bus Power System
5 pages
Chapter 1: Functions and Relations: Precalculus, 1st Ed
No ratings yet
Chapter 1: Functions and Relations: Precalculus, 1st Ed
49 pages
PHYSICS - Quiz Bee Reviewer
100% (6)
PHYSICS - Quiz Bee Reviewer
2 pages
(Percent Per Annum) I.28 Interest Rate of Time Deposits in Rupiah by Group of Banks and Type of Maturity
No ratings yet
(Percent Per Annum) I.28 Interest Rate of Time Deposits in Rupiah by Group of Banks and Type of Maturity
36 pages
Backstepping Control of Speed Sensorless Permanent Magnet Synchronous Motor Based On Slide Model Observer
No ratings yet
Backstepping Control of Speed Sensorless Permanent Magnet Synchronous Motor Based On Slide Model Observer
7 pages

Null-Terminated String - Wikipedia

Uploaded by

Null-Terminated String - Wikipedia

Uploaded by

Null-terminated string

In computer programming, a null-terminated string is a character string stored as an array

Retrieved from "https://en.wikipedia.org/w/index.php?title=Null-terminated_string&oldid=1184745718"

You might also like