doc: PyUnicode_AsUTF8String() fails if string contains surrogates

python · vstinner · Sep 27, 2024 · Sep 26, 2024 · Sep 27, 2024 · Sep 26, 2024
commit 770362ca09284a9b208930515efdb417798b80f9
diff --git a/Doc/c-api/unicode.rst b/Doc/c-api/unicode.rst
@@ -990,6 +990,8 @@ These are the UTF-8 codec APIs:
    object.  Error handling is "strict".  Return ``NULL`` if an exception was
    raised by the codec.
 
+   The function fails if the string contains surrogate characters.
+
 
 .. c:function:: const char* PyUnicode_AsUTF8AndSize(PyObject *unicode, Py_ssize_t *size)
 
@@ -1002,6 +1004,8 @@ These are the UTF-8 codec APIs:
    On error, set an exception, set *size* to ``-1`` (if it's not NULL) and
    return ``NULL``.
 
+   The function fails if the string contains surrogate characters.
+
    This caches the UTF-8 representation of the string in the Unicode object, and
    subsequent calls will return a pointer to the same buffer.  The caller is not
    responsible for deallocating the buffer. The buffer is deallocated and