8000 [3.14] gh-132762: Fix underallocation bug in `dict.fromkeys()`(gh-133627) by miss-islington · Pull Request #133685 · python/cpython · GitHub
[go: up one dir, main page]

Skip to content

[3.14] gh-132762: Fix underallocation bug in dict.fromkeys()(gh-133627) #133685

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 8, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
gh-132762: Fix underallocation bug in dict.fromkeys()(gh-133627)
The function `dict_set_fromkeys()` adds elements of a set to an existing
dictionary. The size of the expanded dictionary was estimated with
`PySet_GET_SIZE(iterable)`, which did not take into account the size of the
existing dictionary.
(cherry picked from commit 421ba58)

Co-authored-by: Angela Liss <59097311+angela-tarantula@users.noreply.github.com>
  • Loading branch information
angela-tarantula authored and miss-islington committed May 8, 2025
commit 460aedaf9e661be94cd38b683c86344cd8d7a5d2
23 changes: 20 additions & 3 deletions Lib/test/test_dict.py
Original file line number Diff line number Diff line change
Expand Up @@ -338,17 +338,34 @@ def __setitem__(self, key, value):
self.assertRaises(Exc, baddict2.fromkeys, [1])

# test fast path for dictionary inputs
res = dict(zip(range(6), [0]*6))
d = dict(zip(range(6), range(6)))
self.assertEqual(dict.fromkeys(d, 0), dict(zip(range(6), [0]*6)))

self.assertEqual(dict.fromkeys(d, 0), res)
# test fast path for set inputs
d = set(range(6))
self.assertEqual(dict.fromkeys(d, 0), res)
# test slow path for other iterable inputs
d = list(range(6))
self.assertEqual(dict.fromkeys(d, 0), res)

# test fast path when object's constructor returns large non-empty dict
class baddict3(dict):
def __new__(cls):
return d
d = {i : i for i in range(10)}
d = {i : i for i in range(1000)}
res = d.copy()
res.update(a=None, b=None, c=None)
self.assertEqual(baddict3.fromkeys({"a", "b", "c"}), res)

# test slow path when object is a proper subclass of dict
class baddict4(dict):
def __init__(self):
dict.__init__(self, d)
d = {i : i for i in range(1000)}
res = d.copy()
res.update(a=None, b=None, c=None)
self.assertEqual(baddict4.fromkeys({"a", "b", "c"}), res)

def test_copy(self):
d = {1: 1, 2: 2, 3: 3}
self.assertIsNot(d.copy(), d)
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
:meth:`~dict.fromkeys` no longer loops forever when adding a small set of keys to a large base dict. Patch by Angela Liss.
7 changes: 4 additions & 3 deletions Objects/dictobject.c
Original file line number Diff line number Diff line change
Expand Up @@ -3178,9 +3178,10 @@ dict_set_fromkeys(PyInterpreterState *interp, PyDictObject *mp,
Py_ssize_t pos = 0;
PyObject *key;
Py_hash_t hash;

if (dictresize(interp, mp,
estimate_log2_keysize(PySet_GET_SIZE(iterable)), 0)) {
uint8_t new_size = Py_MAX(
estimate_log2_keysize(PySet_GET_SIZE(iterable)),
DK_LOG_SIZE(mp->ma_keys));
if (dictresize(interp, mp, new_size, 0)) {
Py_DECREF(mp);
return NULL;
}
Expand Down
Loading
0