bpo-45829: Specialize BINARY_SUBSCR for getitem implemented in Python. #29592

markshannon · 2021-11-17T10:13:09Z

Specializes BINARY_SUBSCR for classes with a __getitem__ method implemented in Python by making the call directly in the instruction using the same mechanism as CALL_FUNCTION_PY_SIMPLE.

A respectable 1% speedup including a 20% speedup for one benchmark that makes heavy use of __getitem__.

https://bugs.python.org/issue45829

…ted in Python.

brandtbucher

This is really cool! A few notes:

brandtbucher · 2021-11-17T23:14:46Z

Misc/NEWS.d/next/Core and Builtins/2021-11-17-10-14-35.bpo-45829.5Cf6fY.rst

@@ -0,0 +1,2 @@
+Specialize BINARY_SUBSCR for classes with a ``__getitem__`` method


Suggested change

Specialize BINARY_SUBSCR for classes with a ``__getitem__`` method

Specialize :opcode:`BINARY_SUBSCR` for classes with a ``__getitem__`` method

brandtbucher · 2021-11-17T23:16:00Z

Python/ceval.c

+                assert(cache->adaptive.original_oparg == 0);
+                oparg = 0;


Is this just to clarify that the oparg is still unused (even though we use cache entries)?

brandtbucher · 2021-11-17T23:19:16Z

Python/specialize.c

+    int flags = code->co_flags;
+    if (flags & (CO_GENERATOR | CO_COROUTINE | CO_ASYNC_GENERATOR)) {
+        return SPEC_FAIL_GENERATOR;
+        return -1;


Suggested change

return -1;

brandtbucher · 2021-11-17T23:22:48Z

Python/specialize.c

+    if (code->co_nfreevars) {
+        return SPEC_FAIL_FREE_VARS;
+    }
+    return 0;


I don't really like this, since 0 is a valid failure code. Perhaps return -1 on success and check for that instead (it's sort of a weird interface, but whatever).

I'll add a success code, for clarity.

In many other C programs, yes 0 is a failure.
However, in CPython, 0 is usually a success, as -1 (or any negative number in some cases) is a failure.
E.g. PyObject_RichCompareBool return 0 for False, and -1 for an error.

compile.c is an unfortunate counter example, where some functions return 0 for a failure and others return 0 for a success.

brandtbucher · 2021-11-17T23:23:26Z

Python/specialize.c

+_Py_IDENTIFIER(__getitem__);
+
+static int
+function_kind(PyCodeObject *code) {


Suggested change

function_kind(PyCodeObject *code) {

function_spec_fail_kind(PyCodeObject *code) {

brandtbucher · 2021-11-17T23:46:06Z

Python/ceval.c

@@ -4914,7 +4945,7 @@ MISS_WITH_CACHE(LOAD_GLOBAL)
 MISS_WITH_CACHE(LOAD_METHOD)
 MISS_WITH_CACHE(CALL_FUNCTION)
 MISS_WITH_CACHE(BINARY_OP)
-MISS_WITH_OPARG_COUNTER(BINARY_SUBSCR)
+MISS_WITH_CACHE(BINARY_SUBSCR)


I'm guessing it's still worth keeping the logic for oparg counters, just in case we implement a new family that can uses it in the future?

Might as well delete it. It's in the git history.

Fidget-Spinner

I have the same questions as Brandt, but otherwise this LGTM.

While not in pyperformance, I'm excited for code using typing, since that module heavily uses __getitem__ (and __class_getitem__). Awesome!

markshannon · 2021-11-18T10:23:07Z

I have the same questions as Brandt, but otherwise this LGTM.

While not in pyperformance, I'm excited for code using typing, since that module heavily uses __getitem__ (and __class_getitem__). Awesome!

Type hints are run-once code (at least they should be) so won't be affected.

Fidget-Spinner · 2021-11-18T12:56:19Z

Type hints are run-once code (at least they should be) so won't be affected.

Agreed that it's true for the vast majority. But recently I noticed variable annotations sometimes appear in hot code paths and they might be affected.

…thon. (pythonGH-29592)

Experimental specialization of BINARY_SUBSCR for __getitem__ implemen…

4cbbeee

…ted in Python.

markshannon requested a review from brandtbucher November 17, 2021 10:13

the-knights-who-say-ni added the CLA signed label Nov 17, 2021

bedevere-bot added the awaiting core review label Nov 17, 2021

markshannon added 3 commits November 17, 2021 10:14

Add NEWS

1c5a9c0

Make function static.

2ad803b

Make sure we check type version *before* accessing the function.

4eda0e8

brandtbucher approved these changes Nov 17, 2021

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting core review labels Nov 17, 2021

Fidget-Spinner approved these changes Nov 17, 2021

View reviewed changes

markshannon added 3 commits November 18, 2021 09:11

Fix formatting in news item, and clarify code a bit.

02f736c

Delete unused code.

d76c386

Merge branch 'main' into specialize-for-getitem

8da92e3

markshannon merged commit 21fa7a3 into python:main Nov 18, 2021

bedevere-bot removed the awaiting merge label Nov 18, 2021

remykarem pushed a commit to remykarem/cpython that referenced this pull request Dec 7, 2021

bpo-45829: Specialize BINARY_SUBSCR for __getitem__ implemented in Py…

ff986b0

…thon. (pythonGH-29592)

Fidget-Spinner mentioned this pull request Apr 5, 2022

bpo-47189: What's New in 3.11: Faster CPython #32235

Merged

1 task

markshannon mentioned this pull request Sep 13, 2022

Remove C stack use by specializing BINARY_SUBSCR, STORE_SUBSCR, LOAD_ATTR, and STORE_ATTR #89987

Closed

markshannon deleted the specialize-for-getitem branch September 26, 2023 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-45829: Specialize BINARY_SUBSCR for getitem implemented in Python. #29592

bpo-45829: Specialize BINARY_SUBSCR for getitem implemented in Python. #29592

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		@@ -0,0 +1,2 @@
		Specialize BINARY_SUBSCR for classes with a ``__getitem__`` method

	Specialize BINARY_SUBSCR for classes with a ``__getitem__`` method
	Specialize :opcode:`BINARY_SUBSCR` for classes with a ``__getitem__`` method

	function_kind(PyCodeObject *code) {
	function_spec_fail_kind(PyCodeObject *code) {

Uh oh!

bpo-45829: Specialize BINARY_SUBSCR for __getitem__ implemented in Python. #29592

bpo-45829: Specialize BINARY_SUBSCR for __getitem__ implemented in Python. #29592

Uh oh!

Conversation

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bpo-45829: Specialize BINARY_SUBSCR for getitem implemented in Python. #29592

bpo-45829: Specialize BINARY_SUBSCR for getitem implemented in Python. #29592