[Hackweek] Add explain plan to db spans. #2315

antonpirker · 2023-08-21T12:58:01Z

This is a proof of concept of adding the explain plan to db spans. The explain plan will be added to the span in the db.explain_plan data item.

There is a cache to make sure that the explain plan for each db query is only executed ever X seconds and there is also a max number of elements that are cached. To make sure we do not put to much strain on CPU or memory.

Usage:

sentry_sdk.init(
    dsn="...",
    _experiments={
        "attach_explain_plans": {
            "explain_cache_size": 1000,  # Run explain plan for the 1000 most run queries
            "explain_cache_timeout_seconds": 60 * 60 * 24,  # Run the explain plan for each statement only every 24 hours
            "use_explain_analyze": True,  # Run "explain analyze" instead of only "explain"
        }
    }

If you then look at the db span in Sentry.io it looks like this:

sentrivana

One nit: if we add the new experiment to

sentry-python/sentry_sdk/consts.py

Lines 35 to 49 in 692c0e9

    
           Experiments = TypedDict( 
        
               "Experiments", 
        
               { 
        
                   "max_spans": Optional[int], 
        
                   "record_sql_params": Optional[bool], 
        
                   # TODO: Remove these 2 profiling related experiments 
        
                   "profiles_sample_rate": Optional[float], 
        
                   "profiler_mode": Optional[ProfilerMode], 
        
                   "otel_powered_performance": Optional[bool], 
        
                   "transport_zlib_compression_level": Optional[int], 
        
                   "enable_metrics": Optional[bool], 
        
                   "before_emit_metric": Optional[Callable[[str, MetricTags], bool]], 
        
               }, 
        
               total=False, 
        
           )

we should get nicer code completion I assume.

And one thing regarding the cache. If I read it right, once a key is in the cache, it'll never get deleted if it's never accessed again. Is this what we want? Let's say my app does 50 one-time SELECTs at startup which I'm not that interested in, but they'll then occupy the cache, so this will always apply for any future SELECTS

    if len(EXPLAIN_CACHE.keys()) >= explain_cache_size:
        return False

and we won't run any additional explains.

antonpirker · 2023-10-02T06:29:14Z

You are right about the cache! Good catch. (how did I pass our coding interviews? ;-) )
And about the experiments thing: right! will add it to the TypedDict.

sentrivana

LGTM, left a couple of comments.

sentry_sdk/db/explain_plan/__init__.py

Co-authored-by: Ivana Kellyerova <ivana.kellyerova@sentry.io>

antonpirker added 10 commits August 21, 2023 13:15

Add explain plan to db spans

2f8c735

Make sure to run explain analyze only once in a while.

4632218

Added missing files

bba3ba2

Cleanup

9a9bbeb

Added option choose between explain analyze and simply analyze.

fd6b33e

Merge branch 'master' into antopirker/hackweek

f219b3a

Merge branch 'master' into antopirker/hackweek

f70cc30

Fixed imports in Python 2.7

bb64324

Make linter happy

ff2799a

Cleanup

fc6a577

antonpirker marked this pull request as ready for review September 29, 2023 12:23

antonpirker self-assigned this Sep 29, 2023

sentrivana reviewed Sep 29, 2023

View reviewed changes

antonpirker added 2 commits October 2, 2023 08:31

Added new expiremtn to Experiments dict.

461885a

Better cache invalidation

543ef42

sentrivana approved these changes Oct 2, 2023

View reviewed changes

sentry_sdk/db/explain_plan/__init__.py Outdated Show resolved Hide resolved

sentry_sdk/db/explain_plan/__init__.py Outdated Show resolved Hide resolved

sentry_sdk/db/explain_plan/__init__.py Outdated Show resolved Hide resolved

antonpirker and others added 4 commits October 2, 2023 10:48

Update sentry_sdk/db/explain_plan/__init__.py

44a2f9d

Co-authored-by: Ivana Kellyerova <ivana.kellyerova@sentry.io>

Update sentry_sdk/db/explain_plan/__init__.py

dbbb95c

Co-authored-by: Ivana Kellyerova <ivana.kellyerova@sentry.io>

Merge branch 'master' into antopirker/hackweek

acfe004

Calculate expiration time while caching

57cb21a

antonpirker merged commit 2faf03d into master Oct 2, 2023

antonpirker deleted the antopirker/hackweek branch October 2, 2023 09:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Hackweek] Add explain plan to db spans. #2315

[Hackweek] Add explain plan to db spans. #2315

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

	Experiments = TypedDict(
	"Experiments",
	{
	"max_spans": Optional[int],
	"record_sql_params": Optional[bool],
	# TODO: Remove these 2 profiling related experiments
	"profiles_sample_rate": Optional[float],
	"profiler_mode": Optional[ProfilerMode],
	"otel_powered_performance": Optional[bool],
	"transport_zlib_compression_level": Optional[int],
	"enable_metrics": Optional[bool],
	"before_emit_metric": Optional[Callable[[str, MetricTags], bool]],
	},
	total=False,
	)

[Hackweek] Add explain plan to db spans. #2315

[Hackweek] Add explain plan to db spans. #2315

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!