10000 ENH: stats: Add matrix variate t distribution by johnmdusel · Pull Request #22925 · scipy/scipy · GitHub

ENH: stats: Add matrix variate t distribution #22925


Open · wants to merge 55 commits into main

Conversation

@johnmdusel commented May 2, 2025

Reference issue

Closes issue #22870.

What does this implement/fix?

Background explanation is in this post on the developer forum.

Additional information

From the PR checklist:

  • This code can be distributed under a BSD license. I followed the existing random matrix implementation for data validation and wrote the methods from scratch.

  • I provided unit tests in stats.tests.test_multivariate.TestMatrixT. These are mostly similar to the ones for matrix_normal, except that the moment checking and sample checking also perform a spot-check of two basic theorems from Gupta and Nagar (2000). All unit tests pass locally.

  • Re: style: I used black to format my additions to stats._multivariate. For unit tests and benchmarking I followed the style of existing code.

  • I provided benchmarks in stats.MatrixSampling. This also includes benchmarks for rvs methods of matrix_normal and invwishart.

  • All public functions have docstrings with examples, and I've verified that the documentation builds locally and renders correctly.

… that frozen instance checks for singularity of spread matrices.
    - Completed `test_bad_input`
    - Stubs of other tests
    - Adds unit tests for bad input and default inputs and covariance expansion to
      matrix variate t test class.
    - Fixes bugs in matrix variate t distribution which were exposed by those tests.
    - Adds unit tests for array input to matrix variate t test class.

    - Fixes bugs in matrix variate t distribution which were exposed by those tests.

        _logpdf did not expect to receive an array input with > 3 dimensions and its Einstein sum
        is defined based on that expectation. So, if there are > 3 dimensions then _logpdf will
        flatten out the extra ones, perform the calculation, then reshape before returning.
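The flatten-compute-reshape pattern described in that commit message can be sketched as follows. This is a hedged illustration with hypothetical names, not the PR's actual `_logpdf`; the real method uses an Einstein sum where the dummy kernel below just sums entries:

```python
import numpy as np

def _logpdf_batched(x, core_logpdf, p, q):
    """Evaluate a log-PDF whose core computation only handles inputs of
    shape (n, p, q), by flattening any extra leading batch axes first.

    `core_logpdf` is a hypothetical stand-in for an einsum-based
    calculation defined for a single batch dimension.
    """
    x = np.asarray(x)
    batch_shape = x.shape[:-2]         # leading axes beyond the matrix dims
    flat = x.reshape((-1, p, q))       # collapse to a single batch axis
    out = core_logpdf(flat)            # shape (prod(batch_shape),)
    return out.reshape(batch_shape)    # restore the original batch shape

# Example: a dummy "log-pdf" that just sums the entries of each matrix
core = lambda a: a.sum(axis=(-2, -1))
x = np.arange(2 * 3 * 2 * 2, dtype=float).reshape(2, 3, 2, 2)
print(_logpdf_batched(x, core, 2, 2).shape)  # (2, 3)
```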
Covers matrix normal, matrix t, and inverse Wishart.
@github-actions github-actions bot added scipy.stats enhancement A new feature or improvement labels May 2, 2025
@tylerjereddy (Contributor) left a comment

Looks like you made a solid effort to follow our usual practices. I added a few surface-level comments ahead of the review by stats regulars/domain experts.

johnmdusel added 2 commits May 3, 2025 06:13
…t and remove leftover debugging lines in `_multivariate.py`
@johnmdusel (Author) commented

Is this PR still active? Some tests show as failed, but I’m unsure how to resolve them on my end. I’m happy to make any necessary changes to fix them.

@lucascolley (Member) left a comment

The lint failures shown at https://github.com/scipy/scipy/actions/runs/14981775469/job/42087379243?pr=22925 are an easy fix; they are just due to some lines being too long.

@dschmitz89 (Contributor) left a comment

This looks very promising, thank you @johnmdusel . Could you add references to the paper in the docstring?

About tests: how can we evaluate that this is indeed the matrix t distribution? Can we test the PDF method against some external reference? Or is there a special case where this reduces to a distribution we already have implemented?

@johnmdusel (Author) commented

This looks very promising, thank you @johnmdusel . Could you add references to the paper in the docstring?

Yes, I will add the reference to Gupta and Nagar.

About tests: how can we evaluate that this is indeed the matrix t distribution? Can we test the PDF method against some external reference? Or is there a special case where this reduces to a distribution we already have implemented?

When I was developing this class, I tested the rvs, pdf, and logpdf methods against Mathematica's MatrixTDistribution. The matrix_t_gen implemented here does agree with the Mathematica version. I can add some or all of that validation to the TestMatrixT class, if there's interest.
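One cross-check that needs no external software is the scalar special case raised above: for 1 × 1 matrices with unit spread parameters and zero location, the Gupta and Nagar (2000) matrix variate t density reduces to a rescaled univariate Student t. A minimal sketch, where `matrix_t_pdf_1x1` is a hypothetical stand-alone implementation rather than the PR's `matrix_t_gen`:

```python
import numpy as np
from scipy import stats
from scipy.special import gammaln

def matrix_t_pdf_1x1(x, nu):
    """Gupta & Nagar matrix variate t density for p = q = 1, with zero
    location and unit spread: proportional to (1 + x^2)^(-(nu + 1)/2)."""
    log_c = gammaln((nu + 1) / 2) - 0.5 * np.log(np.pi) - gammaln(nu / 2)
    return np.exp(log_c - (nu + 1) / 2 * np.log1p(x**2))

nu = 4.0
x = np.linspace(-3, 3, 7)
# For 1x1 matrices the density equals a rescaled Student t with nu dof:
# f(x) = sqrt(nu) * t_nu.pdf(sqrt(nu) * x)
expected = np.sqrt(nu) * stats.t(df=nu).pdf(np.sqrt(nu) * x)
np.testing.assert_allclose(matrix_t_pdf_1x1(x, nu), expected, rtol=1e-12)
```

The rescaling appears because the Gupta–Nagar parametrization omits the 1/nu factor inside the kernel that the standard Student t carries.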

Motivated by their role in the `rvs` method
@johnmdusel johnmdusel requested a review from dschmitz89 July 2, 2025 13:54
…terface consistency

Post `0dceecb` they need to be `row_spread=1` and `col_spread=1` instead of `None`. This is how they're defined in `matrix_normal_gen.__call__`
@mdhaber (Contributor) commented Jul 3, 2025

Would you have time to also have a look @mdhaber (especially the mathematica reference)?

I'm limited to using Mathematica version "14.2.1 for Linux x86 (64-bit) (March 16, 2025)". When I execute that code, I don't get the same sample or pdf values.

@dschmitz89 (Contributor) commented

Would you have time to also have a look @mdhaber (especially the mathematica reference)?

I'm limited to using Mathematica version "14.2.1 for Linux x86 (64-bit) (March 16, 2025)". When I execute that code, I don't get the same sample or pdf values.

Hm, unfortunate. Can we do a minimal test with a fixed set of matrices to test the pdf values if we cannot rely on the random variates? I do not expect differences in the first few digits between different mathematica versions. @johnmdusel @mdhaber

@johnmdusel (Author) commented Jul 4, 2025

Can we do a minimal test with a fixed set of matrices to test the pdf values if we cannot rely on the random variates?

@dschmitz89 I could set up a minimal Julia script to generate random variates and compute PDF values. Would that be alright, as an alternative to Mathematica? I would provide a Dockerfile and a .jl file and copy-paste the outputs as is currently done in the unit test.

@mdhaber mdhaber closed this Jul 4, 2025
@mdhaber (Contributor) commented Jul 4, 2025

Oops, that was a misclick. I'll see if I can just verify the PDF values @johnmdusel generated.

@mdhaber mdhaber reopened this Jul 4, 2025
@johnmdusel (Author) commented

@dschmitz89 @mdhaber I added a unit test that checks PDF values against Julia. The source is in the test's docstring and in this small repo. All tests pass locally.

@dschmitz89 (Contributor) left a comment

I am happy with this PR and the pdf values were also confirmed against an external source (Mathematica). There is one last stylistic nit regarding tests: we typically use relative tolerances, not absolute ones. Will wait for other reviewers before merging.

@scipy scipy deleted a comment from abuazamatov1919 Jul 10, 2025
@johnmdusel (Author) commented

@rgommers The tests against Julia and Mathematica are now based on a handful of reference values, instead of a file.

@johnmdusel (Author) commented Jul 16, 2025

@dschmitz89 -- questions about the failing CI checks above

  • Linux tests: The test_samples test took too long; it takes more samples for the test to pass with a lower relative tolerance. How would you like me to manage this tradeoff? With 10k samples it will pass with a 5% relative tolerance; with 100k it will pass with 2.5% but will take > 1 second.

  • Windows tests: This seems to be related to HiGHS; can I assume it'll be fixed in some other PR?
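The sample-count/tolerance tradeoff above is generic to Monte Carlo moment checks: the sample mean converges at O(1/sqrt(n)), so halving the relative tolerance needs roughly four times the samples. A hedged sketch using a normal stand-in, not the actual `test_samples` code:

```python
import numpy as np

rng = np.random.default_rng(12345)

def check_sample_mean(draw, true_mean, n, rtol):
    """Monte Carlo moment check: draw n samples and compare their mean
    against the analytic mean at the given relative tolerance."""
    samples = draw(n)
    np.testing.assert_allclose(samples.mean(axis=0), true_mean, rtol=rtol)

# Illustration with a normal stand-in for the distribution's entries:
draw = lambda n: rng.normal(loc=5.0, scale=1.0, size=(n, 2, 2))
check_sample_mean(draw, 5.0, n=10_000, rtol=0.05)    # loose tolerance, fast
check_sample_mean(draw, 5.0, n=100_000, rtol=0.025)  # tighter, ~10x slower
```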

@dschmitz89 (Contributor) commented

@dschmitz89 -- questions about the failing CI checks above

  • Linux tests: The test_samples test took too long; it takes more samples for the test to pass with a lower relative tolerance. How would you like me to manage this tradeoff? With 10k samples it will pass with a 5% relative tolerance; with 100k it will pass with 2.5% but will take > 1 second.

Let's go with the 5% tolerance for simplicity. Sorry about the churn.

  • Windows tests: This seems to be related to HiGHS; can I assume it'll be fixed in some other PR?

That one is already fixed in main.

@johnmdusel (Author) commented

@dschmitz89 -- questions about the failing CI checks above

  • Linux tests: The test_samples test took too long; it takes more samples for the test to pass with a lower relative tolerance. How would you like me to manage this tradeoff? With 10k samples it will pass with a 5% relative tolerance; with 100k it will pass with 2.5% but will take > 1 second.

Let's go with the 5% tolerance for simplicity. Sorry about the churn.

No problem! I made the change.

@johnmdusel (Author) commented

@rgommers The tests against Julia and Mathematica are now based on a handful of reference values, instead of a file.

@rgommers would you be able to approve if no further changes are required? On the other hand, I'm happy to make further changes as needed. Thank you very much!

Successfully merging this pull request may close these issues.

ENH: stats: add matrix t-distribution
7 participants