8000 Fix dataframe arithmetics for columns having several value buffers (column size is more than 2 Gb) by asmirnov82 · Pull Request #6724 · dotnet/machinelearning · GitHub
[go: up one dir, main page]

Skip to content

Fix dataframe arithmetics for columns having several value buffers (column size is more than 2 Gb) #6724

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Jul 6, 2023

Conversation

asmirnov82
Copy link
Contributor

Arithmetics for columns containg multiple buffers (actualy with size more than 2 Gb) was incorrect. Buffer number was not taken into account

@codecov
Copy link
codecov bot commented Jun 6, 2023

Codecov Report

Merging #6724 (84b7638) into main (8858ab6) will increase coverage by 0.00%.
The diff coverage is 63.15%.

❗ Current head 84b7638 differs from pull request most recent head 6c963d3. Consider uploading reports for the commit 6c963d3 to get more accurate results

Additional details and impacted files
@@           Coverage Diff            @@
##             main    #6724    +/-   ##
========================================
  Coverage   68.89%   68.89%            
========================================
  Files        1216     1216            
  Lines      250915   251067   +152     
  Branches    26259    26259            
========================================
+ Hits       172857   172966   +109     
- Misses      71238    71287    +49     
+ Partials     6820     6814     -6     
Flag Coverage Δ
Debug 68.89% <63.15%> (+<0.01%) ⬆️
production 63.39% <63.15%> (+<0.01%) ⬆️
test 88.90% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
...ata.Analysis/PrimitiveDataFrameColumnArithmetic.cs 48.89% <63.15%> (+0.42%) ⬆️

... and 6 files with indirect coverage changes

@JakeRadMSFT
Copy link
Contributor
JakeRadMSFT commented Jun 23, 2023

Thanks @asmirnov82! Can you merge this into https://github.com/JakeRadMSFT/machinelearning/tree/u/jakerad/generic-math as well?

It's possible this was fixed by other changes in the generic math branch. Can you make sure it meets your needs/add/update tests?

@JakeRadMSFT
Copy link
Contributor

/azp run

@azure-pipelines
Copy link
Azure Pipelines successfully started running 2 pipeline(s).

for (int b = 0; b < left.Buffers.Count; b++)
{
var span = left.Buffers[b].ReadOnlySpan;
var otherSpan = right.Buffers[b].ReadOnlySpan;
for (int i = 0; i < span.Length; i++)
{
ret[i] = (span[i] == otherSpan[i]);
ret[index++] = (span[i] == otherSpan[i]);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Wow - this was really broken.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for catching this.

@asmirnov82
Copy link
Contributor Author

Thanks @asmirnov82! Can you merge this into https://github.com/JakeRadMSFT/machinelearning/tree/u/jakerad/generic-math as well?

It's possible this was fixed by other changes in the generic math branch. Can you make sure it meets your needs/add/update tests?

Hello Jake, currently I am on vacation, will do this, when I return

@JakeRadMSFT
Copy link
Contributor

Thanks @asmirnov82! Can you merge this into https://github.com/JakeRadMSFT/machinelearning/tree/u/jakerad/generic-math as well?
It's possible this was fixed by other changes in the generic math branch. Can you make sure it meets your needs/add/update tests?

Hello Jake, currently I am on vacation, will do this, when I return

Awesome! I merged in all your other changes into the branch.

We're working on creating better branching and process for handling changes between current train (ML.NET 3.0) and next release train (ML.NET 4.0). Hopefully we'll have it figured out by the time you return!

Thanks again for all these contributions!!!

@JakeRadMSFT
Copy link
Contributor

Merged into Generic Math branch!

@JakeRadMSFT JakeRadMSFT merged commit 69eca56 into dotnet:main Jul 6, 2023
@asmirnov82 asmirnov82 deleted the fix_arithmetics branch July 7, 2023 05:53
@ghost ghost locked as resolved and limited conversation to collaborators Aug 6, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants
0