-
Notifications
You must be signed in to change notification settings - Fork 853
Long query with 3 level transversal crashes ArangoDB #10412
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @gustavo80br It could be an OOM issue, then to rule this out would you kindly provide the followings:
Best |
Hi Omar, thanks for your time answering me. I managed to "solve" my problem, by spliting the query. In the first query I get all the Studies, and than in my application I run a for loop and for each item I run another query to get the images. My time is short, but as soon as possible I will get the logs and return to this issue. Thank you one more time! |
Hi @gustavo80br Nice that you were able to work around this somehow For the time being, I am going to close this ticket, feel free to comment at any time here and we will reopen it Best, |
Hi Omar, Follow the dmesg output:
I realize that every time I need to iterate over a very large result set, I get this OOM problem. Arango logs don't output any error, nothing happens there. I reduced my query to the simple as possible. The Images collection has 25 million documents, a simple query just to return all of them trigger the OOM problem.
If I use LIMIT it's OK, but also have problems when the limit is like above:
The bigger the starting point, slower the query. For example this query takes more than 10 seconds, but if the starting point is 0, is will take less than 1ms. This kind of query in a system like Postgres is very simples, just do a SELECT and than iterate over the cursor to get the results. The results will take some time to retrieve, of course, but the the query will not trigger any memory issue. If you can help me understanding how Arango works, I highly appreciate. I coded a lot of my application on Arango and now that I have the real data the queries don't work as expected. I raised memory to 16Gb and it just take more time before the OOM problem, never solve it. Also played with arangod.conf without any improvement. Thanks in advance! |
Use stream cursors, otherwise Arango builds the entire result in memory |
Thank you Graetzer. Will try that. |
Uh oh!
There was an error while loading. Please reload this page.
My Environment
Component, Query & Data
Affected feature:
AQL query using web interface
AQL query (if applicable):
AQL explain (if applicable):
Dataset:
Graph Database. Images have 22mi items. On Study has multiple Series that have multiple Images.
Size of your Dataset on disk:
Steps to reproduce
Problem:
The Query takes eternity to complete, than ArangoDB crashes.
Expected result:
The task could be long, but not expected to have ArangoDB crashing. I'm evaluating the product and unfortunately I cannot make it to work with my dataset, that would be like 10x bigger in production.
The text was updated successfully, but these errors were encountered: