User Story #11936 (accepted)
Opened 11 years ago
Last modified 9 years ago
Improve search indexing robustness, performance, and reliability
Reported by: | jballanco-x | Owned by: | jballanco-x |
---|---|---|---|
Priority: | major | Milestone: | Asynchronous |
Component: | General | Keywords: | n.a. |
Cc: | atarkowska, ejrozbicki, jamoore, cxallan, jburel | Story Points: | n.a. |
Sprint: | n.a. | Importance: | n.a. |
Total Remaining Time: | 0.0d | Estimated Remaining Time: | n.a. |
Description
Under certain circumstances, the Indexer process can get into an Out Of Memory state and, subsequently, images that have had tags added to them are not findable via a search on the tag.
We suspect that what is occurring in such cases is that the Indexer process is doing far more work than in needs to, that this is exacerbating an (as yet unidentified) memory leak, and that when the Indexer reaches the OOM state it is not gracefully dying. This means that at the very least we need to:
- Improve the performance of the Indexer. Currently, events are processed in order in limited batches. When a large number of updates are made to the same image, this can result in the same image being re-indexed for each batch, instead of only after all the queued-up events related to the image have been seen.
- Ensure that the Indexer can survive an OOM situation and continue making progress in processing pending events.
- Find and eliminate the source of the memory leak. Even when the indexer gets stuck doing too much work, it should not be running out of memory.
Attachments (3)
Change History (11)
Changed 11 years ago by jballanco-x
comment:1 Changed 11 years ago by jburel
- Cc jburel added
comment:2 Changed 11 years ago by jballanco-x
After further investigation, it seems likely that Hibernate is the culprit leading to the heap overflow. It does not appear that anything is leaking, but it does seem that loading items from the database during indexing is exhausting the default 256 MB heap.
There are a few things we will try to resolve the issue:
- Make Hibernate flush the search indexes after each index operation. This should reduce some of the memory overhead during indexing. (See here: http://docs.jboss.org/hibernate/search/3.1/reference/en/html_single/#search-batchindex-indexing)
- Reduce the batch size
- Increase the heap
Additionally, it seems like we should be able to catch threads that die from a heap OOM state and restart them, so that at least the entire Indexer process doesn't eventually hang. Along these lines we will:
- investigate why the thread-pool threads are not recovering from the OOM state
- improve the granularity of updating the Indexer's progress, so we don't have to repeat entire batches
comment:3 Changed 11 years ago by jballanco-x
- Milestone changed from 5.0.0 to 5.0.1
- Priority changed from blocker to critical
Key components have been fixed for 5.0.0 so that search will work in most situations. Remaining fixes are being prepared for 5.0.1. Thus, this story should no longer be blocking 5.0.0 (except for documentation being written).
comment:4 Changed 11 years ago by jballanco-x
- Priority changed from critical to major
- Summary changed from Searching for tags sometimes does not return expected images to Improve search indexing robustness, performance, and reliability
comment:5 Changed 10 years ago by jballanco-x
- Milestone changed from 5.0.1 to 5.0.2
comment:6 Changed 10 years ago by agilo
- Status changed from new to accepted
Updated status, related task in progress
comment:7 Changed 10 years ago by jamoore
- Milestone changed from 5.1.0-m3 to 5.x
I don't see anything else happening for 5.1.0. Pushing to 5.x
comment:8 Changed 9 years ago by jamoore
- Milestone changed from 5.x to Asynchronous
Log of an Indexer getting into an OOM state