I'm seeing some errors like this in journalctl:
Sep 13 15:47:46 quercus localsearch-3[1238408]:
(localsearch-extractor-3:1238408): Tracker-WARNING **: 15:47:46.853:
File 'file:///home/dmarti/Documents/CR/Final_report_1_July_2020_.pdf'
took too long to process. Shutting down everything
Sep 13 15:47:46 quercus localsearch-3[1238277]: Extractor subprocess
died unexpectedly: Child process exited with code 1
Sep 13 15:47:53 quercus localsearch-3[1238729]: XML parsing failure
When I look at the status of the file it's showing as:
localsearch status CR/Final_report_1_July_2020_.pdf
URI: file:///home/dmarti/Documents/CR/Final_report_1_July_2020_.pdf
Message: Crash/hang handling file
Is there any way to tell localsearch that this is just a big PDF (437pp)
and it should take its time and go ahead and index the whole thing? (it
seems like the longer the PDF the more that a full-text index of it is a
win for the user/searcher)
--
_______________________________________________
users mailing list -- users@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to users-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/users@xxxxxxxxxxxxxxxxxxxxxxx
Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue