In issue #2913510 - Newest version of Tika giving warnings/messages → there was talk about tika warnings about
As mentioned in that issue I got rid of all these warnings (sqlite-jdbc, JBIG2ImageReader and fallback font) by adding "jp2 jpc j2k sqlite3 sqlite db db3" to the "Excluded file extensions" setting.
Maybe we could add "jp2 jpc j2k sqlite3 sqlite db db3" to the default "Excluded file extensions" setting?
An old but similiar issue ( #1083824 - Add jpg to excluded defaults → ) stated:
jpg images can contain meta data that can be indexed as well, and you want to index as much data as possible, so this works as designed.
I think the mentioned embedded extensions have metadata as well which could be indexed.
But since it does not work with the default configuration, I think we should exclude them by default.
Any thoughts?
Closed: works as designed
1.0
Code
Not all content is available!
It's likely this issue predates Contrib.social: some issue and comment data are missing.