Can I save images to Elasticsearch?

Can I store images in Elasticsearch clusters? If so, is there a resource about the worker thread? I have checked the following link: https://github.com/kzwang/elasticsearch-image

Since we have to handle large image files (over 500GB), we plan on using HDFS.

+3


source to share


1 answer


Saving entire images in Elasticsearch will not be very useful because if the image is scaled / cropped and then used as a query, it gives incorrect results. What you need depends on why you want to index these images.

In my case, I need to find if the image, after some scaling or cropping, has a close match in my database. I am fetching local descriptors (SIFT / SURF) of images and using them to create an Elasticsearch index. This will reduce the size of the image index, instead of storing the entire image, only a few functions are saved. I will now save all of these images to S3 and Elasticsearch will store the IDs for these images along with the functions extracted from them.

Regarding elasticsearch-image: this plugin hasn't been updated in a while and the most recent questions have been answered from last year. This plug-in integrates LIRE with Elasticsearch, where LIRE provides functionality extractor fingerprint multiple images.



Possible solutions:

  • Integrate the OpenCv library ( for computing object vectors for an image ) and Elasticsearch and create your own index using these image functions instead of storing the whole image. For product architecture, you can get some hints here .

  • Use an older version of Elasticsearch with a compatible version of the elasticsearch image.

  • Update your elasticsearch image to work with the latest Elasticsearch.

  • You can also use SOLR along with LireSolr to integrate with the LireSolr library.

+5


source







All Articles