Support OCR to extract text from images for alt text

Here are some tools that could be used for performing the OCR (there may be better ones out there, this is just what I found quickly/already knew of):

tesseract
- pure javascript library (aka, can be run in the browser)
- Note: some image processing is recommended, such as inverting images which have white text on a dark background, applying a threshold filter to the image, and more. See: Improving the quality of the output
- Server admins should be allowed to choose between
  - tessdata_fast, which is fastest but gives worse results
  - tessdata_best, which is slower but gives better results
- Server admins should be allowed to choose which language(s) they wish to enable, as tesseract supports many. (additional languages require downloading
EasyOCR
- javascript library
  - requires python to be installed
MMOCR
- supports multiple different model types
- written in python (with no js library that I can find), so one of the two is needed:
  - a python web server which performs the OCR via http requests
  - a cli python app which performs the OCR when executed

See also feat: alt text ocr via tesseract.js (!269)

We would probably not consider server-side solutions, to avoid overloading the instances

Support OCR to extract text from images for alt text

What feature would you like implemented?

Why should we add this feature?

Version

Instance

Contribution Guidelines

Designs

Child items ...

Activity