OCR Tools for Uni and Research Notekeeping

its_me_xiphos ( @its_me_xiphos@beehaw.org ) · 6 months ago

OCR Tools for Uni and Research Notekeeping

Samsy ( @Samsy@lemmy.ml ) · 6 months ago

I work in a digitalisation environment, we use OCR in different ways, sometimes with tesseract and sometimes with adobe. Both are differently effective. Tesseract needs training and adobe has mostly a propetary better recognition. Handwriting is mostly a special part which needs manual control.

In my private environment I use a mix with paperless-ngx (which only does tesseract-ocr if it doesn’t is already OCR recognised). Paperless is able to change and export the output of the PDFs in a json database which I partly convert to trilium (a database based notebook).

Didn’t found a better solution yet and it isn’t mostly not handwritten.

its_me_xiphos ( @its_me_xiphos@beehaw.org ) · 6 months ago

I have some reading and learning to do, and I appreciate your reply.