How do you encode your paper scans?

Atemu · 2 years ago

How do you encode your paper scans?

@[email protected] · edit-2 2 years ago

I’ve never used paperless but just checked it out and it looks pretty neat. My first thought would be to scan documents in a higher resolution, let the OCR happen, then convert the file to a JPEG or something smaller after you’ve extracted the text.

I spent a few minutes looking at their wiki and it looks like it might be possible.

Like I said though, no experience with this software so I’m not sure that’d actually work.

Atemu · 2 years ago

Interesting idea but I think I’d like to retain similar to original quality in case I wanted to redo OCR if/when Paperless’ OCR improves in the future.

@surewhynotlem · 1 year ago

By ‘paperless’, y’all mean this one? https://docs.paperless-ngx.com/

Atemu · 1 year ago

Correct. That’s the currently maintained paperless project.

@surewhynotlem · 1 year ago

Thanks! There’s a very interesting trail of dead projects to follow. But I got ngx working and it’s great so far.

Atemu · 1 year ago

I for one am still waiting for paperless-ngnxn2-next-3.0_hypr.