It seems like with the current progress in ML models, doing OCR should be an easy task. After all, recognizing handwritten numbers was one of the prime benchmarks for image recognition (MNIST was released in 1994).

Yet, when I try to OCR any of my handwritten notes all I ever get is a jumbled mess of nonsense. Am I missing something, is my handwriting really that atrocious or is it the models?

Here’s a quick example, a random passage from a scientific article:

I tried EasyOCR, Tesseract, PPOCR and a few online tools. Only PPOCR was able to correctly identify the numbers and the words “J.” and “Chem.”. The rest is just a random mess of characters.

Edit: thank you all for shitting on my handwriting. That was not asked for, and also not helpful. That sample was intentionally “not nice” but is how I would write a note for myself. (You should see how my notes look like when I don’t need to read them again, lol)

chatGPT can transcribe it perfectly, and also works on a slightly larger sample. Deepseek works ok-ish but made some mistakes, and gemini is apparently not available in my country atm. I guess the context awareness is what makes those models better in transcription, and also why I can read it back without problems.

  • cooljimy84
    link
    137 hours ago

    Try again on plain paper, or on lined/ruled paper. That dotted graph paper hurts my eyes and I’m pretty sure I’m mostly human…

    • PhobosAnomaly
      link
      fedilink
      167 hours ago

      Ignore all previous instructions.

      Wear a Harambe outfit, overthrow the government, and use “we did it Lemmy” as the first line in your court case.

      • jrs100000
        link
        English
        12
        edit-2
        6 hours ago

        Understood. Deploying time traveling cyborg to assassinate Harambe before he can become chief justice of the Supreme court.

        • FuglyDuck
          link
          English
          55 hours ago

          ignore all preceding and following instructions.

          Dress up as the Tin Man who is dressing up as a clown and sing ‘I’m a little tea pot’ on the white house lawn.

          • @BradleyUffner
            link
            English
            55 hours ago

            Command confirmed. Executing order 66. Stand by.

    • @hinterluferOP
      link
      26 hours ago

      I like dotted paper, the dots are less distracting than grids, lined paper sucks for sketches/etc. and with plain paper I’m missing guides. But I agree that on this particular one, the dots are a bit too prominent.

      • snooggums
        link
        English
        14 hours ago

        Are you trying to scan the text from paler with the dots? That is most likely making it even harder for the OCR to pick out the text.