I’ve noticed some files I opened in a text editor have all kinds of crazy unrenderable chars

  • @cheese_greaterOP
    link
    1
    edit-2
    6 hours ago

    I do understand I’m not able to read it myself, I’m more curious about the architecture of how that data is represented and stored and conceptually how such representation is practically organized/reified…

    • @AbouBenAdhem
      link
      English
      1
      edit-2
      5 hours ago

      The original binary format is broken into six-bit chunks (e.g., 100101), which in decimal format correspond to the integers from 0 to 63. These are just mapped to letters in order:
      000000 (0) = A,
      000001 (1) = B,
      000010 (2) = C,
      etc.—it goes through the capital letters first, then lower-case letters, then digits, then “+” and “/”.

      It’s so simple you could do it by hand from the above description, if you were looking at the data in binary format.