• @2pt_perversion
    link
    English
    34 hours ago

    I’d love to debate politics with you but first tell me how many r’s are in the word strawberry. (AI models are starting to get that answer correct now though)

      • @2pt_perversion
        link
        English
        1
        edit-2
        1 hour ago

        Over simplification but partly it has to do with how LLMs split language into tokens and some of those tokens are multi-letter. To us when we look for R’s we split like S - T - R - A - W - B - E - R - R - Y where each character is a token, but LLMs split it something more like STR - AW - BERRY which makes predicting the correct answer difficult without a lot of training on the specific problem. If you asked it to count how many times STR shows up in “strawberrystrawberrystrawberry” it would have a better chance.