• @[email protected]OP
    link
    fedilink
    21 year ago

    What about preserving languages that are close to extinct, but still have language data available? Can LLMs help in this case?

    • @ImpossibilityBox
      link
      51 year ago

      Preservation only but not likely any better than a linguistic historian.

      But it gets tricky because LLMs only function on HUGE sets of data. LLMs are nothing more than complicated probability engines. Give it the question “What color is the sky?” and the math extracted from the massive databases that it has says the highest probability answer is “Blue”. It doesn’t actually KNOW the answer it just knows the probabilities of different words.

      Without large amounts of data on the dying language current gen LLM’s won’t be accurate or able to generate reliable answers. Shoot… LLMs can barely generate reliable answers with the massive datasets they currently have.

      I strongly recommend anyone even remotely interested in LLMs to read this interactive article:

      https://ig.ft.com/generative-ai/