I want to preface this by saying I’m not doubting you, I just don’t know how it works.
Ok, but wouldn’t the training be weighted against older phrases that are no longer used? Or is all training data given equal weight?
Additionally, if the goal is to create bedtime stories or similar, couldn’t the person generating it ask for a more contemporary style? Would that affect the use of that phrase and similar cheesy lines that keep appearing?
I would never use an LLM for creative or factual work, but I use them all the time for code scaffolding, summarization, and rubber ducking. I’m super interested and just don’t understand why they do the things they do.
Those phrases are not common anymore, but they were once very common in the corpus the LLM is trained on (mid-20th-century books).