We put the Thing That Can't Do Numbers™ in your spreadsheets

irelephant [he/him]@programming.dev · 2 months ago

We put the Thing That Can't Do Numbers™ in your spreadsheets

bus_factor · 2 months ago

How is looking at ascii values supposed to help when someone prompts it with “calculate the sum of the numbers above”? The whole point is that no matter what kind of prescreening you add to an LLM, people will write prompts which are missed by the screening.

Knock_Knock_Lemmy_In · 2 months ago

How is looking at ascii values supposed to help when someone prompts it with “calculate the sum of the numbers above”?

Because you can check if values input from the spreadsheet are non-numeric.

bus_factor · 2 months ago

There are no values from the spreadsheet in this case. “The numbers above” are just text to the LLM.

They could of course require that optional cell or cell range parameter after the prompt, but that would eliminate some use cases. “Generate some text”, one of the stated use cases in the help text, doesn’t reference any cells.

Also, numbers in Excel aren’t necessarily as clear cut as you make it seem. Excel famously thinks everything is a date, and how number-y must a number be before it isn’t okay?

Not to mention there are other things to do with numbers which don’t require arithmetic. What if someone wants to have Excel translate 34 to “thirty-four”? Or have Excel generate a poem 34 words long? Or whatever else nonsense people might try.

Knock_Knock_Lemmy_In · 2 months ago

There are no values from the spreadsheet in this case.

What hill are you trying to die on here? In the picture, the numbers input are clearly 1,2 and 3.

They could of course require that optional cell or cell range parameter after the prompt,

No. It is trivial to identify if the input is text or value. You don’t need an LMM to do that.

Excel famously thinks everything is a date

That’s only interpreting data entry. Date values are stored in cells as doubles.

Not to mention there are other things to do with numbers which don’t require arithmetic.

Yes these are the use cases where an LLM would add some (questionable) value.

bus_factor · edit-2 2 months ago

The text field says =COPILOT("sum the numbers above"). It doesn’t work that way. Excel does not have any concept of what “above” means here. Those numbers are not used in the calculation whatsoever. To reference those numbers, the field should say =COPILOT("sum the numbers in", A1:A3).

What the user did here was ask the LLM to generate some text based on a text prompt and no other data, and the LLM decided to answer with a string containing only digits.

Knock_Knock_Lemmy_In · 2 months ago

Excel does not have any concept of what “above” means here.

Excel knows the address of the calling function.

bus_factor · 2 months ago

Yes, but “above” just goes into the LLM, which does who-knows-what with it, and certainly isn’t designed to address cells that way. So to the LLM that’s just like any other arbitrary text.

Knock_Knock_Lemmy_In · 2 months ago

Yes, but “above” just goes into the LLM,

I don’t believe even Microsoft are this stupid. The llm will have tools to interrogate the spreadsheet.

TachyonTele@piefed.social · 2 months ago

The COPILOT function only has access to data provided through the context arguments. It does not have access to:
Other data from the workbook

You give them too much credit.

bus_factor · 2 months ago

There would be no way to do that reliably. There’s too much weird stuff people might say to reference things, and the LLM would definitely act on the wrong cells more often than not.

Excel already has a perfectly unambiguous way to provide a specific range of cells, Which is why the =COPILOT() function lets you supply those in the second parameter. I’m assuming they get passed to the LLM as context, likely encoded as a markdown table. LLMs love parsing markdown, apparently.

The user provided no such range of cells, though, so the LLM is most likely seeing none of those other cells, and is just working based on random values from the Internet.