Excerpt:

“Even within the coding, it’s not working well,” said Smiley. “I’ll give you an example. Code can look right and pass the unit tests and still be wrong. The way you measure that is typically in benchmark tests. So a lot of these companies haven’t engaged in a proper feedback loop to see what the impact of AI coding is on the outcomes they care about. Lines of code, number of [pull requests], these are liabilities. These are not measures of engineering excellence.”

Measures of engineering excellence, said Smiley, include metrics like deployment frequency, lead time to production, change failure rate, mean time to restore, and incident severity. And we need a new set of metrics, he insists, to measure how AI affects engineering performance.

“We don’t know what those are yet,” he said.

One metric that might be helpful, he said, is measuring tokens burned to get to an approved pull request – a formally accepted change in software. That’s the kind of thing that needs to be assessed to determine whether AI helps an organization’s engineering practice.

To underscore the consequences of not having that kind of data, Smiley pointed to a recent attempt to rewrite SQLite in Rust using AI.

“It passed all the unit tests, the shape of the code looks right,” he said. It’s 3.7x more lines of code that performs 2,000 times worse than the actual SQLite. Two thousand times worse for a database is a non-viable product. It’s a dumpster fire. Throw it away. All that money you spent on it is worthless."

All the optimism about using AI for coding, Smiley argues, comes from measuring the wrong things.

“Coding works if you measure lines of code and pull requests,” he said. “Coding does not work if you measure quality and team performance. There’s no evidence to suggest that that’s moving in a positive direction.”

  • jaykrown
    link
    fedilink
    arrow-up
    2
    arrow-down
    7
    ·
    1 day ago

    For people looking for jobs it will get more difficult, competition will continue to rise, and anyone not well versed in using AI will be left behind.

    • Korhaka@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      2
      ·
      19 hours ago

      Depends on the job, I do support for a proprietary SaaS product. Wtf use of AI do I have? They are trying to make an AI support agent, that isn’t me using AI, that is AI outright replacing some part of my job and I have no input in the process.

      Using AI to write the ticket feels pointless to me, takes longer to verify its slop and correct it than it does to write a message myself.

      “This is broken and we have raised it with the dev team” isn’t a phrase I need an LLM for.

      • jaykrown
        link
        fedilink
        arrow-up
        1
        arrow-down
        1
        ·
        16 hours ago

        AI is a tool which is great at creating standalone modular solutions. Look at the stuff I’ve built, the best I’ve ever created is the serial number extractor which actually solves a real world business problem. https://masland.tech/