Breakthrough Technique: Meta-learning for Compositionality

Original:
https://www.nature.com/articles/s41586-023-06668-3

Popular summary:
https://scitechdaily.com/the-future-of-machine-learning-a-new-breakthrough-technique/

How MLC Works
In exploring the possibility of bolstering compositional learning in neural networks, the researchers created MLC, a novel learning procedure in which a neural network is continuously updated to improve its skills over a series of episodes. In an episode, MLC receives a new word and is asked to use it compositionally—for instance, to take the word “jump” and then create new word combinations, such as “jump twice” or “jump around right twice.” MLC then receives a new episode that features a different word, and so on, each time improving the network’s compositional skills.
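The episode structure described above can be sketched in a few lines. This is only a toy illustration of how an episode might be built, not the procedure from the paper: the word pool, the output symbols, the "twice"/"thrice" modifiers, and the helper names `interpret` and `make_episode` are all assumptions made for the example, and no neural network is trained here.

```python
import random

# All names below are hypothetical, chosen only for this illustration.
MODIFIERS = {"twice": 2, "thrice": 3}        # assumed toy modifiers
WORDS = ["jump", "dax", "wif", "lug"]        # assumed pool of "new" words
ACTIONS = ["JUMP", "WALK", "RUN", "LOOK"]    # assumed output symbols


def interpret(command, lexicon):
    """Translate e.g. 'jump twice' into ['JUMP', 'JUMP'] using the episode's lexicon."""
    word, *rest = command.split()
    repeat = MODIFIERS[rest[0]] if rest else 1
    return [lexicon[word]] * repeat


def make_episode(rng):
    """Build one episode: a fresh word-to-action mapping, one study example, some queries."""
    word = rng.choice(WORDS)
    action = rng.choice(ACTIONS)
    lexicon = {word: action}
    # Study example: the new word shown on its own.
    study = [(word, interpret(word, lexicon))]
    # Queries: the same word used in combinations the learner has not been shown.
    queries = [(f"{word} {m}", interpret(f"{word} {m}", lexicon)) for m in MODIFIERS]
    return study, queries


rng = random.Random(0)
for _ in range(3):
    study, queries = make_episode(rng)
    print("study:", study, "| queries:", queries)
```

In a real training loop, a network would presumably be given the study example plus each query input and updated to predict the query output, then move on to the next episode with a different word.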

  • @A_AOP
    11 months ago

    Edit: Please read @[email protected]’s comment before mine.


    Hey folks, I believe this is really big.

    Traditional deep neural network training requires millions of examples and so, despite its great success, is immensely inefficient.

    Now what if these machines could learn as fast as a human, or faster? Well, it seems this is it.

    Look at how disruptive large language models are for many sectors of society. This new technology could accelerate the process exponentially.

    • @TropicalDingdong
      11 months ago

      “Traditional deep neural network training requires millions of examples and so, despite its great success, is immensely inefficient.”

      Is this a limited advancement in training techniques? Right now I’m working on several types of image classification models. How would this be able to help me?

      • @A_AOP
        11 months ago

        Sorry, I just read a lot… but I don’t work in this field.

        • ripcord
          11 months ago

          If you don’t really understand it, why do you believe this is so big?

          • @A_AOP
            11 months ago

            I am not sure what “image classification models” encompasses. I would have to read more to understand it, and I don’t have enough time and energy.
            Yet in the past I have read and understood a few books about neural networks, and this new article in Nature is something else: it’s clear when reading it.
            (also to @[email protected])

            • @TropicalDingdong
              11 months ago

              I mean, is this any different from standard gradient descent with something like Adam as the optimiser?

              That’s my assumption based on the headline. But the quick skim I gave the article seemed to only discuss it in the context of NLP. Not exactly my field of study.

          • @[email protected]
            11 months ago

            Admittedly, they were quoting someone else in the message you responded to. That may have been edited after the fact, but the person they’re quoting did in fact say those words (“this is big”).

            It was I who couldn’t read, as that is not what happened.

    • @[email protected]
      11 months ago

      “Now what if these machines could learn as fast as a human, or faster?”

      What do you mean? It’s already faster than a human’s. It takes years for a person to learn basic language and decades to gain expert knowledge in any field.

      • @A_AOP
        11 months ago

        What is meant here (and said as such in the article) is that humans can learn from a single example, while deep neural networks take thousands or millions of examples to learn.

        • @[email protected]
          11 months ago

          OK, but neural networks can process far more examples per second, so “faster” is not really the right term here.

          • @A_AOP
            11 months ago

            Yes, you are right. And I was hoping for someone more knowledgeable to help clarify this topic.

            Well, I was lucky to get @DigitalMus’s comment in this thread, if you would like to read it.