• Sem
    link
    fedilink
    English
    21 month ago

    If you are doing data processing in pandas CoW allows to avoid of a lot of redundant computations on intermediate steps. Before CoW any data processing in Pandas required manual and careful working with code to avoid the case described in the blog post. To be honest I cannot imagine the case of offloading each result of each operation in the pipeline to the storage…

    • Nomecks
      link
      fedilink
      English
      21 month ago

      So you would be using CoW in-memory in this case?

      • Sem
        link
        fedilink
        English
        11 month ago

        If I already use Pandas for processing my data in-memory, CoW can significantly improve the performance. That was my point.