• @pixxelkick
    link
    227 hours ago

    They probably do use lots of NoSQL DBs too, which perform better for non relational “data lake” style architectures where you just wanna dump mountains of data as fast as possible into storage, to be perused later.

    When you have cases where you have very very high volume of data in, but very low need to query it (but some potential need, just very low), nosql DBs excel

    Stuff like census data where you just gotta legally store it for historical reasons, and very rarely some person will wanna query it for a study or something.

    Keep in mind when I talk about low need to query, the opposite high need us on the scale of like, "this db gets queried multiple times per minute’

    Stuff like… logins to a website, data that gets queried many times per minute or even second, then sometimes nosql DBs fall off.

    Depends what is queried.

    Super basic “lookup by ID” Stuff that operates as just a big ole KeyValuePair mapping ID -> Value? And thats all you gotta query?

    NoSql is still the right tool for the job.

    The moment any kind of JOIN enters the discussion though, chances are you actually wanna use sql now

    • @Maggoty
      link
      34 hours ago

      Just so you know census data is very heavily queried. Everything from civil engineering to economics wants to look at that dataset every day.

      • @pixxelkick
        link
        23 hours ago

        Like I said, in the scale compared to actual high frequency data though, that’s still be infrequent.

        High frequency DBs are on the scale of many queried per second

        Even with tonnes of data scientists and engineers querying the data, that’s still in the scale of queries per minute, which is low frequency in the data world.

        • @Maggoty
          link
          11 hour ago

          I wouldn’t put it past them to experience numbers in the per second realm, especially as new data posts and everyone is rushing to grab it.

    • @[email protected]
      link
      fedilink
      226 hours ago

      So you’re saying Relational DataBase Management Systems do really well as soon as Relations are involved?

      • Morphit
        link
        fedilink
        54 hours ago

        And Structured Query Language is a handy language for querying structured data?

      • @pixxelkick
        link
        13 hours ago

        Eyup, it’s intuitive overall but there’s just weirdly some people out there that are all or nothing, and don’t understand “right tool for the job” lol

      • @[email protected]
        link
        fedilink
        English
        14 hours ago

        What’s funny is that Relational Databases in fact sucks when somewhat complex Relations are involved. Moment you step out the of the realm of Tabular data you’ll have very miserable time. Like good luck modeling and querying simple nested product catalog.

        Graph databases are better choice for truly relational data

    • @[email protected]
      link
      fedilink
      English
      34 hours ago

      I wouldn’t even consider “per minute” frequently queried. Per millisecond for stock market shit

    • @athairmor
      link
      76 hours ago

      To nitpick, Census data is heavily queried. They use Oracle now, I believe.