• @[email protected]
    link
    fedilink
    English
    52 hours ago

    Every one who bought the 7900xtx laughing their arse off running 20GiB models with MUCH better performance than a 4080/4080Super lol

    • Eager Eagle
      link
      English
      246 hours ago

      I bet he just wants a card to self host models and not give companies his data, but the amount of vram is indeed ridiculous.

      • Jeena
        link
        fedilink
        English
        105 hours ago

        Exactly, I’m in the same situation now and the 8GB in those cheaper cards don’t even let you run a 13B model. I’m trying to research if I can run a 13B one on a 3060 with 12 GB.

        • @[email protected]
          link
          fedilink
          English
          248 minutes ago

          I’m running deepseek-r1:14b on a 12GB rx6700. It just about fits in memory and is pretty fast.

        • The Hobbyist
          link
          fedilink
          English
          54 hours ago

          You can. I’m running a 14B deepseek model on mine. It achieves 28 t/s.

          • @levzzz
            link
            English
            118 minutes ago

            You need a pretty large context window to fit all the reasoning, ollama forces 2048 by default and more uses more memory

          • Jeena
            link
            fedilink
            English
            54 hours ago

            Oh nice, that’s faster than I imagined.

          • @[email protected]
            link
            fedilink
            English
            12 hours ago

            I also have a 3060, can you detail which framework (sglang, ollama, etc) you are using and how you got that speed? i’m having trouble reaching that level of performance. Thx

  • @[email protected]
    link
    fedilink
    English
    74 hours ago

    How could Snowden get a hold of one of these in Russia? Maybe through an intermediary in Kazakhstan?

    Then again it’s hard finding one here even in the US since they all went out of stock within 5 minutes of being listed.

    • @[email protected]
      link
      fedilink
      English
      153 hours ago

      According to russian over at r/hardware GPUs have become cheaper in Russia since the ban as they are now being smuggled instead of imported via Europe with all extra cost that implies.

  • @deleted
    link
    English
    105 hours ago

    I legit tried to understand how a lackluster VRAM capacity could spy on us.

  • John Richard
    link
    English
    15
    edit-2
    5 hours ago

    The video card monopoly (but also other manufacturers) have been limiting functionality for a long time. It started with them restricting vGPU to enterprise garbage products, which allows Linux users to virtualize their GPU for things like playing games with near-native speeds using Windows on Linux. This is one of the big reasons Windows still has such a large marketshare as the main desktop OS.

    Now they want to restrict people running AI locally so that they get stuck with crap like Copilot-enabled PCs or whatever dumb names they want to come up. These actions are intentional. It is anti-consumer & anti-trust, but don’t expect our government to care or do anything about it.

    • MudMan
      link
      fedilink
      65 hours ago

      So to put the likelihood of this in perspective, let me just repeat it to see if I understand the claim.

      You’re saying that one of the big reasons of Windows’ market share is how artificially inefficient it is to install Linux, spin up a Virtual Machine, run Windows inside THAT and then run a game?

      That’s the mainstream use case that is propping up Windows adoption in this scenario?

      • John Richard
        link
        English
        05 hours ago

        The main thing propping up Windows as the main OS (meaning it is running at the root layer) is exclusive hardware GPU support which is used for gaming & many apps. Otherwise, automating running Windows apps & Windows on Linux would have become much more mainstream.

        • MudMan
          link
          fedilink
          114 hours ago

          This is demonstrably wrong on a scale where it loops around to becoming hard to explain, so that’s a neat trick.

          There are enough people who have never heard of or don’t understand the concept of virtual machines to keep Windows as the biggest mainstream OS several times over. There isn’t a “root layer” in computers as far as normal humans are concerned. They’re computers and then a Windows pops up and that’s how that works.

          At the very most, they understand conversion layers on the basis of having gone from an old Macbook to a new Macbook, and even that is like a tenth of the market (still several times bigger than Linux adoption, though).

          The idea that a mass of people are waiting on the sidelines, chomping at the bit for direct GPU access through an extra layer of software fine tuning to be able to run some brand name Windows app with no Linux version is absurd. Even games are not the problem, as evidenced by that being mostly solved via Proton and not changing much.

          I don’t mind either way, but man, consider what other assumptions you may be making that are wildly off, particularly if they’re on something more important than your hopes for relative OS market share on home computers.

  • no banana
    link
    English
    95 hours ago

    What the fuck is going on with the world

  • MudMan
    link
    fedilink
    65 hours ago

    Wait, did the guy refuse to call AMD’s 9070 by its official name out of spite there at the end? Is this a weird tech The Onion thing?

    • @[email protected]
      link
      fedilink
      English
      22 hours ago

      AMD, as usual, misses an opportunity here. The 5xxx series is exactly Fermi again (they even removed hotspot data so reviewers would miss the throttling). AMD could leverage the nostalgia of one of ATI’s best gens and call the cards 9700 and 9700pro. Damn, those were the days. (since 4750 conflicts with current scheme)

    • Eager Eagle
      link
      English
      3
      edit-2
      5 hours ago

      Probably a mistake, considering the current generation follows the RX 7_00 naming pattern.

      • MudMan
        link
        fedilink
        15 hours ago

        What type of news editor for a hardware review outlet gets that wrong? That’s as weird as the Snowden thing. If you have that job you’ve surely been joking about AMD’s shameful “we just want to use the same name as Nvidia” thing for ages by now. This thing is so surreal.

  • @800XL
    link
    English
    -224 hours ago

    Shut the fuck up, Snowden.You had everyone behind you until you defected to Russia. There’s no free lunch and you had a lot of info Putin would like to have. Oddly enough things really started getting bad shortly therafter.

    • @[email protected]
      link
      fedilink
      English
      253 hours ago

      He was being chasen by the US government, and Assange proved that being in an US allied country will still get your arrested/tortured. What other options did Snowden had other than escaping to Russia?

      IMO don’t hate the player, hate the game.

    • @[email protected]
      link
      fedilink
      English
      72 hours ago

      Think of it from Snowdens perspective. You get to choose: either be tortured for the rest of your life, or chill in Russia and pretend Putin is a nice guy. I know what I’d pick.

      • Alphane Moon
        link
        English
        31 hour ago

        He is not simply pretending Putin is a nice guy, he is clearly collaborating with russian security services. Just look at his comments on internal US politics. And he also was spreading misinformation that russia wasn’t going to invade Ukraine in Feb 2022.

        He might be a hero for many, but if you’re Ukrainian (like I am), he is clearly a piece of shit.