DeepSeek v3.1 is not having a moment

(thezvi.wordpress.com)

38 points | by speckx 10 hours ago

4 comments

  • esafak 4 hours ago
    Because it did not top the open source leaderboard on any benchmark, except the agent one maybe. The hosted versions are not currently cheaper or faster than the other open source models, either.
  • dang 4 hours ago
    Recent and related:

    DeepSeek-v3.1 - https://news.ycombinator.com/item?id=44976764 - Aug 2025 (253 comments)

  • varsketiz 5 hours ago
    From the perspective of China, it probably makes sense to try and train on local chips and try to dethrone Nvidia. I guess this means the PRC thinks AGI isn't around the corner and they can catch up on hardware.
    • gchamonlive 4 hours ago
      It also seems reasonable to me to think AGI isn't around the corner, given how much current AI technology has failed on all fronts to deliver anything both general and intelligent.
      • mikae1 4 hours ago
        Not even Altman thinks AGI is around the corner. It keeps the hype and money flow alive though.
        • dingnuts 4 hours ago
          is that why he's talking up Dyson Spheres in interviews? the guy is a lunatic and conman, either completely insane or evil, no other option. here's the stupid quote:

          Sam Altman: I do guess that a lot of the world gets covered in data centers over time.

          Theo Von: Do you really?

          Altman: But I don’t know, because maybe we put them in space. Like, maybe we build a big Dyson sphere around the solar system and say, “Hey, it actually makes no sense to put these on Earth.”

          • delichon 4 hours ago
            Why is that wrong? If, like Altman, you think that energy is the bottleneck to intelligence, and social and economic power grows with intelligence, then predicting that intelligence will optimize for energy collection seems reasonable. It isn't evil to predict that. And if he is insane to predict it, then I must be insane for not dismissing it.

            Cassandra wasn't evil or crazy, she just had bad news.

            • klipklop 3 hours ago
              But there is no proof more energy == more intelligence. In some areas I am smarter than the best ChatGPT model and my energy source is Taco Bell double deckers. Clearly there is a lot of low-hanging fruit for efficiency before needing to encompass the entire sun and suck it dry of energy. It's an absurd thing to suggest. It's exactly the type of thing a conman would suggest: something cool, fantastic and completely impossible to actually implement.
            • MobiusHorizons 3 hours ago
              Because we are talking about likely outcomes, not optimizing for one thing to the exclusion of all else. Even if AGI is right around the corner (which is a pretty low-percentage bet these days), cost alone would prevent such an outcome from being likely. Altman knows this, but being reasonable rarely sells.
            • janalsncm 3 hours ago
              If he really thinks the shortest path to building a synthetic brain is to build an entire Dyson sphere, I would submit his bottleneck is the algorithm, not energy.
          • rpdillon 3 hours ago
            I think he's just thinking about a longer timeline.
            • ares623 3 hours ago
              i.e. A prophet for profit
    • esafak 4 hours ago
      They're already making the robots to run these models, which are the complements they are commoditizing.
  • yahoozoo 4 hours ago
    This blog is unreadable
    • jsnell 2 hours ago
      No space between paragraphs and blockquotes definitely hurts, and the subheading formatting is clearly not right. But the same content is posted on multiple sites. Maybe you'd find e.g. the Substack formatting more readable?

      https://thezvi.substack.com/p/deepseek-v31-is-not-having-a-m...

    • urbandw311er 4 hours ago
      This blog is excellent in general, particularly the AI posts. Yes, they go extremely deep in places, but the author freely admits that they are designed to be read selectively or skimmed in places.
      • yahoozoo 3 hours ago
        I mean it’s just a huge blob of text with emojis
        • black_puppydog 3 hours ago
          Jup, shame really. Just a little bit of better formatting and typesetting could really go a long way there. :)